| dc.contributor.author |
Jasonarson, Atli |
| dc.contributor.author |
Steingrímsson, Steinþór |
| dc.contributor.author |
Sigurðsson, Einar Freyr |
| dc.contributor.author |
Daðason, Jón Friðrik |
| dc.date.accessioned |
2022-09-27T14:05:47Z |
| dc.date.available |
2022-09-27T14:05:47Z |
| dc.date.issued |
2022-09-08 |
| dc.identifier.uri |
http://hdl.handle.net/20.500.12537/272 |
| dc.description |
ENGLISH:
This Universal Dependencies parser for Icelandic was trained with COMBO on IcePaHC and UD_Icelandic-Modern, the latter one having been revised before training, as some duplicate sentences had to be removed. It utilizes information from an ELECTRA language model (https://huggingface.co/jonfd/electra-base-igc-is). Its UAS (unlabeled attachment score) is 89.13 and its LAS (labeled attachment score) is 85.97. |
| dc.description |
ICELANDIC:
Þessi UD-þáttari var þjálfaður með COMBO á IcePaHC og UD_Icelandic-Modern en síðarnefnda málheildin var uppfærð fyrir þjálfun tólsins, þar sem fjarlægðar voru úr henni endurteknar setningar. Þáttarinn nýtir sér upplýsingar úr ELECTRA-mállíkani. Hann skorar 89.13 á UAS (unlabeled attachment score) og 85.97 á LAS (labeled attachment score).
COMBO: https://gitlab.clarin-pl.eu/syntactic-tools/combo/
IcePaHC: https://github.com/UniversalDependencies/UD_Icelandic-IcePaHC/
UD_Icelandic-Modern: https://github.com/UniversalDependencies/UD_Icelandic-Modern/
electra-base-igc-is: https://huggingface.co/jonfd/electra-base-igc-is |
| dc.language.iso |
isl |
| dc.publisher |
The Árni Magnússon Institute for Icelandic Studies |
| dc.relation.isreplacedby |
http://hdl.handle.net/20.500.12537/301 |
| dc.rights |
Apache License 2.0 |
| dc.rights.uri |
https://opensource.org/license/apache2-0-php/ |
| dc.rights.label |
PUB |
| dc.subject |
universal dependencies |
| dc.subject |
parsing |
| dc.title |
COMBO-based UD Parser 22.10 |
| dc.type |
toolService |
| metashare.ResourceInfo#ContentInfo.detailedType |
tool |
| metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent |
true |
| has.files |
yes |
| branding |
Clarin IS Repository |
| contact.person |
Steinþór Steingrímsson steinthor.steingrimsson@arnastofnun.is The Árni Magnússon Institute for Icelandic Studies |
| sponsor |
Ministry of Education, Science and Culture (Mennta- og menningamálaráðuneytið) I5 – Parsers Language Technology for Icelandic 2019-2023 nationalFunds |
| files.size |
453263616 |
| files.count |
1 |