dc.contributor.author |
Jasonarson, Atli |
dc.contributor.author |
Steingrímsson, Steinþór |
dc.contributor.author |
Sigurðsson, Einar Freyr |
dc.contributor.author |
Daðason, Jón Friðrik |
dc.date.accessioned |
2022-09-27T14:05:47Z |
dc.date.available |
2022-09-27T14:05:47Z |
dc.date.issued |
2022-09-08 |
dc.identifier.uri |
http://hdl.handle.net/20.500.12537/272 |
dc.description |
ENGLISH:
This Universal Dependencies parser for Icelandic was trained with COMBO on IcePaHC and UD_Icelandic-Modern. The latter treebank was revised before training to remove duplicate sentences. The parser uses information from an ELECTRA language model (https://huggingface.co/jonfd/electra-base-igc-is). It achieves a UAS (unlabeled attachment score) of 89.13 and a LAS (labeled attachment score) of 85.97. |
dc.description |
ICELANDIC:
Þessi UD-þáttari var þjálfaður með COMBO á IcePaHC og UD_Icelandic-Modern en síðarnefnda málheildin var uppfærð fyrir þjálfun tólsins, þar sem fjarlægðar voru úr henni endurteknar setningar. Þáttarinn nýtir sér upplýsingar úr ELECTRA-mállíkani. Hann skorar 89.13 á UAS (unlabeled attachment score) og 85.97 á LAS (labeled attachment score).
COMBO: https://gitlab.clarin-pl.eu/syntactic-tools/combo/
IcePaHC: https://github.com/UniversalDependencies/UD_Icelandic-IcePaHC/
UD_Icelandic-Modern: https://github.com/UniversalDependencies/UD_Icelandic-Modern/
electra-base-igc-is: https://huggingface.co/jonfd/electra-base-igc-is |
dc.language.iso |
isl |
dc.publisher |
The Árni Magnússon Institute for Icelandic Studies |
dc.relation.isreplacedby |
http://hdl.handle.net/20.500.12537/301 |
dc.rights |
Apache License 2.0 |
dc.rights.uri |
https://opensource.org/license/apache2-0-php/ |
dc.rights.label |
PUB |
dc.subject |
universal dependencies |
dc.subject |
parsing |
dc.title |
COMBO-based UD Parser 22.10 |
dc.type |
toolService |
metashare.ResourceInfo#ContentInfo.detailedType |
tool |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent |
true |
has.files |
yes |
branding |
Clarin IS Repository |
contact.person |
Steinþór Steingrímsson steinthor.steingrimsson@arnastofnun.is The Árni Magnússon Institute for Icelandic Studies |
sponsor |
Ministry of Education, Science and Culture (Mennta- og menningarmálaráðuneytið) I5 – Parsers Language Technology for Icelandic 2019-2023 nationalFunds |
files.size |
453263616 |
files.count |
1 |
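
The UAS and LAS figures quoted in the description are attachment scores: the percentage of tokens whose predicted head matches the gold head (UAS), and the percentage whose head and dependency relation both match (LAS). The following is a minimal sketch of that computation, not the evaluation script used for this parser. The function name `attachment_scores` and the `(head, deprel)` token representation are assumptions made for illustration, and the sketch assumes gold and predicted tokenization are identical.

```python
from typing import List, Tuple

# Hypothetical token representation: (head index, dependency relation).
Token = Tuple[int, str]


def attachment_scores(gold: List[Token], pred: List[Token]) -> Tuple[float, float]:
    """Return (UAS, LAS) as percentages over aligned gold/predicted tokens."""
    if len(gold) != len(pred):
        raise ValueError("gold and predicted token sequences must be the same length")
    if not gold:
        raise ValueError("cannot score an empty token sequence")

    # UAS counts tokens whose predicted head is correct.
    uas_hits = sum(g[0] == p[0] for g, p in zip(gold, pred))
    # LAS counts tokens whose head and relation label are both correct.
    las_hits = sum(g == p for g, p in zip(gold, pred))

    n = len(gold)
    return 100.0 * uas_hits / n, 100.0 * las_hits / n


# Toy four-token sentence: one wrong label, one wrong head.
gold = [(2, "nsubj"), (0, "root"), (2, "obj"), (2, "punct")]
pred = [(2, "nsubj"), (0, "root"), (2, "obl"), (3, "punct")]
uas, las = attachment_scores(gold, pred)
print(f"UAS={uas:.2f} LAS={las:.2f}")  # UAS=75.00 LAS=50.00
```

Standard evaluation scripts may differ in details such as how they align tokens and treat relation subtypes, so numbers from this sketch are not guaranteed to match published scores.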