Sýna einfalda færslu atriðis
dc.contributor.author |
Jasonarson, Atli |
dc.contributor.author |
Steingrímsson, Steinþór |
dc.contributor.author |
Sigurðsson, Einar Freyr |
dc.contributor.author |
Daðason, Jón Friðrik |
dc.date.accessioned |
2022-09-27T14:05:47Z |
dc.date.available |
2022-09-27T14:05:47Z |
dc.date.issued |
2022-09-08 |
dc.identifier.uri |
http://hdl.handle.net/20.500.12537/272 |
dc.description |
ENGLISH:
This Universal Dependencies parser for Icelandic was trained with COMBO on IcePaHC and UD_Icelandic-Modern, the latter one having been revised before training, as some duplicate sentences had to be removed. It utilizes information from an ELECTRA language model (https://huggingface.co/jonfd/electra-base-igc-is). Its UAS (unlabeled attachment score) is 89.13 and its LAS (labeled attachment score) is 85.97. |
dc.description |
ICELANDIC:
Þessi UD-þáttari var þjálfaður með COMBO á IcePaHC og UD_Icelandic-Modern en síðarnefnda málheildin var uppfærð fyrir þjálfun tólsins, þar sem fjarlægðar voru úr henni endurteknar setningar. Þáttarinn nýtir sér upplýsingar úr ELECTRA-mállíkani. Hann skorar 89.13 á UAS (unlabeled attachment score) og 85.97 á LAS (labeled attachment score).
COMBO: https://gitlab.clarin-pl.eu/syntactic-tools/combo/
IcePaHC: https://github.com/UniversalDependencies/UD_Icelandic-IcePaHC/
UD_Icelandic-Modern: https://github.com/UniversalDependencies/UD_Icelandic-Modern/
electra-base-igc-is: https://huggingface.co/jonfd/electra-base-igc-is |
dc.language.iso |
isl |
dc.publisher |
The Árni Magnússon Institute for Icelandic Studies |
dc.relation.isreplacedby |
http://hdl.handle.net/20.500.12537/301 |
dc.rights |
Apache License 2.0 |
dc.rights.uri |
https://opensource.org/license/apache2-0-php/ |
dc.rights.label |
PUB |
dc.subject |
universal dependencies |
dc.subject |
parsing |
dc.title |
COMBO-based UD Parser 22.10 |
dc.type |
toolService |
metashare.ResourceInfo#ContentInfo.detailedType |
tool |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent |
true |
has.files |
yes |
branding |
Clarin IS Repository |
contact.person |
Steinþór Steingrímsson steinthor.steingrimsson@arnastofnun.is The Árni Magnússon Institute for Icelandic Studies |
sponsor |
Ministry of Education, Science and Culture (Mennta- og menningamálaráðuneytið) I5 – Parsers Language Technology for Icelandic 2019-2023 nationalFunds |
files.size |
453263616 |
files.count |
1 |
Files in this item
This item is
Publicly Available
and licensed under:
Apache License 2.0
- Name
- combo-electra-dependency-parser.zip
- Size
- 432.27
MB
- Format
- application/zip
- Description
- combo-electra-dependency-parser
- MD5
- 491e7cf3aaa3cecedd0731a441cc0ac4
Download file
Preview
- combo-dependency-parser
- README.md2 kB
- test.py905 B
- requirements.txt29 B
- ud-transformer-parser
- config.json9 kB
- best.th466 MB
- vocabulary
- feats_labels.txt565 B
- non_padded_namespaces.txt12 B
- xpostag_labels.txt1 kB
- deprel_labels.txt231 B
- .lock0 B
- lemma_characters.txt231 B
- token_characters.txt260 B
- upostag_labels.txt77 B
- test_file.txt96 B
Sýna einfalda færslu atriðis