Show simple item record

 
dc.contributor.author Jónsson, Haukur Páll
dc.contributor.author Loftson, Hrafn
dc.contributor.author Steingrímsson, Steinþór
dc.date.accessioned 2021-06-04T12:43:58Z
dc.date.available 2021-06-04T12:43:58Z
dc.date.issued 2021-06-01
dc.identifier.uri http://hdl.handle.net/20.500.12537/115
dc.description A Part-of-Speech (PoS) tagger for Icelandic. In this submission, you will find pretrained models for ABLTagger v3.0.0. In this submission we provide two versions, small and large, of PoS taggers that work with the revised tagset that achieve an accuracy of ~96.7% and ~97.8% on MIM-Gold (cross-validation, excluding "x" and "e" tags), respectively. For installation, usage, and other instructions see https://github.com/cadia-lvl/POS/releases/tag/m5 You should also check if a newer version is out (see README.md - versions) on CLARIN: - Model files ------------------------------------------------------------------------------------------- Markari fyrir íslensku. Í þessum pakka er ABLTagger v3.0.0. Í þessari útgáfu eru tvö forþjálfuð líkön, lítið og stórt, sem virka fyrir nýja markamengið og ná 96,7% og 97,8% nákvæmni á MÍM-Gull (krossprófanir, án "x" og "e" marka). Fyrir uppsetningar-, notenda- og aðrar leiðbeiningar sjá https://github.com/cadia-lvl/POS/releases/tag/m5 Einnig er gott að athuga þar hvort ný útgáfa sé komin út (sjá README.md - versions) Á CLARIN: - Gögn fyrir líkan
dc.language.iso isl
dc.publisher Reykjavik University
dc.rights Apache License 2.0
dc.rights.uri https://opensource.org/license/apache2-0-php/
dc.rights.label PUB
dc.source.uri https://github.com/cadia-lvl/POS
dc.subject pos-tagging
dc.title ABLTagger (PoS) - 3.0.0
dc.type toolService
metashare.ResourceInfo#ContentInfo.detailedType tool
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent true
has.files yes
branding Clarin IS Repository
contact.person Haukur Páll Jónsson haukurpalljonsson@gmail.com Reykjavik University
sponsor Ministry of Education, Science and Culture Support tools: Part-of-speech tagger (I4) Language Technology for Icelandic 2019-2023 nationalFunds
files.size 466082776
files.count 2


 Files in this item

 Download all files in item (444.49 MB)
This item is
Publicly Available
and licensed under:
Apache License 2.0
Icon
Name
pos.tar.gz
Size
50.32 MB
Format
application/gzip
Description
Unknown
MD5
ffeec63a142f0941ed6b9e899d529dc4
 Download file  Preview
 File Preview  
    • tokenizer_config.json73 B
    • model.pt53 MB
    • config.json379 B
    • vocab.txt254 kB
    • known_lemmas.txt567 kB
    • dictionaries.pickle14 kB
    • special_tokens_map.json112 B
    • hyperparamters.json1 kB
    • known_toks.txt1 MB
Icon
Name
pos-large.tar.gz
Size
394.17 MB
Format
application/gzip
Description
Unknown
MD5
b648ef18f22955e6152a8121af3da655
 Download file  Preview
 File Preview  
    • tokenizer_config.json73 B
    • model.pt424 MB
    • config.json485 B
    • vocab.txt253 kB
    • known_lemmas.txt567 kB
    • dictionaries.pickle14 kB
    • special_tokens_map.json112 B
    • hyperparamters.json1 kB
    • known_toks.txt1 MB

Show simple item record