| dc.contributor.author |
Helgadóttir, Sigrún |
| dc.date.accessioned |
2020-06-11T13:46:11Z |
| dc.date.available |
2020-06-11T13:46:11Z |
| dc.date.issued |
2012 |
| dc.identifier.uri |
http://hdl.handle.net/20.500.12537/34 |
| dc.description |
Training and test sets for POS tagging, derived from IFD 2012.11 (Icelandic Frequency Dictionary, Orðtíðnibók), which contains fragments of 100 texts published between 1980 and 1989.
To create the training/test pairs, each of the 100 texts that constitute the corpus was divided into ten roughly equal parts. Each of these ten parts forms one test set, and the corresponding training set contains the other nine parts. |
| dc.language.iso |
isl |
| dc.publisher |
The Árni Magnússon Institute for Icelandic Studies |
| dc.relation.isreplacedby |
http://hdl.handle.net/20.500.12537/37 |
| dc.rights |
Icelandic Frequency Dictionary |
| dc.rights.uri |
https://repository.clarin.is/repository/xmlui/page/license-frequency-dictionary |
| dc.rights.label |
PUB |
| dc.source.uri |
http://www.malfong.is/index.php?lang=en&pg=ordtidnibok |
| dc.subject |
test sets |
| dc.subject |
training sets |
| dc.subject |
lemmatized |
| dc.subject |
pos-tagged |
| dc.title |
Icelandic Frequency Dictionary 2012.11 - training/testing sets |
| dc.type |
corpus |
| metashare.ResourceInfo#ContentInfo.mediaType |
text |
| has.files |
yes |
| branding |
Clarin IS Repository |
| contact.person |
Steinþór Steingrímsson steinthor.steingrimsson@arnastofnun.is The Árni Magnússon Institute for Icelandic Studies |
| size.info |
20 files |
| size.info |
590299 tokens |
| size.info |
519180 words |
| size.info |
36912 sentences |
| files.size |
17922223 |
| files.count |
1 |
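
The splitting scheme described in dc.description can be sketched as follows. This is a minimal illustration and not the script used to build the distributed files: the function names `split_into_parts` and `make_pairs` are hypothetical, and treating each text as a list of sentences cut into contiguous chunks is an assumption, since the record does not say at which unit or in which order the texts were divided. Ten pairs of this kind would be consistent with the 20 files listed under size.info.

```python
from typing import List, Sequence, Tuple


def split_into_parts(items: Sequence[str], n_parts: int = 10) -> List[List[str]]:
    """Cut one text into n_parts contiguous chunks whose sizes differ by at most one."""
    base, extra = divmod(len(items), n_parts)
    parts: List[List[str]] = []
    start = 0
    for i in range(n_parts):
        # The first `extra` chunks take one additional item each.
        size = base + (1 if i < extra else 0)
        parts.append(list(items[start:start + size]))
        start += size
    return parts


def make_pairs(
    texts: Sequence[Sequence[str]], n_parts: int = 10
) -> List[Tuple[List[str], List[str]]]:
    """Build n_parts (training, test) pairs.

    Test set k collects the k-th chunk of every text; the matching
    training set collects the remaining chunks of every text.
    """
    split_texts = [split_into_parts(text, n_parts) for text in texts]
    pairs: List[Tuple[List[str], List[str]]] = []
    for k in range(n_parts):
        train: List[str] = []
        test: List[str] = []
        for parts in split_texts:
            for i, part in enumerate(parts):
                (test if i == k else train).extend(part)
        pairs.append((train, test))
    return pairs


if __name__ == "__main__":
    # Toy stand-in for the corpus: 3 texts of 25 "sentences" each (assumed data).
    toy_texts = [[f"text{t}-sent{s}" for s in range(25)] for t in range(3)]
    for k, (train, test) in enumerate(make_pairs(toy_texts), start=1):
        print(f"pair {k:02d}: {len(train)} training items, {len(test)} test items")
```

Because every text contributes one chunk to each test set, each pair covers all 100 texts in both its training and its test portion, and every sentence appears in exactly one test set.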