
 
dc.contributor.author Friðriksdóttir, Steinunn Rut
dc.contributor.author Daníelsson, Hjalti
dc.contributor.author Steingrímsson, Steinþór
dc.date.accessioned 2022-05-09T14:22:37Z
dc.date.available 2022-05-09T14:22:37Z
dc.date.issued 2022-05-06
dc.identifier.uri http://hdl.handle.net/20.500.12537/209
dc.description Word Embeddings - Word2Vec optimized for IceBATS 22.04 contains two word embedding models, induced from the IGC (http://hdl.handle.net/20.500.12537/192). The word embedding models are optimized to obtain a high average score as measured by IceBATS. One model is trained on lemmatized data and the other on unlemmatized data.
dc.description Word Embeddings - Word2Vec optimized for IceBATS 22.04 contains two word embedding models, trained on the IGC (RMH) (http://hdl.handle.net/20.500.12537/192). The models are trained with settings intended to give high average scores when they are run on the IceBATS test set. One model is trained on lemmatized data and the other on unlemmatized data.
dc.language.iso isl
dc.publisher The Árni Magnússon Institute for Icelandic Studies
dc.rights Creative Commons - Attribution 4.0 International (CC BY 4.0)
dc.rights.uri https://creativecommons.org/licenses/by/4.0/
dc.rights.label PUB
dc.source.uri http://embeddings.arnastofnun.is/
dc.subject word embeddings
dc.title Word Embeddings – Word2Vec optimized for IceBATS 22.04
dc.type toolService
metashare.ResourceInfo#ContentInfo.detailedType other
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent true
has.files yes
branding Clarin IS Repository
contact.person Steinþór Steingrímsson steinthor.steingrimsson@arnastofnun.is The Árni Magnússon Institute for Icelandic Studies
sponsor Ministry of Education, Science and Culture I2 Language Technology for Icelandic 2019-2023 nationalFunds
files.size 2640185036
files.count 1


 Files in this item

This item is Publicly Available and licensed under: Creative Commons - Attribution 4.0 International (CC BY 4.0)
Name word2vec_models.zip
Size 2.46 GB
Format application/zip
Description Unknown
MD5 5efaabfe93aa252b30d8c95fa001885b
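The size and MD5 checksum listed in this record can be used to check that a download completed intact. The following is a minimal sketch rather than an official procedure of the repository: the function names `md5_of_file` and `verify_download` are hypothetical, and the expected values are copied from the `files.size` and MD5 fields of this record.

```python
import hashlib
import os

# Values copied from this record (files.size and the MD5 field).
EXPECTED_MD5 = "5efaabfe93aa252b30d8c95fa001885b"
EXPECTED_SIZE = 2640185036  # bytes


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5=EXPECTED_MD5, expected_size=EXPECTED_SIZE):
    """Return True if the file matches the expected size and MD5 digest."""
    # Cheap size check first; hashing a multi-gigabyte file takes a while.
    if os.path.getsize(path) != expected_size:
        return False
    return md5_of_file(path) == expected_md5
```

Assuming the archive was saved under its listed name in the current directory, `verify_download("word2vec_models.zip")` should return `True` for an intact copy.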
 File Preview  
    • IGC_2021_lemmatized__350__13__9__5__0_05__1_vectors.kv.vectors.npy (1 GB)
    • IGC_2021_unlemmatized__200__20__1__5__0_02__0_vectors.kv.vectors.npy (1 GB)
    • READ.ME (1 kB)
    • IGC_2021_unlemmatized__200__20__1__5__0_02__0_vectors.kv (63 MB)
    • IGC_2021_lemmatized__350__13__9__5__0_05__1_vectors.kv (32 MB)
