What's New

 corpus 
corpus
Description:
The Icelandic Confusion Set Corpus (ICoSC) is available under a CC-BY licence. It was compiled during the course of three months in 2019 by Steinunn Rut Friðriksdóttir and Anton Karl Ingason of the language technology ...
 This item contains 1 file (214.9 MB).
 
Publicly Available
 lexicalConceptualResource 
lexicalConceptualResource
Description:
The DMII Core contains the core vocabulary of current Icelandic, i.e., common non-domain specific words, and a selection of named Icelandic entities, i.e., personal names, common place names, and a few names of important ...
 This item contains no files.
 toolService 
toolService
Description:
Tokenizer is a compact pure-Python (2 and 3) executable program and module for tokenizing Icelandic text. It converts input text to streams of tokens, where each token is a separate word, punctuation sign, number/amount, ...
 This item contains 1 file (239.62 KB).

Most Viewed Items

Top Last Week
 corpus 
corpus
Description:
The Icelandic Confusion Set Corpus (ICoSC) is available under a CC-BY licence. It was compiled during the course of three months in 2019 by Steinunn Rut Friðriksdóttir and Anton Karl Ingason of the language technology ...
 This item contains 1 file (214.9 MB).
 
Publicly Available
 toolService 
toolService
Description:
Tokenizer is a compact pure-Python (2 and 3) executable program and module for tokenizing Icelandic text. It converts input text to streams of tokens, where each token is a separate word, punctuation sign, number/amount, ...
 This item contains 1 file (239.62 KB).
 lexicalConceptualResource 
lexicalConceptualResource
Description:
The DMII Core contains the core vocabulary of current Icelandic, i.e., common non-domain specific words, and a selection of named Icelandic entities, i.e., personal names, common place names, and a few names of important ...
 This item contains no files.