What's New

 toolService 
toolService
Description:
A dockerized Named Entity Recognition (NER) API for Icelandic. It uses a ELECTRA-base language model, that has been fine tuned for NER using MIM-GOLD-NER. It achieves F1-score of ~91.9 on the test set for MIM-GOLD-NER. ...
 This item contains 1 file (389.74 MB).
 
Publicly Available
 lexicalConceptualResource 
lexicalConceptualResource
Description:
A single RDF file housing the Icelandic Wordweb in an LT-appropriate format. The Wordweb's features have been encoded with OntoLex and SKOS, with the new version designed in such a way as to replicate all core functionality ...
 This item contains 1 file (79.66 MB).
 
Publicly Available
 corpus 
corpus
Description:
[ENGLISH] IGC-Laws is a part of the IGC-project (Icelandic Gigaword corpus) that aims to collect as much as possible of Icelandic texts that can be published under an open or restricted license. IGC-Laws contains 1) the ...
 This item contains 1 file (597.31 MB).
 
Publicly Available

Most Viewed Items

Top Last Week
 corpus 
corpus
Description:
The Icelandic Contemporary Corpus (IceConTree) is a machine-parsed treebank parsed according to the IcePaHC annotation scheme. It consists of texts from the Icelandic Gigaword Corpus, parsed using the IceNeuralParsingPipeline. ...
 This item contains 1 file (3.92 GB).
 
Publicly Available
 corpus 
corpus
Description:
ParIce is an English-Icelandic parallel corpus. This is the first parallel corpus built for the purposes of language technology development and research for Icelandic. It includes 3.5 million translation segment pairs from ...
 This item contains 1 file (696.19 MB).
 
Publicly Available
 corpus 
corpus
Description:
The Icelandic Confusion Set Corpus (ICoSC) is available under a CC-BY licence. It was compiled during the course of 8 months by Steinunn Rut Friðriksdóttir and Anton Karl Ingason of the language technology department in ...
 This item contains 1 file (951.8 MB).
 
Publicly Available