What's New

 toolService 
toolService
Description:
A python package that punctuates Icelandic text. The input data is unpunctuated text and punctuated text is returned. The user can choose between three different punctuation models, a BERT-based Transformer, a seq2seq ...
 This item contains 3 files (106.52 MB).
 
Publicly Available
 toolService 
toolService
Description:
Moses phrase-based statistical machine translation (Moses PBSMT) is a system which is used to develop and run machine translation models. It is distributed here as four packages: 1. Code from a github repository to train ...
 This item contains 4 files (10.98 GB).
 
Publicly Available

Most Viewed Items

Top Last Week
 corpus 
corpus
Description:
This Icelandic named entity (NE) corpus, MIM-GOLD-NER, is a version of the MIM-GOLD corpus tagged for NEs. Over 48 thousand NEs are tagged in this corpus of one million tokens, which can be used for training named entity ...
 This item contains 13 files (8.53 MB).
 
Publicly Available
 corpus 
corpus
Description:
GreynirCorpus contains 7 million sentences that have been parsed into full constituency trees by an automatic rule-based parser. It also contains a gold standard corpus of 2,610 hand annotated parse trees. GreynirCorpus ...
 This item contains 2 files (1.52 GB).
 
Publicly Available
 corpus 
corpus
Description:
The Icelandic Contemporary Corpus (IceConTree) is a machine-parsed treebank parsed according to the IcePaHC annotation scheme. It consists of texts from the Icelandic Gigaword Corpus, parsed using the IceNeuralParsingPipeline. ...
 This item contains 1 file (3.92 GB).
 
Publicly Available