What's New

languageDescription
Description:
Icegrams is a Python 3 package that contains a large collection of trigrams for Icelandic. The collection holds about 14 million distinct trigrams together with frequency information. The entire collection has been compressed down to approximately 43 megabytes ...
 This item contains 2 files (149.38 KB).
 
Publicly Available
corpus
Description:
GreynirCorpus is a parsed corpus containing 7 million sentences, mostly from news texts, that have been fully parsed with an automatic rule-based parser. The corpus also includes a gold standard with 2,610 manually parsed ...
 This item contains 2 files (1.52 GB).
 
Publicly Available
toolService
Description:
A Part-of-Speech (PoS) tagger for Icelandic. In this submission, you will find ABLTagger v1.0.0. This is a PoS tagger that works with the revised tagset and achieves an accuracy of 95.59% on MIM-Gold (cross-validation). ...
 This item contains 5 files (8.58 GB).
 
Publicly Available

Most Viewed Items

Top Last Week
toolService
Description:
A Part-of-Speech (PoS) tagger for Icelandic. In this submission, you will find ABLTagger v1.0.0. This is a PoS tagger that works with the revised tagset and achieves an accuracy of 95.59% on MIM-Gold (cross-validation). ...
 This item contains 5 files (8.58 GB).
 
Publicly Available
toolService
Description:
Tokenizer is a compact pure-Python (2 and 3) executable program and module for tokenizing Icelandic text. It converts input text to streams of tokens, where each token is a separate word, punctuation sign, number/amount, ...
 This item contains 1 file (239.62 KB).
toolService
Description:
IceNLP is an open source Natural Language Processing (NLP) toolkit for analyzing and processing Icelandic text. The toolkit is implemented in Java.
 This item contains 2 files (14.17 MB).
 
Publicly Available