What's New

 corpus 
corpus
Description:
This test set contains sentences for intelligibility testing of a TTS system. It is a set of 50 sentences where each sentence occurs twice: once in its correct version and once containing one spelling error. Half of the ...
 This item contains 1 file (4.27 MB).
 
Publicly Available
 lexicalConceptualResource 
lexicalConceptualResource
Description:
The Icelandic Pronunciation Dictionary contains manually revised transcriptions in four pronunciation variants of Icelandic: the standard pronunciation, the northern post-aspiration variant ("harðmæli"), the north-eastern ...
 This item contains 1 file (3.42 MB).
 
Publicly Available
 corpus 
corpus
Description:
This release of data from the Samrómur collection focuses on queries. It contains 17,475 (20 hours) of validated speech-recordings in Icelandic. The corpus is a result of the crowd-sourcing effort run by the Language and ...
 This item contains 1 file (984.23 MB).
 
Publicly Available

Most Viewed Items

Top Last Week
 corpus 
corpus
Description:
The Icelandic Gigaword corpus (IGC) is a tagged and lemmatized corpus. The 20.05 version consists of approximately 1,532 million running words of text. Each running word is accompanied by a morphosyntactic tag and lemma ...
 This item contains 1 file (10.31 GB).
 
Publicly Available
 lexicalConceptualResource 
lexicalConceptualResource
Description:
The Icelandic Pronunciation Dictionary contains manually revised transcriptions in four pronunciation variants of Icelandic: the standard pronunciation, the northern post-aspiration variant ("harðmæli"), the north-eastern ...
 This item contains no files.
 languageDescription 
languageDescription
Description:
"Icelandic Language Models with Pronunciations 22.01" is a set of four n-gram language models in ARPA format and a pronunciation dictionary containing all the words in the language models. „Íslensk mállíkön með ...
 This item contains 1 file (2.8 GB).
 
Publicly Available