What's New

 corpus 
corpus
Description:
ENGLISH: Talrómur 4 is a speech corpus containing recordings of children's voices. Three children at the age of 10, two girls and one boy, were recorded in four to five sessions each. The corpus consists of 2,881 audio ...
 This item contains no files.
 corpus 
corpus
Description:
[English] This is a JSONL version of the 2024 release of the Icelandic Gigaword Corpus (IGC), prepared for language model training. The archive contains training and validation sets of unannotated documents from the ...
 This item contains 2 files (2.48 GB).
 
Publicly Available
 corpus 
corpus
Description:
[English] This is a JSONL version of the 2024 release of the Icelandic Gigaword Corpus (IGC), prepared for language model training. The archive contains training and validation sets of unannotated, CC-BY-licensed documents ...
 This item contains 2 files (2.01 GB).
 
Publicly Available