What's New
toolService
Description:
ENGLISH:
This project provides an OpenAI Whisper-compatible ASR service with automatic language detection and optimized Icelandic speech-to-text. The Icelandic models used are trained by Language and Voice Lab at the ...
This item contains 1 file (670.21
KB).
Publicly Available
corpus
Description:
ENGLISH:
Talrómur 4 is a speech corpus containing recordings of children's voices. Three children at the age of 10, two girls and one boy, were recorded in four to five sessions each. The corpus consists of 2,881 audio ...
This item contains no files.
corpus
Description:
[English]
This is a JSONL version of the 2024 release of the Icelandic Gigaword Corpus (IGC), prepared for language model training. The archive contains training and validation sets of unannotated documents from the ...
This item contains 2 files (2.48
GB).
Publicly Available