Files in this item

This item is
Publicly Available
and licensed under:
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Icon
Name
README.txt
Size
3.83 KB
Format
Text file
Description
Unknown
MD5
bee3bc2439355fee77156ab87078562a
 Download file  Preview
 File Preview  
Icelandic broadcast speech About the Icelandic broadcast speech corpus --------------------------- The Icelandic broadcast speech corpus is 193 hours of radio and TV data from RÚV. The radio data consists of episodes of Spegillinn, morning news, evening news, Morgunútvarpið, Morgunvaktin and Samfélagið. The TV data consists of episodes of Kastljós. All the data is from episodes broadcast in the period the period from January 2020 to August 2021. The data contains 40,746 utterances from 1,360 speakers. The data is aligned and segmented, ready for ASR training. The data set includes both prompted speech (e.g. from the News) and conversational speech (e.g. Morgunvaktin and Kastljósið). This data set is published by RÚV, transcribed by Creditinfo and aligned at Reykjavik University with the help of Tiro's automatic speech recognizer. Special thanks to Tiro for supplying transcriptions with per-word timestamps from their automatic speech recognizer, which were essential in the alig . . .
Icon
Name
metadata.tsv
Size
14.31 MB
Format
Unknown
Description
Unknown
MD5
43c1705145d20497f3bad1560502206a
 Download file
Icon
Name
speaker_information.tsv
Size
49.51 KB
Format
Unknown
Description
Unknown
MD5
512657b60add3e71843b85ab0d7ce0c7
 Download file
Icon
Name
cut_audio_RELEASE.z01
Size
3.91 GB
Format
Unknown
Description
Unknown
MD5
6b3ab042fa8ad276fced07352676f406
 Download file
Icon
Name
cut_audio_RELEASE.z02
Size
3.91 GB
Format
Unknown
Description
Unknown
MD5
459be745d78ac230ed815eb5f00d3e06
 Download file
Icon
Name
cut_audio_RELEASE.z03
Size
3.91 GB
Format
Unknown
Description
Unknown
MD5
aa5de9a1fef84ae23f3782584d124c9b
 Download file
Icon
Name
cut_audio_RELEASE.z04
Size
3.91 GB
Format
Unknown
Description
Unknown
MD5
436e5ccac3187b72d021888739979eac
 Download file
Icon
Name
cut_audio_RELEASE.z05
Size
3.91 GB
Format
Unknown
Description
Unknown
MD5
141bdc0612d9d9cfcae1aafb553a507f
 Download file
Icon
Name
cut_audio_RELEASE.z06
Size
3.91 GB
Format
Unknown
Description
Unknown
MD5
420de02e960b61d27b7045a8a1c1c45c
 Download file
Icon
Name
cut_audio_RELEASE.zip
Size
2.31 GB
Format
application/zip
Description
Unknown
MD5
096ae66512323726f3b3ecaffd9673af
 Download file