• Home
  • Repository
  • About CLARIN-IS
  • CLARIN
  •  Login
  • English íslenska
  • CLARIN-IS Repository Home
  • View Item
  •  
  •   What can you do?
  •   Browse  
    •    All of the Repository  
      •   Issue Date
      •   Authors
      •   Titles
      •   Subjects
      •   Publisher
      •   Language
      •   Type
      •   Rights Label
  •   My Account  
    •    Login
  •   Statistics  
    •    StatisticsBETA
  •   General Information  
    •    Deposit
    •    Cite
    •    Submission Lifecycle
    •    FAQ
    •    About
    •    Help Desk
 
 

Talrómur 4 (26.04)

 
Clarin IS Repository
  Authors
Nikulásdóttir, Anna Björk and Schnell, Daniel
  Item identifier
http://hdl.handle.net/20.500.12537/383
 Date issued
2026-04-10
 Type
audio, corpus
 Size
4 hours
 Language(s)
Icelandic
 Description
ENGLISH: Talrómur 4 is a speech corpus containing recordings of children's voices. Three children at the age of 10, two girls and one boy, were recorded in four to five sessions each. The corpus consists of 2,881 audio clips of various length, from one word utterances up to paragraphs of 50 seconds. Texts accompany each recording. The audio is recorded at 48 kHz sample rate and 24 bit depth. Each audio file is stored in .flac format. In addition to the audio recordings, this corpus includes Voice Activity Detection (VAD) values for each utterance, obtained using OmniVAD. The data is available for research and development of children's TTS voices under a restrictive license from University of Iceland. Please get in touch with contact person for further information. ÍSLENSKA: Talrómur 4 er talgagnasafn með upptökum á barnaröddum. Þrjú tíu ára börn, tvær stúlkur og einn drengur, voru tekin upp í fjórum til fimm upptökulotum hvert. Gagnasafnið inniheldur 2.881 upptökur af mismunandi lengd, frá einu orði upp í lengri málsgreinar allt að 50 sekúndur að lengd. Texti fylgir hverri upptöku. Hljóðskrárnar voru teknar upp í 48 kHz og með 24 bita dýpt. Skrárnar eru geymdar á .flac sniði Auk hljóðskránna inniheldur þessi útgáfa raddvirknimerkingar (Voice Activity Detection values) fyrir hverja segð, fengnar með OmniVAD. Gagnasafnið er ekki opið en hægt er að fá aðgang að því til rannsókna og þróunar á barna-talgervilsröddum samkvæmt leyfi frá Háskóla Íslands. Vinsamlegast hafið samband við tengilið verkefnisins fyrir frekari upplýsingar.
 Publisher
Grammatek ehf
 
University of Iceland
 Acknowledgement

Ministry of Culture, Innovation and Higher Education

Project code: Recordings of Children's Voices for TTS

Project name: Language Technology for Icelandic II

 Subject(s)
TTS Speech Children
 Collection(s)
Clarin IS
Show full item record
 
 

Partners, Coordination, Funding

  • Arni Magnusson Institute for Icelandic Studies
  • Ministry of Culture and Business Affairs

Repository

  • Main page
  • Submission Lifecycle
  • FAQ
  • About and Policies

More

  • CLARIN
  • META-Net

CLARIN-IS is fully supported by the Ministry of Culture and Business Affairs

Copyright (c) 2023. Arni Magnusson Institute for Icelandic Studies. All rights reserved.