| dc.contributor.author |
Nikulásdóttir, Anna Björk |
| dc.contributor.author |
Schnell, Daniel |
| dc.date.accessioned |
2026-04-10T12:20:04Z |
| dc.date.available |
2026-04-10T12:20:04Z |
| dc.date.issued |
2026-04-10 |
| dc.identifier.uri |
http://hdl.handle.net/20.500.12537/383 |
| dc.description |
ENGLISH:
Talrómur 4 is a speech corpus of children's voices. Three 10-year-old children, two girls and one boy, were each recorded in four to five sessions. The corpus consists of 2,881 audio clips of varying length, from one-word utterances to paragraphs of up to 50 seconds. Each recording is accompanied by its text. The audio was recorded at a 48 kHz sample rate and 24-bit depth, and each audio file is stored in .flac format. In addition to the audio recordings, the corpus includes Voice Activity Detection (VAD) values for each utterance, obtained using OmniVAD.
The corpus is not openly available, but access can be granted for research and development of children's TTS voices under a restrictive license from the University of Iceland. Please contact the project's contact person for further information.
ÍSLENSKA:
Talrómur 4 er talgagnasafn með upptökum á barnaröddum. Þrjú tíu ára börn, tvær stúlkur og einn drengur, voru tekin upp í fjórum til fimm upptökulotum hvert. Gagnasafnið inniheldur 2.881 upptökur af mismunandi lengd, frá einu orði upp í lengri málsgreinar allt að 50 sekúndur að lengd. Texti fylgir hverri upptöku.
Hljóðskrárnar voru teknar upp í 48 kHz og með 24 bita dýpt. Skrárnar eru geymdar á .flac sniði.
Auk hljóðskránna inniheldur þessi útgáfa raddvirknimerkingar (Voice Activity Detection values) fyrir hverja segð, fengnar með OmniVAD.
Gagnasafnið er ekki opið en hægt er að fá aðgang að því til rannsókna og þróunar á barna-talgervilsröddum samkvæmt leyfi frá Háskóla Íslands. Vinsamlegast hafið samband við tengilið verkefnisins fyrir frekari upplýsingar. |
| dc.language.iso |
isl |
| dc.publisher |
Grammatek ehf |
| dc.publisher |
University of Iceland |
| dc.subject |
TTS |
| dc.subject |
Speech |
| dc.subject |
Children |
| dc.title |
Talrómur 4 (26.04) |
| dc.type |
corpus |
| metashare.ResourceInfo#ContentInfo.mediaType |
audio |
| has.files |
no |
| branding |
Clarin IS Repository |
| contact.person |
Anna Björk Nikulásdóttir, anna@grammatek.com, Grammatek ehf |
| contact.person |
Iris Edda Nowenstein, irisen@hi.is, University of Iceland |
| sponsor |
Ministry of Culture, Innovation and Higher Education; Recordings of Children's Voices for TTS; Language Technology for Icelandic II; nationalFunds |
| size.info |
4 hours |
| files.size |
0 |
| files.count |
0 |