dc.contributor.author | Fong, Judy Y |
dc.contributor.author | Gudnason, Jon |
dc.date.accessioned | 2021-05-27T21:08:21Z |
dc.date.available | 2021-05-27T21:08:21Z |
dc.date.issued | 2021-05-27 |
dc.identifier.uri | http://hdl.handle.net/20.500.12537/109 |
dc.description | English This archive contains files generated from the recipe in kaldi-speaker-diarization/v5/. Its contents should be placed in a similar directory type, with symbolic links to diarization/, sid/, steps/, etc. It was created when Kaldi's master branch was at git commit 321d3959dabf667ea73cc98881400614308ccbbb. v5 These models are trained on the Althingi Parliamentary Speech corpus available on malfong.is. It uses MFCCS, x-vectors, PLDA and AHC. The recipe uses the Icelandic Rúv-di corpus as two hold out sets for tuning parameters. The Icelandic Rúv-di corpus is currently not publicly available. Íslenska Þetta skjalasafn inniheldur skrár frá kaldi-speaker-diarization v5. Innihaldi skjalasafnsins ætti að setja í eins möppu, með hlekki (symlinks) á diarization, sid, steps, o.s.frv. Notast var við Kaldi af master grein og Git commit 321d3959dabf667ea73cc98881400614308ccbbb. v5 Þessi líkön eru þjálfuð á gagnasafninu Alþingisræður til talgreiningar sem er aðgengilegt á malfong.is. Þau nota MFCC, x-vigra, PLDA, og AHC. Uppskriftin notar RÚV-di gagnasafnið sem hold-out gagnasöfn til að stilla forsendur. Eins og er þá er RÚV-di gagnasafnið ekki aðgengilegt almenningi. |
dc.language.iso | isl |
dc.publisher | Reykjavik University |
dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ |
dc.rights.label | PUB |
dc.source.uri | https://github.com/cadia-lvl/kaldi-speaker-diarization |
dc.subject | diarization |
dc.subject | kaldi |
dc.subject | speaker diarization |
dc.subject | broadcast |
dc.subject | parliamentary |
dc.subject | althingi |
dc.subject | rúv-di speaker diarization |
dc.subject | rúv |
dc.title | RÚV-DI Speaker Diarization v5 models (21.05) |
dc.type | toolService |
metashare.ResourceInfo#ContentInfo.detailedType | tool |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent | false |
has.files | yes |
branding | Clarin IS Repository |
contact.person | Judy Fong judy@judyyfong.xyz Reykjavík University |
contact.person | Jon Gudnason jg@ru.is Reykjavik University |
sponsor | Ministry of Education, Science and Culture (Iceland) Dialects, acoustic analysis and speaker diarization (H14) Language Technology for Icelandic 2019-2023 nationalFunds |
files.size | 26247332 |
files.count | 1 |
Files in this item
This item is
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution 4.0 International (CC BY 4.0)
- Name
- ruvdi_v5.tar.gz
- Size
- 25.03 MB
- Format
- application/gzip
- Description
- RÚV-DI v5 models
- MD5
- 58423ead186652b47578be7522867594
- ruvdi_v5
- run.sh16 kB
- README.txt4 kB
- conf
- vad.conf55 B
- mfcc.conf270 B
- local
- exp
- xvector_nnet_1a
- final.raw14 MB
- nnet.config4 kB
- min_chunk_size3 B
- configs
- xconfig1 kB
- network.xconfig960 B
- xconfig.expanded.22 kB
- xconfig.expanded.12 kB
- final.config4 kB
- ref.raw14 MB
- vars43 B
- ref.config4 kB
- srand3 B
- extract.config43 B
- xvectors_ruvdi2
- transform.mat64 kB
- mean.vec1 kB
- plda130 kB
- xvectors_ruvdi1
- transform.mat64 kB
- mean.vec1 kB
- plda130 kB
- max_chunk_size4 B
- xvector_nnet_1a