dc.contributor.author |
Schnell, Daniel |
dc.date.accessioned |
2025-09-09T09:44:35Z |
dc.date.available |
2025-09-09T09:44:35Z |
dc.date.issued |
2025-09-04 |
dc.identifier.uri |
http://hdl.handle.net/20.500.12537/365 |
dc.description |
ENGLISH:
Revoxx - Speech Recording Application
Revoxx is a speech recording application designed for creating high-quality text-to-speech (TTS) datasets quickly and reliably. It grew out of the experience gained while recording Talrómur 3 (the Icelandic emotional speech dataset, http://hdl.handle.net/20.500.12537/344) and condenses those lessons into a streamlined tool that minimizes recording and post-processing time. The application automatically adjusts text size to the available screen space, provides separate recording-engineer and speaker views with multi-screen support (including Apple Sidecar for iPad), and maintains a complete archive of all raw recordings, including deleted takes. Key features include session-based recording organization with consistent audio settings and metadata across all recordings, automatic progress tracking, real-time mel spectrogram monitoring, industry-standard peak/RMS level presets, advanced search and navigation by label, emotion, or text, and batch export with optional voice timestamps based on voice activity detection (VAD). Revoxx supports both emotional and non-emotional recordings, making it well suited to creating diverse speech datasets.
ICELANDIC:
Revoxx - Upptökuforrit fyrir talgagnasöfn
Revoxx er upptökuforrit sem er sérstaklega hannað til að taka upp og útbúa hágæða gagnasöfn til þjálfunar á talgervlum. Forritið byggir á reynslu af upptökum á Talrómi 3 (íslenskt gagnasafn með tilfinningaríku tali, http://hdl.handle.net/20.500.12537/344) og hefur það að markmiði að lágmarka upptöku- og eftirvinnslutíma. Forritið býður upp á sjálfvirka textastærðaraðlögun að skjástærð, aðskilin upptökustjóra- og raddgjafaviðmót með fjölskjáastuðningi (þar með talið Apple Sidecar fyrir iPad), og heldur utan um heildarsafn allra frumupptaka, að þeim upptökum meðtöldum sem kann að hafa verið eytt á meðan á upptökum stóð. Helstu eiginleikar eru lotubundið upptökuskipulag með samræmdum hljóðstillingum og lýsigögnum fyrir allar upptökur, sjálfvirk framvinduskráning, mel-rófsrita vöktun í rauntíma, staðlaðar hámarks/RMS-stigs forstillingar, leitarvirkni eftir merkingum/tilfinningum/texta, og magnútflutningsgeta (e. batch export) með valfrjálsum VAD-tímastimplum. Revoxx styður bæði upptökur á hlutlausu og tilfinningaríku tali, sem gerir það kjörið fyrir fjölbreytt raddgagnasöfn. |
dc.publisher |
Grammatek ehf |
dc.rights |
Apache License 2.0 |
dc.rights.uri |
https://opensource.org/license/apache2-0-php/ |
dc.rights.label |
PUB |
dc.source.uri |
https://github.com/icelandic-lt/revoxx |
dc.subject |
TTS |
dc.subject |
speech corpus |
dc.subject |
audio recording |
dc.title |
Revoxx - Speech Recording Application |
dc.type |
toolService |
metashare.ResourceInfo#ContentInfo.detailedType |
tool |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent |
false |
has.files |
yes |
branding |
Clarin IS Repository |
contact.person |
Daniel Schnell; dschnell@grammatek.com; Grammatek ehf |
sponsor |
Ministry of Culture, Innovation and Higher Education; Recordings of children's voices for TTS; Language Technology for Icelandic; nationalFunds |
files.size |
1238222 |
files.count |
1 |
Files in this item
This item is Publicly Available and licensed under: Apache License 2.0
- Name: revoxx-1.0.2.tar.gz
- Size: 1.18 MB
- Format: application/gzip
- Description: source code
- MD5: cdb4488a352d919764f3462ced483c21
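The MD5 value listed above can be used to check that a downloaded copy of the archive is intact. The following is a minimal sketch, not part of the record or of Revoxx itself: the helper names `md5_of_file` and `matches_record` are hypothetical, and the script assumes the archive was saved under its listed name in the current working directory.

```python
import hashlib
from pathlib import Path

# Checksum and file name as listed in this record.
EXPECTED_MD5 = "cdb4488a352d919764f3462ced483c21"
ARCHIVE_NAME = "revoxx-1.0.2.tar.gz"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so large files do not need to fit in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path, expected=EXPECTED_MD5):
    """Compare a file's MD5 digest with the expected value, ignoring case."""
    return md5_of_file(path).lower() == expected.lower()


if __name__ == "__main__":
    # Assumption: the download sits in the current working directory.
    archive = Path(ARCHIVE_NAME)
    if not archive.exists():
        print(f"{archive} not found; download it first")
    elif matches_record(archive):
        print("checksum OK")
    else:
        print("checksum MISMATCH")
```

MD5 is adequate here for detecting a corrupted or truncated download; it is not a safeguard against deliberate tampering.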