Show simple item record

 
dc.contributor.author Schnell, Daniel
dc.date.accessioned 2025-09-09T09:44:35Z
dc.date.available 2025-09-09T09:44:35Z
dc.date.issued 2025-09-04
dc.identifier.uri http://hdl.handle.net/20.500.12537/365
dc.description ENGLISH: Revoxx - Speech Recording Application Revoxx is a speech recording application specifically designed for creating high-quality TTS datasets quickly and reliably. Born from the experience gained during the recording of Talrómur 3 (the Icelandic emotional speech dataset, http://hdl.handle.net/20.500.12537/344), Revoxx condenses these learnings into a streamlined tool that minimizes recording and post-processing time. The application features automatic text size adjustment to screen real-estate, separate recording engineer and speaker views with multi-screen support (including Apple Sidecar for iPad), and maintains a complete archive of all raw recordings - even deleted takes. Key features include session-based recording organization with consistent audio settings and metadata across all recordings, automatic progress tracking, real-time mel spectrogram monitoring, industry-standard Peak/RMS level presets, advanced search and navigation by label/emotion/text, and batch export capabilities with optional VAD-based voice timestamps. Revoxx supports both emotional and non-emotional recordings, making it ideal for creating diverse speech datasets. ICELANDIC: Revoxx - Upptökuforrit fyrir talgagnasöfn Revoxx er upptökuforrit sem er sérstaklega hannað til að taka upp og útbúa hágæða gagnasöfn til þjálfunar á talgervlum. Forritið byggir á reynslu af upptökum á Talrómi 3 (íslenskt gagnasafn með tilfinningaríku tali, http://hdl.handle.net/20.500.12537/344) og hefur það að markmiði að lágmarka upptöku- og eftirvinnslutíma. Forritið býður upp á sjálfvirka textastærðaraðlögun að skjástærð, aðskilin upptökustjóra- og raddgjafaviðmót með fjölskjáastuðningi (þar með talið Apple Sidecar fyrir iPad), og heldur utan um heildarsafn allra frumupptaka, að þeim upptökum meðtöldum sem kann að hafa verið eytt á meðan á upptökum stóð. Helstu eiginleikar eru lotubundið upptökuskipulag með samræmdum hljóðstillingum og lýsigögnum fyrir allar upptökur, sjálfvirk framvinduskráning, mel-rófsrita vöktun í rauntíma, staðlaðar hámarks/RMS-stigs forstillingar, leitarvirkni eftir merkingum/tilfinningum/texta, og magnútflutningsgeta (e. batch export) með valfrjálsum VAD-tímastimplum. Revoxx styður bæði upptökur á hlutlausu og tilfinningaríku tali, sem gerir það kjörið fyrir fjölbreytt raddgagnasöfn.
dc.publisher Grammatek ehf
dc.rights Apache License 2.0
dc.rights.uri https://opensource.org/license/apache2-0-php/
dc.rights.label PUB
dc.source.uri https://github.com/icelandic-lt/revoxx
dc.subject TTS
dc.subject speech corpus
dc.subject audio recording
dc.title Revoxx - Speech Recording Application
dc.type toolService
metashare.ResourceInfo#ContentInfo.detailedType tool
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent false
has.files yes
branding Clarin IS Repository
contact.person Daniel Schnell dschnell@grammatek.com Grammatek ehf
sponsor Ministry of Culture, Innovation and Higher Education Recordings of children's voices for TTS Language Technology for Icelandic nationalFunds
files.size 1238222
files.count 1


 Files in this item

This item is
Publicly Available
and licensed under:
Apache License 2.0
Icon
Name
revoxx-1.0.2.tar.gz
Size
1.18 MB
Format
application/gzip
Description
source code
MD5
cdb4488a352d919764f3462ced483c21
 Download file  Preview
 File Preview  
  • revoxx-1.0.2
    • shell_scripts
      • publish-pypi.sh1 kB
      • publish-testpypi.sh1 kB
    • t3_scripts
      • t3_addendum.txt1 kB
      • README.txt1 kB
      • t3_intensity_script.txt26 kB
    • README.md12 kB
    • .gitignore669 B
    • tests
      • test_session_controller.py13 kB
      • test_new_session_dialog.py5 kB
      • test_navigation_controller.py12 kB
      • test_file_manager.py7 kB
      • test_stable_sorting.py7 kB
      • test_audio_controller.py14 kB
      • test_ipc_communication.py8 kB
      • test_session_models.py9 kB
      • test_audio_queue_manager.py6 kB
      • test_device_controller.py21 kB
      • test_session_manager.py16 kB
      • test_display_controller.py14 kB
      • test_utterance_list_dialog_sorting.py7 kB
      • test_dialog_controller.py12 kB
      • __init__.py41 B
      • test_file_operations_controller.py13 kB
      • test_active_recordings.py17 kB
      • test_config.py2 kB
      • test_dataset_exporter.py20 kB
      • test_process_manager.py9 kB
    • .flake8246 B
    • scripts_module
      • __init__.py43 B
      • export.py5 kB
      • vadiate.py3 kB
    • pyproject.toml3 kB
    • revoxx
      • __init__.py758 B
      • controllers
        • file_operations_controller.py11 kB
        • __init__.py804 B
        • session_controller.py8 kB
        • dialog_controller.py15 kB
        • device_controller.py18 kB
        • audio_controller.py21 kB
        • process_manager.py12 kB
        • display_controller.py24 kB
        • navigation_controller.py11 kB
      • resources
        • keyboard_shortcuts.txt810 B
        • templates
          • index_format_without_intensity.txt362 B
          • index_format_with_intensity.txt518 B
          • dataset_readme.txt779 B
        • microphone.png130 kB
      • __main__.py117 B
      • dataset
        • exporter.py19 kB
        • __init__.py128 B
      • audio
        • __init__.py117 B
        • buffer_manager.py3 kB
        • player.py15 kB
        • queue_manager.py9 kB
        • recorder.py19 kB
        • audio_queue_processor.py6 kB
        • audio_buffer.py3 kB
        • level_calculator.py5 kB
        • processors
          • processor_base.py1 kB
          • mel_spectrogram.py17 kB
          • __init__.py655 B
          • clipping_detector.py2 kB
        • shared_state.py17 kB
      • app.py30 kB
      • constants.py10 kB
      • session
        • script_parser.py2 kB
        • models.py9 kB
        • __init__.py328 B
        • manager.py12 kB
        • inspector.py7 kB
      • ui
        • widget_initializer.py5 kB
        • style_config.py6 kB
        • level_meter
          • led_level_meter.py34 kB
          • __init__.py216 B
          • config.py3 kB
        • window_manager.py12 kB
        • window_factory.py11 kB
        • __init__.py110 B
        • dialogs
          • open_session_dialog.py15 kB
          • __init__.py287 B
          • utterance_list_base.py22 kB
          • dataset_dialog.py28 kB
          • help_dialog.py2 kB
          • import_text_dialog.py29 kB
          • new_session_dialog.py22 kB
          • utterance_order_dialog.py4 kB
          • find_dialog.py3 kB
          • dialog_utils.py9 kB
          • session_settings_dialog.py12 kB
          • user_guide_dialog.py6 kB
          • progress_dialog.py2 kB
        • frequency_axis.py11 kB
        • menus
          • application_menu.py26 kB
          • audio_devices.py10 kB
        • recording_display_state.py912 B
        • spectrogram
          • __init__.py123 B
          • widget.py36 kB
          • controllers
            • zoom_controller.py7 kB
            • edge_indicator.py3 kB
            • playback_controller.py3 kB
            • clipping_visualizer.py5 kB
            • __init__.py350 B
          • display_utils.py2 kB
          • playback_handler.py15 kB
          • display_base.py15 kB
          • recording_handler.py7 kB
          • recording_display.py6 kB
          • mel_processor_manager.py4 kB
        • emotion_indicator.py8 kB
        • font_manager.py8 kB
        • icon.py1 kB
        • info_overlay.py7 kB
        • themes.py9 kB
        • window_base.py42 kB
      • utils
        • __init__.py261 B
        • process_cleanup.py4 kB
        • state.py5 kB
        • active_recordings.py12 kB
        • text_importer.py11 kB
        • settings_manager.py4 kB
        • spectrogram_utils.py2 kB
        • text_utils.py1 kB
        • config.py7 kB
        • device_manager.py14 kB
        • file_manager.py13 kB
        • audio_utils.py5 kB
      • doc
        • USER_GUIDE.md14 kB
    • .github
      • workflows
        • build.yml1 kB
        • publish.yml2 kB
        • tests.yml1 kB
    • doc
      • import_raw_text.png306 kB
      • screenshot1.png546 kB
    • LICENSE11 kB
    • MANIFEST.in251 B
    • pax_global_header52 B

Show simple item record