Show simple item record

 
dc.contributor.author Schnell, Daniel
dc.date.accessioned 2026-04-24T14:08:45Z
dc.date.available 2026-04-24T14:08:45Z
dc.date.issued 2026-04-10
dc.identifier.uri http://hdl.handle.net/20.500.12537/385
dc.description ENGLISH: Revoxx - Speech Recording Application Revoxx is a speech recording application specifically designed for creating high-quality TTS datasets quickly and reliably. Born from the experience gained during the recording of Talrómur 3 (the Icelandic emotional speech dataset, http://hdl.handle.net/20.500.12537/344), Revoxx condenses these learnings into a streamlined tool that minimizes recording and post-processing time. The application features automatic text size adjustment to screen real-estate, separate recording engineer and speaker views with multi-screen support (including Apple Sidecar for iPad), and maintains a complete archive of all raw recordings - even deleted takes. Key features include session-based recording organization with consistent audio settings and metadata across all recordings, automatic progress tracking, real-time mel spectrogram monitoring, industry-standard Peak/RMS level presets, advanced search and navigation by label/emotion/text, and batch export capabilities with optional VAD-based voice timestamps. Revoxx supports both emotional and non-emotional recordings, making it ideal for creating diverse speech datasets. For further documentation see project URL. ICELANDIC: Revoxx - Upptökuforrit fyrir talgagnasöfn Revoxx er upptökuforrit sem er sérstaklega hannað til að taka upp og útbúa hágæða gagnasöfn til þjálfunar á talgervlum. Forritið byggir á reynslu af upptökum á Talrómi 3 (íslenskt gagnasafn með tilfinningaríku tali, http://hdl.handle.net/20.500.12537/344) og hefur það að markmiði að lágmarka upptöku- og eftirvinnslutíma. Forritið býður upp á sjálfvirka textastærðaraðlögun að skjástærð, aðskilin upptökustjóra- og raddgjafaviðmót með fjölskjáastuðningi (þar með talið Apple Sidecar fyrir iPad), og heldur utan um heildarsafn allra frumupptaka, að þeim upptökum meðtöldum sem kann að hafa verið eytt á meðan á upptökum stóð. Helstu eiginleikar eru lotubundið upptökuskipulag með samræmdum hljóðstillingum og lýsigögnum fyrir allar upptökur, sjálfvirk framvinduskráning, mel-rófsrita vöktun í rauntíma, staðlaðar hámarks/RMS-stigs forstillingar, leitarvirkni eftir merkingum/tilfinningum/texta, og magnútflutningsgeta (e. batch export) með valfrjálsum VAD-tímastimplum. Revoxx styður bæði upptökur á hlutlausu og tilfinningaríku tali, sem gerir það kjörið fyrir fjölbreytt raddgagnasöfn. Sjá GitHub hirslu fyrir frekari skjölun.
dc.publisher Grammatek ehf
dc.relation.replaces http://hdl.handle.net/20.500.12537/365
dc.rights Apache License 2.0
dc.rights.uri https://opensource.org/license/apache2-0-php/
dc.rights.label PUB
dc.source.uri https://github.com/icelandic-lt/revoxx
dc.subject speech
dc.subject audio recording
dc.title Revoxx - Speech Recording Application (v1.3.2)
dc.type toolService
metashare.ResourceInfo#ContentInfo.detailedType tool
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent false
has.files yes
branding Clarin IS Repository
demo.uri https://pypi.org/project/revoxx/
contact.person Daniel Schnell dschnell@grammatek.com Grammatek ehf
sponsor Ministry of Culture, Innovation and Higher Education Recordings of children's voices for TTS Language Technology for Icelandic II nationalFunds
files.size 1298199
files.count 1


 Files in this item

This item is
Publicly Available
and licensed under:
Apache License 2.0
Icon
Name
revoxx-1.3.2.tar.gz
Size
1.24 MB
Format
application/gzip
Description
revoxx repository
MD5
a0775c483abc52e01066c67566ba8d48
 Download file  Preview
 File Preview  
  • revoxx-1.3.2
    • README.md14 kB
    • PKG-INFO16 kB
    • tests
      • test_session_controller.py13 kB
      • test_new_session_dialog.py5 kB
      • test_navigation_controller.py13 kB
      • test_file_manager.py7 kB
      • test_stable_sorting.py7 kB
      • test_audio_controller.py17 kB
      • test_ipc_communication.py8 kB
      • test_tk_compat.py1 kB
      • test_session_models.py10 kB
      • test_audio_queue_manager.py6 kB
      • test_device_controller.py29 kB
      • test_session_manager.py16 kB
      • test_display_controller.py14 kB
      • test_device_manager.py6 kB
      • test_utterance_list_dialog_sorting.py7 kB
      • test_dialog_controller.py13 kB
      • test_file_operations_controller.py13 kB
      • test_active_recordings.py17 kB
      • test_config.py2 kB
      • test_dataset_exporter.py35 kB
      • test_process_manager.py9 kB
    • setup.cfg38 B
    • scripts_module
      • __init__.py43 B
      • omnivad_processor.py3 kB
      • export.py5 kB
      • vadiate.py7 kB
    • revoxx.egg-info
      • entry_points.txt127 B
      • requires.txt330 B
      • SOURCES.txt4 kB
      • PKG-INFO16 kB
      • top_level.txt22 B
      • dependency_links.txt1 B
    • pyproject.toml4 kB
    • revoxx
      • __init__.py758 B
      • controllers
        • file_operations_controller.py11 kB
        • __init__.py936 B
        • session_controller.py9 kB
        • asr_auto_controller.py6 kB
        • dialog_controller.py16 kB
        • device_controller.py19 kB
        • flag_controller.py7 kB
        • audio_controller.py30 kB
        • process_manager.py13 kB
        • navigation_controller.py12 kB
        • display_controller.py27 kB
        • edit_controller.py28 kB
      • resources
        • keyboard_shortcuts.txt1 kB
        • templates
          • index_format_without_intensity.txt363 B
          • index_format_with_intensity.txt519 B
          • dataset_readme.txt478 B
        • microphone.png130 kB
      • __main__.py117 B
      • dataset
        • exporter.py25 kB
        • __init__.py128 B
        • asr_verifier.py8 kB
      • audio
        • __init__.py117 B
        • buffer_manager.py3 kB
        • player.py20 kB
        • queue_manager.py10 kB
        • recorder.py22 kB
        • audio_queue_processor.py6 kB
        • audio_buffer.py3 kB
        • worker_state.py559 B
        • edit_commands.py10 kB
        • processors
          • processor_base.py1 kB
          • mel_spectrogram.py17 kB
          • __init__.py655 B
          • clipping_detector.py2 kB
        • level_calculator.py5 kB
        • editor.py15 kB
        • shared_state.py17 kB
        • undo_stack.py4 kB
      • app.py37 kB
      • constants.py12 kB
      • session
        • script_parser.py2 kB
        • models.py10 kB
        • __init__.py328 B
        • manager.py12 kB
        • inspector.py7 kB
      • ui
        • widget_initializer.py5 kB
        • style_config.py6 kB
        • level_meter
          • led_level_meter.py34 kB
          • __init__.py216 B
          • config.py3 kB
        • window_manager.py12 kB
        • window_factory.py11 kB
        • __init__.py110 B
        • dialogs
          • open_session_dialog.py15 kB
          • __init__.py287 B
          • utterance_list_base.py26 kB
          • dataset_dialog.py37 kB
          • asr_dialog.py15 kB
          • help_dialog.py2 kB
          • import_text_dialog.py29 kB
          • new_session_dialog.py22 kB
          • utterance_order_dialog.py4 kB
          • find_dialog.py3 kB
          • dialog_utils.py10 kB
          • session_settings_dialog.py12 kB
          • user_guide_dialog.py11 kB
          • progress_dialog.py2 kB
        • frequency_axis.py11 kB
        • menus
          • application_menu.py32 kB
          • audio_devices.py12 kB
        • recording_display_state.py912 B
        • spectrogram
          • selection_interaction.py13 kB
          • view_context.py2 kB
          • selection_state.py4 kB
          • __init__.py123 B
          • widget.py44 kB
          • controllers
            • zoom_controller.py7 kB
            • edge_indicator.py3 kB
            • playback_controller.py8 kB
            • clipping_visualizer.py5 kB
            • __init__.py465 B
            • selection_visualizer.py13 kB
          • display_utils.py2 kB
          • playback_handler.py22 kB
          • display_base.py15 kB
          • recording_handler.py7 kB
          • recording_display.py6 kB
          • mel_processor_manager.py4 kB
        • emotion_indicator.py8 kB
        • font_manager.py8 kB
        • icon.py1 kB
        • info_overlay.py7 kB
        • themes.py9 kB
        • window_base.py46 kB
      • utils
        • adaptive_frame_rate.py3 kB
        • __init__.py261 B
        • process_cleanup.py4 kB
        • tk_compat.py2 kB
        • state.py5 kB
        • active_recordings.py14 kB
        • text_importer.py11 kB
        • settings_manager.py5 kB
        • spectrogram_utils.py2 kB
        • text_utils.py1 kB
        • config.py7 kB
        • device_manager.py15 kB
        • file_manager.py15 kB
        • audio_utils.py5 kB
      • doc
        • USER_GUIDE.md26 kB
    • doc
      • import_raw_text.png306 kB
      • screenshot1.png546 kB
    • LICENSE11 kB
    • MANIFEST.in251 B

Show simple item record