dc.contributor.author Nikulásdóttir, Anna Björk
dc.date.accessioned 2022-09-30T17:13:44Z
dc.date.available 2022-09-30T17:13:44Z
dc.date.issued 2022-10-01
dc.identifier.uri http://hdl.handle.net/20.500.12537/296
dc.description Grapheme-to-phoneme (g2p) module for Icelandic. The module transcribes Icelandic in four pronunciation variants (standard pronunciation, north Iceland, north-east Iceland, south Iceland), at different levels of detail and in four phonetic alphabets. In Icelandic terms, the three regional variants are harðmæli (hard pronunciation), raddaður framburður (voiced pronunciation) and hv-framburður (hv-pronunciation). Unless other settings are selected, the output follows standard pronunciation and uses the X-SAMPA phonetic alphabet, without syllabification or stress labeling. English words are transcribed with the Icelandic phoneset, staying as close to English transcription rules as possible. A pronunciation dictionary is included in the package. The package can be installed from PyPI: pip install ice-g2p
dc.language.iso isl
dc.publisher Grammatek ehf.
dc.rights Apache License 2.0
dc.rights.uri https://opensource.org/license/apache2-0-php/
dc.rights.label PUB
dc.source.uri https://github.com/grammatek/ice-g2p/releases/tag/v1.2.1
dc.subject phonetics
dc.subject pronunciation
dc.subject grapheme-to-phoneme models
dc.subject phonetic transcription
dc.subject dialectal variation
dc.subject g2p
dc.subject text-to-speech
dc.title Grapheme-to-phoneme (g2p) module for Icelandic (22.10)
dc.type toolService
metashare.ResourceInfo#ContentInfo.detailedType tool
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent true
has.files yes
branding Clarin IS Repository
contact.person Anna Björk Nikulásdóttir; anna@grammatek.com; Grammatek ehf.
sponsor Ministry of Education, Science and Culture; Automatic Transcriptions (T9); Language Technology for Icelandic 2019-2023; nationalFunds
files.size 3667169
files.count 1


 Files in this item

This item is publicly available and licensed under: Apache License 2.0
Name ice-g2p-1.2.1.tar.gz
Size 3.5 MB
Format application/gzip
MD5 ceb4c4f856b0b68bd8e0d9b4404ef6d5
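The MD5 value listed for the archive can be used to check a downloaded copy. The following is a minimal sketch using only the Python standard library; the helper names `md5_of_file` and `verify_archive` are hypothetical and not part of ice-g2p, and the script assumes the archive was saved under its listed name in the current directory.

```python
import hashlib
from pathlib import Path

# MD5 checksum as listed in this record for ice-g2p-1.2.1.tar.gz.
EXPECTED_MD5 = "ceb4c4f856b0b68bd8e0d9b4404ef6d5"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value (case-insensitive)."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local file name: the archive name shown in this record.
    archive = Path("ice-g2p-1.2.1.tar.gz")
    if archive.exists():
        print("checksum OK" if verify_archive(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

MD5 is adequate here for detecting a corrupted or truncated download; it is not a defence against deliberate tampering.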
 File Preview  
  • ice-g2p-1.2.1
    • src
      • ice_g2p
        • __init__.py (0 B)
        • transcriber.py (6 kB)
        • g2p_lstm.py (4 kB)
        • syllabification.py (6 kB)
        • dictionaries.py (1 kB)
        • trigrams.py (428 kB)
        • stress.py (5 kB)
        • data
          • vowels_ipa.txt (111 B)
          • modifier_map.csv (3 MB)
          • cons_clusters_ipa.txt (95 B)
          • cons_clusters_sampa.txt (95 B)
          • head_map.csv (3 MB)
          • sampa_ipa_single_flite.csv (651 B)
          • vowels_sampa.txt (84 B)
        • converter.py (3 kB)
        • tree_builder.py (5 kB)
        • syllable.py (1 kB)
        • entry.py (3 kB)
        • dictionaries
          • ice_pron_dict_north_clear.csv (1 MB)
          • ice_pron_dict_english_clear.csv (1 MB)
          • ice_pron_dict_standard_clear.csv (1 MB)
        • main.py (7 kB)
        • fairseq_models
          • .DS_Store (6 kB)
        • syllab_stress_processing.py (3 kB)
        • fetch_models.py (1 kB)
    • setup.py (1 kB)
    • README.md (6 kB)
    • .gitignore (41 B)
    • setup.cfg (1 kB)
    • grammatek-logo-small.png (13 kB)
    • pyproject.toml (103 B)
    • requirements.txt (36 B)
    • test
      • g2p_lstm_test.py (4 kB)
      • __init__.py (0 B)
    • LICENSE (11 kB)
    • MANIFEST.in (131 B)
    • pax_global_header (52 B)
