Sýna einfalda færslu atriðis
dc.contributor.author |
Nikulásdóttir, Anna Björk |
dc.date.accessioned |
2022-09-30T17:13:44Z |
dc.date.available |
2022-09-30T17:13:44Z |
dc.date.issued |
2022-10-01 |
dc.identifier.uri |
http://hdl.handle.net/20.500.12537/296 |
dc.description |
ENGLISH:
Grapheme-to-phoneme (g2p) module for Icelandic. The module can be used to transcribe Icelandic in four pronunciation variants (standard pronunciation, north Iceland, north-east Iceland, south Iceland), with different levels of detail and in four different phonetic alphabets. Default output is X-SAMPA phonetic alphabet without syllabification or stress labeling, according to standard pronunciation. The module transcribes English words using the Icelandic phoneset but close to English transcription rules. A transcription dictionary is also a part of the package. The package can be installed from PyPI: pip install ice-g2p
ICELANDIC:
Hljóðritunarforrit (g2p) fyrir íslensku. Forritið má nota til þess að hljóðrita íslensku skv. fjórum framburðartilbrigðum (hefðbundnum framburði, harðmæli, rödduðum framburði og hv-framburði), með mismiklum upplýsingum og í fjórum mismunandi hljóðritunarstafrófum. Séu engar stillingar sérvaldar þá skilar forritið úttaki í X-SAMPA hljóðritunarstafrófinu, án atkvæðaskiptinga eða áherslumerkinga, skv. hefðbundnum framburði. Forritið hljóðritar ensk orð með íslenskum hljóðritunartáknum en eins nálægt enskum reglum og mögulegt er. Framburðarorðabók fylgir pakkanum. Hægt er að sækja pakkann á PyPI: pip install ice-g2p |
dc.language.iso |
isl |
dc.publisher |
Grammatek ehf. |
dc.rights |
Apache License 2.0 |
dc.rights.uri |
https://opensource.org/license/apache2-0-php/ |
dc.rights.label |
PUB |
dc.source.uri |
https://github.com/grammatek/ice-g2p/releases/tag/v1.2.1 |
dc.subject |
phonetics |
dc.subject |
pronunciation |
dc.subject |
grapheme-to-phoneme models |
dc.subject |
phonetic transcription |
dc.subject |
dialectal variation |
dc.subject |
g2p |
dc.subject |
text-to-speech |
dc.title |
Grapheme-to-phoneme (g2p) module for Icelandic (22.10) |
dc.type |
toolService |
metashare.ResourceInfo#ContentInfo.detailedType |
tool |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent |
true |
has.files |
yes |
branding |
Clarin IS Repository |
contact.person |
Anna Björk Nikulásdóttir anna@grammatek.com Grammatek ehf. |
sponsor |
Ministry of Education, Science and Culture Automatic Transcriptions (T9) Language Technology for Icelandic 2019-2023 nationalFunds |
files.size |
3667169 |
files.count |
1 |
Files in this item
This item is
Publicly Available
and licensed under:
Apache License 2.0
- Name
- ice-g2p-1.2.1.tar.gz
- Size
- 3.5
MB
- Format
- application/gzip
- MD5
- ceb4c4f856b0b68bd8e0d9b4404ef6d5
Download file
Preview
- ice-g2p-1.2.1
- src
- ice_g2p
- __init__.py0 B
- transcriber.py6 kB
- g2p_lstm.py4 kB
- syllabification.py6 kB
- dictionaries.py1 kB
- trigrams.py428 kB
- stress.py5 kB
- data
- vowels_ipa.txt111 B
- modifier_map.csv3 MB
- cons_clusters_ipa.txt95 B
- cons_clusters_sampa.txt95 B
- head_map.csv3 MB
- sampa_ipa_single_flite.csv651 B
- vowels_sampa.txt84 B
- converter.py3 kB
- tree_builder.py5 kB
- syllable.py1 kB
- entry.py3 kB
- dictionaries
- ice_pron_dict_north_clear.csv1 MB
- ice_pron_dict_english_clear.csv1 MB
- ice_pron_dict_standard_clear.csv1 MB
- main.py7 kB
- fairseq_models
- syllab_stress_processing.py3 kB
- fetch_models.py1 kB
- setup.py1 kB
- README.md6 kB
- .gitignore41 B
- setup.cfg1 kB
- grammatek-logo-small.png13 kB
- pyproject.toml103 B
- requirements.txt36 B
- test
- g2p_lstm_test.py4 kB
- __init__.py0 B
- LICENSE11 kB
- MANIFEST.in131 B
Sýna einfalda færslu atriðis