dc.contributor.author |
Nikulásdóttir, Anna Björk |
dc.date.accessioned |
2022-09-30T17:13:44Z |
dc.date.available |
2022-09-30T17:13:44Z |
dc.date.issued |
2022-10-01 |
dc.identifier.uri |
http://hdl.handle.net/20.500.12537/296 |
dc.description |
ENGLISH:
Grapheme-to-phoneme (g2p) module for Icelandic. The module can be used to transcribe Icelandic in four pronunciation variants (standard pronunciation, north Iceland, north-east Iceland, south Iceland), with different levels of detail and in four different phonetic alphabets. Default output is X-SAMPA phonetic alphabet without syllabification or stress labeling, according to standard pronunciation. The module transcribes English words using the Icelandic phoneset but close to English transcription rules. A transcription dictionary is also a part of the package. The package can be installed from PyPI: pip install ice-g2p
ICELANDIC:
Hljóðritunarforrit (g2p) fyrir íslensku. Forritið má nota til þess að hljóðrita íslensku skv. fjórum framburðartilbrigðum (hefðbundnum framburði, harðmæli, rödduðum framburði og hv-framburði), með mismiklum upplýsingum og í fjórum mismunandi hljóðritunarstafrófum. Séu engar stillingar sérvaldar þá skilar forritið úttaki í X-SAMPA hljóðritunarstafrófinu, án atkvæðaskiptinga eða áherslumerkinga, skv. hefðbundnum framburði. Forritið hljóðritar ensk orð með íslenskum hljóðritunartáknum en eins nálægt enskum reglum og mögulegt er. Framburðarorðabók fylgir pakkanum. Hægt er að sækja pakkann á PyPI: pip install ice-g2p |
dc.language.iso |
isl |
dc.publisher |
Grammatek ehf. |
dc.rights |
Apache License 2.0 |
dc.rights.uri |
https://opensource.org/license/apache2-0-php/ |
dc.rights.label |
PUB |
dc.source.uri |
https://github.com/grammatek/ice-g2p/releases/tag/v1.2.1 |
dc.subject |
phonetics |
dc.subject |
pronunciation |
dc.subject |
grapheme-to-phoneme models |
dc.subject |
phonetic transcription |
dc.subject |
dialectal variation |
dc.subject |
g2p |
dc.subject |
text-to-speech |
dc.title |
Grapheme-to-phoneme (g2p) module for Icelandic (22.10) |
dc.type |
toolService |
metashare.ResourceInfo#ContentInfo.detailedType |
tool |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent |
true |
has.files |
yes |
branding |
Clarin IS Repository |
contact.person |
Anna Björk Nikulásdóttir anna@grammatek.com Grammatek ehf. |
sponsor |
Ministry of Education, Science and Culture Automatic Transcriptions (T9) Language Technology for Icelandic 2019-2023 nationalFunds |
files.size |
3667169 |
files.count |
1 |