dc.contributor.author |
Nikulásdóttir, Anna Björk |
dc.contributor.author |
Ármannsson, Bjarki |
dc.contributor.author |
Bergþórsdóttir, Bryndís |
dc.contributor.author |
Rögnvaldsson, Eiríkur |
dc.date.accessioned |
2021-01-19T20:32:06Z |
dc.date.available |
2021-01-19T20:32:06Z |
dc.date.issued |
2021-01-19 |
dc.identifier.uri |
http://hdl.handle.net/20.500.12537/97 |
dc.description |
The Icelandic Pronunciation Dictionary contains manually revised transcriptions in four pronunciation variants of Icelandic: the standard pronunciation, the northern post-aspiration variant ("harðmæli"), the north-eastern variant (post-aspiration combined with voiced pronunciation), and the southern hv-variant. For descriptions of the Icelandic pronunciation variants, see `IPA_Pronunciation.pdf` or `SAMPA_Pronunciation.pdf` (the two documents have identical content, but the former uses the IPA phonetic alphabet and the latter the SAMPA phonetic alphabet).
The file `sampa_ipa_single.tsv` contains the set of SAMPA symbols used in the dictionaries and their mappings to IPA on the one hand and, on the other, to a custom single-character alphabet developed for use in end-to-end speech synthesis, in which each sound is always represented by a single letter or digit.
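As a rough illustration of how such a mapping file might be used, the following Python sketch loads a tab-separated mapping and converts a SAMPA transcription symbol by symbol. The three-column order (SAMPA, IPA, single-character), the sample rows, the single-character values, and the space-separated transcription format are all assumptions made for this example; the actual layout of `sampa_ipa_single.tsv` and of the dictionary files should be checked before use. The helper names `load_mapping` and `convert` are hypothetical.

```python
import csv
import io

# Hypothetical excerpt for illustration only. The real column order and
# symbol inventory of sampa_ipa_single.tsv may differ, and the values in
# the third (single-character) column are invented placeholders.
SAMPLE_TSV = "T\tθ\tT\nD\tð\tD\nt_h\ttʰ\t1\na:\taː\t2\na\ta\ta\n"


def load_mapping(handle, target_column=1):
    """Return a dict from the SAMPA symbol (column 0) to the chosen column."""
    # QUOTE_NONE keeps quote characters literal, since phonetic symbol
    # sets may contain them.
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    return {row[0]: row[target_column] for row in reader if len(row) > target_column}


def convert(transcription, mapping):
    """Map a space-separated SAMPA transcription symbol by symbol."""
    return " ".join(mapping[symbol] for symbol in transcription.split())


# Build one lookup table per target alphabet from the same source.
sampa_to_ipa = load_mapping(io.StringIO(SAMPLE_TSV), target_column=1)
sampa_to_single = load_mapping(io.StringIO(SAMPLE_TSV), target_column=2)

print(convert("t_h a: D", sampa_to_ipa))
print(convert("t_h a: D", sampa_to_single))
```

To read the real file instead of the sample, `io.StringIO(SAMPLE_TSV)` could be replaced with `open("sampa_ipa_single.tsv", encoding="utf-8", newline="")`, again assuming the column layout described above.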
The project is funded by the Icelandic Government as part of the Language Technology Programme for Icelandic 2019–2023, which is described in the following publication:
Anna Björk Nikulásdóttir, Jón Guðnason, Anton Karl Ingason, Hrafn Loftsson, Eiríkur Rögnvaldsson, Einar Freyr Sigurðsson, Steinþór Steingrímsson. 2020. Language Technology Programme for Icelandic 2019–2023. Proceedings of LREC 2020 (https://arxiv.org/pdf/2003.09244.pdf)
The Icelandic Pronunciation Dictionary (Íslensk framburðarorðabók) contains manually reviewed transcriptions in four pronunciation variants of Icelandic: what may be called the standard pronunciation, northern harðmæli, harðmæli + voiced pronunciation, which is characteristic of north-east Iceland, and the southern hv-pronunciation. More detailed descriptions of pronunciation and pronunciation variants in Icelandic can be found in the documents `IPA_Pronunciation.pdf` or `SAMPA_Pronunciation.pdf` (these documents are identical in every respect, except that the former uses the IPA phonetic alphabet and the latter the SAMPA alphabet to describe pronunciation).
The file `sampa_ipa_single.tsv` contains a list of the SAMPA transcription symbols used in the dictionary, with mappings to IPA symbols on the one hand and to a new alphabet on the other, in which each sound is always represented by a single letter or digit; such an alphabet can be useful in end-to-end speech synthesis.
The project is part of the Language Technology Programme for Icelandic 2019–2023.
Anna Björk Nikulásdóttir, Jón Guðnason, Anton Karl Ingason, Hrafn Loftsson, Eiríkur Rögnvaldsson, Einar Freyr Sigurðsson, Steinþór Steingrímsson. 2020. Language Technology Programme for Icelandic 2019–2023. Proceedings of LREC 2020 (https://arxiv.org/pdf/2003.09244.pdf) |
dc.language.iso |
isl |
dc.publisher |
Grammatek ehf |
dc.relation.isreferencedby |
https://ieeexplore.ieee.org/document/8639590 |
dc.relation.isreplacedby |
http://hdl.handle.net/20.500.12537/154 |
dc.source.uri |
https://github.com/grammatek/iceprondict |
dc.subject |
phonetics |
dc.subject |
pronunciation |
dc.subject |
dialectal variation |
dc.subject |
phonetic transcription |
dc.subject |
pronunciation dictionary |
dc.subject |
ipa |
dc.subject |
sampa |
dc.title |
Icelandic Pronunciation Dictionary (21.01) |
dc.type |
lexicalConceptualResource |
metashare.ResourceInfo#ContentInfo.detailedType |
wordList |
metashare.ResourceInfo#ContentInfo.mediaType |
text |
hidden |
false |
hasMetadata |
false |
has.files |
no |
branding |
Clarin IS Repository |
contact.person |
Anna Björk Nikulásdóttir, anna@grammatek.com, Grammatek ehf.
sponsor |
Ministry of Education, Science and Culture; Pronunciation dictionary (G6); Language Technology for Icelandic 2019-2023; nationalFunds
size.info |
28,765 entries |
files.size |
0 |
files.count |
0 |