dc.contributor.author |
Schnell, Daniel |
dc.date.accessioned |
2024-04-23T16:52:15Z |
dc.date.available |
2024-04-23T16:52:15Z |
dc.date.issued |
2024-04-16 |
dc.identifier.uri |
http://hdl.handle.net/20.500.12537/329 |
dc.description |
IceHoC is a binary classifier for Icelandic homographs following the pattern V-ll-(V|$) where the 'll' can be pronounced either /tl/ or /l/. The classifier was trained on the Labeled Corpus of Icelandic Homographs (http://hdl.handle.net/20.500.12537/327). Please refer to the projects README for further discussions and guidelines for usage.
IceHoC er tól sem flokkar íslensk samstafa orð sem fylgja mynstrinu V-ll-(V|$), eða sérhljóð-ll-sérhljóð_eða_lok_orðs. Í þessum orðum er 'll' borið fram ýmist /tl/ eða /l/, eftir merkingu orðsins. IceHoC var þjálfað á málheild íslenskra samstafa orða (http://hdl.handle.net/20.500.12537/327). Fyrir nánari umfjöllun og leiðbeiningar um notkun, sjá README. |
dc.language.iso |
isl |
dc.publisher |
Grammatek ehf. |
dc.rights |
Apache License 2.0 |
dc.rights.uri |
https://opensource.org/license/apache2-0-php/ |
dc.rights.label |
PUB |
dc.source.uri |
https://github.com/grammatek/IceHoc/releases/tag/M12 |
dc.subject |
homographs |
dc.subject |
embeddings |
dc.subject |
tts |
dc.title |
Icelandic Homograph Classifier (24.04.) |
dc.type |
toolService |
metashare.ResourceInfo#ContentInfo.detailedType |
tool |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent |
true |
has.files |
yes |
branding |
Clarin IS Repository |
contact.person |
Daniel Schnell dschnell@grammatek.com Grammatek ehf. |
sponsor |
Ministry of Culture and Business Affairs Prosody and Intonation Analysis (T11) Language Technology for Icelandic 2019-2023 nationalFunds |
files.size |
30862 |
files.count |
1 |