dc.contributor.author | Ármannsson, Bjarki |
dc.contributor.author | Hafsteinsson, Hinrik |
dc.contributor.author | Sigtryggsson, Jóhannes B. |
dc.contributor.author | Jasonarson, Atli |
dc.contributor.author | Ingimundarson, Finnur Ágúst |
dc.contributor.author | Sigurðsson, Einar Freyr |
dc.contributor.author | Steingrímsson, Steinþór |
dc.date.accessioned | 2024-10-30T14:38:29Z |
dc.date.available | 2024-10-30T14:38:29Z |
dc.date.issued | 2024-10-30 |
dc.identifier.uri | http://hdl.handle.net/20.500.12537/349 |
dc.description | The Icelandic Standardization Benchmark Set: Spelling and Punctuation (IceStaBS:SP) consists of examples of written text that deviate from the standard with respect to spelling and punctuation, along with explained corrections corresponding to the official spelling rules for Icelandic (https://ritreglur.arnastofnun.is). It is meant to serve as a key component in the development of automatic spell checking in an educational setting, providing handcrafted explanations which can be expanded or used for instruction tuning. See README for further information. This package contains the Icelandic Standardization Benchmark Set: Spelling and Punctuation (IceStaBS:SP), a benchmark set for Icelandic orthography. The data consist of examples of written text that do not conform to the language standard with respect to spelling and punctuation, along with corrected examples and both short and longer explanations based on the official spelling rules for Icelandic (https://ritreglur.arnastofnun.is). The data are intended for use in developing and fine-tuning automatic correction tools and models. This link to the official spelling rules should be especially useful for spelling instruction in schools, where following the rules is mandatory, and the data include purpose-written explanations of corrections based on the discussion in the rules themselves. Further information is in the README file. |
dc.language.iso | isl |
dc.publisher | The Árni Magnússon Institute for Icelandic Studies |
dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ |
dc.rights.label | PUB |
dc.subject | Icelandic |
dc.subject | Spelling |
dc.subject | Punctuation |
dc.subject | Standardization |
dc.subject | Error Corpus |
dc.subject | Spelling Rules |
dc.subject | Spell Checking |
dc.title | Icelandic Standardization Benchmark Set: Spelling and Punctuation 24.10 |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | Clarin IS Repository |
contact.person | Bjarki Ármannsson bjarki.armannsson@arnastofnun.is The Árni Magnússon Institute for Icelandic Studies |
sponsor | Ministry of Culture and Business Affairs Spell and grammar checking (L15) Language Technology for Icelandic nationalFunds |
files.size | 63723 |
files.count | 1 |
Files in this item
This item is publicly available and licensed under: Creative Commons - Attribution 4.0 International (CC BY 4.0)
Name | IceStaBS-SP.zip |
Size | 62.23 KB |
Format | application/zip |
Description | IceStaBS-SP |
MD5 | ec1cd757fa54f25f580a3bd73f57b339 |
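The MD5 checksum and byte size listed in this record can be used to check that a downloaded copy of IceStaBS-SP.zip is intact. The following is a minimal sketch, not part of the dataset or the repository tooling: the function names `md5_of_file` and `verify_download` and the local file path are hypothetical, while the expected values are copied from the record (`files.size` of 63723 bytes, which corresponds to the listed 62.23 KB).

```python
import hashlib
from pathlib import Path

# Values copied from the record; everything else here is illustrative.
EXPECTED_MD5 = "ec1cd757fa54f25f580a3bd73f57b339"
EXPECTED_SIZE = 63723  # bytes (files.size); 63723 / 1024 is about 62.23 KB


def md5_of_file(path, chunk_size=65536):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected_md5=EXPECTED_MD5, expected_size=EXPECTED_SIZE):
    """Return True only if both the byte size and the MD5 digest match."""
    path = Path(path)
    if path.stat().st_size != expected_size:
        return False
    return md5_of_file(path) == expected_md5.lower()


# Example call, assuming the archive was saved in the working directory:
# verify_download("IceStaBS-SP.zip")
```

Checking the size first is a cheap way to reject truncated downloads before hashing. MD5 is suitable here only as an integrity check against accidental corruption, not as protection against deliberate tampering.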