dc.contributor.author Ármannsson, Bjarki
dc.contributor.author Hafsteinsson, Hinrik
dc.contributor.author Sigtryggsson, Jóhannes B.
dc.contributor.author Jasonarson, Atli
dc.contributor.author Ingimundarson, Finnur Ágúst
dc.contributor.author Sigurðsson, Einar Freyr
dc.contributor.author Steingrímsson, Steinþór
dc.date.accessioned 2024-10-30T14:38:29Z
dc.date.available 2024-10-30T14:38:29Z
dc.date.issued 2024-10-30
dc.identifier.uri http://hdl.handle.net/20.500.12537/349
dc.description The Icelandic Standardization Benchmark Set: Spelling and Punctuation (IceStaBS:SP) consists of examples of written text that deviate from the standard with respect to spelling and punctuation, along with explained corrections corresponding to the official spelling rules for Icelandic (https://ritreglur.arnastofnun.is). Each example comes with a corrected version and both a short and a longer explanation based on those rules. The set is meant to serve as a key component in the development of automatic spell checking in an educational setting, providing handcrafted explanations which can be expanded or used for instruction tuning; more broadly, it is intended for developing and fine-tuning automatic correction tools and models. The link to the official spelling rules should be especially useful for spelling instruction in schools, where following the rules is mandatory. See README for further information.
dc.language.iso isl
dc.publisher The Árni Magnússon Institute for Icelandic Studies
dc.rights Creative Commons - Attribution 4.0 International (CC BY 4.0)
dc.rights.uri https://creativecommons.org/licenses/by/4.0/
dc.rights.label PUB
dc.subject Icelandic
dc.subject Spelling
dc.subject Punctuation
dc.subject Standardization
dc.subject Error Corpus
dc.subject Spelling Rules
dc.subject Spell Checking
dc.title Icelandic Standardization Benchmark Set: Spelling and Punctuation 24.10
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding Clarin IS Repository
contact.person Bjarki Ármannsson <bjarki.armannsson@arnastofnun.is>, The Árni Magnússon Institute for Icelandic Studies
sponsor Ministry of Culture and Business Affairs; Spell and grammar checking (L15); Language Technology for Icelandic; nationalFunds
files.size 63723
files.count 1


 Files in this item

This item is publicly available and licensed under: Creative Commons - Attribution 4.0 International (CC BY 4.0)
Name IceStaBS-SP.zip
Size 62.23 KB
Format application/zip
Description IceStaBS-SP
MD5 ec1cd757fa54f25f580a3bd73f57b339
File preview
    • README.md (2 kB)
    • IceStaBS-SP.json (481 kB)
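
The MD5 value listed for IceStaBS-SP.zip can be used to check a downloaded copy for corruption. The following is a minimal sketch using only the Python standard library; the local path `IceStaBS-SP.zip` and the helper names `md5_of_file` and `matches_record` are assumptions made for illustration and are not part of the repository or the dataset.

```python
import hashlib
from pathlib import Path

# MD5 checksum as listed in this record for IceStaBS-SP.zip.
EXPECTED_MD5 = "ec1cd757fa54f25f580a3bd73f57b339"


def md5_of_file(path, chunk_size=1 << 16):
    """Return the hex MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path, expected=EXPECTED_MD5):
    """True if the file's MD5 digest equals the expected hex string."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed download location; adjust to wherever the archive was saved.
    archive = Path("IceStaBS-SP.zip")
    if archive.exists():
        print("checksum OK" if matches_record(archive) else "checksum mismatch")
    else:
        print(f"{archive} not found; download it first")
```

A mismatch usually indicates an incomplete or altered download. MD5 is adequate for detecting accidental corruption but is not a safeguard against deliberate tampering.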
