dc.contributor.author | Ingason, Anton Karl |
dc.contributor.author | Arnardóttir, Þórunn |
dc.contributor.author | Stefánsdóttir, Lilja Björk |
dc.contributor.author | Xu, Xindan |
dc.date.accessioned | 2021-04-21T15:36:39Z |
dc.date.available | 2021-04-21T15:36:39Z |
dc.date.issued | 2021-04-26 |
dc.identifier.uri | http://hdl.handle.net/20.500.12537/107 |
dc.description | The Icelandic Dyslexia Error Corpus (IceDEC) is a collection of texts in modern Icelandic, written by people who have been diagnosed with dyslexia. They have been annotated for mistakes related to spelling, grammar, and other issues. Each mistake is marked according to error type using an error code, of which there are 253. The corpus consists of 15 files with 2,227 categorized error instances. Íslenska lesblinduvillumálheildin er safn texta á nútímaíslensku sem hafa verið skrifaðir af einstaklingum sem greindir hafa verið með dyslexíu. Villur hafa verið merktar, t.d. hvað varðar stafsetningu, málfræði og fleira. Hver villa í textanum er flokkuð með hjálp villukóða, en 253 villukóðar eru notaðir í málheildinni. Málheildin samanstendur af 15 textum með 2.227 flokkuðum villutilvikum. |
dc.language.iso | isl |
dc.publisher | University of Iceland |
dc.relation.isreplacedby | http://hdl.handle.net/20.500.12537/132 |
dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ |
dc.rights.label | PUB |
dc.subject | error corpus |
dc.subject | grammatical errors |
dc.subject | spelling errors |
dc.subject | dyslexia |
dc.title | The Icelandic Dyslexia Error Corpus (IceDEC) Version 1.0 |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | Clarin IS Repository |
demo.uri | https://github.com/antonkarl/iceErrorCorpusSpecialized |
contact.person | Anton Karl Ingason anton.karl.ingason@gmail.com University of Iceland |
sponsor | Ministry of Education, Science and Culture Specialized error corpora (L2) Language Technology for Icelandic 2019-2023 nationalFunds |
size.info | 2227 other |
files.size | 259387 |
files.count | 1 |
Files in this item
This item is
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution 4.0 International (CC BY 4.0)
- Name
- iceErrorCorpusDyslexia.zip
- Size
- 253.31 KB
- Format
- application/zip
- Description
- A zip file containing the corpus and information on error codes
- MD5
- 02648b06e943172b96c2d1743ef209b8
- __MACOSX
- ._iceErrorCorpusDyslexia-1 B
- iceErrorCorpusDyslexia
- ._README.md-1 B
- ._IEC_ErrorCodes.pdf-1 B
- ._data-1 B
- data
- ._texti013.xml-1 B
- ._texti012.xml-1 B
- ._texti009.xml-1 B
- ._texti011.xml-1 B
- ._texti008.xml-1 B
- ._texti010.xml-1 B
- ._texti007.xml-1 B
- ._texti006.xml-1 B
- ._texti005.xml-1 B
- ._texti004.xml-1 B
- ._texti003.xml-1 B
- ._texti002.xml-1 B
- ._texti015.xml-1 B
- ._texti001.xml-1 B
- ._texti014.xml-1 B
- ._errorCodes.tsv-1 B
- iceErrorCorpusDyslexia
- README.md-1 B
- data
- texti009.xml-1 B
- texti011.xml-1 B
- texti008.xml-1 B
- texti010.xml-1 B
- texti007.xml-1 B
- texti006.xml-1 B
- texti005.xml-1 B
- texti004.xml-1 B
- texti003.xml-1 B
- texti002.xml-1 B
- texti015.xml-1 B
- texti001.xml-1 B
- texti014.xml-1 B
- texti013.xml-1 B
- texti012.xml-1 B
- IEC_ErrorCodes.pdf-1 B
- errorCodes.tsv-1 B