dc.contributor.author | Ingason, Anton Karl |
dc.contributor.author | Arnardóttir, Þórunn |
dc.contributor.author | Stefánsdóttir, Lilja Björk |
dc.contributor.author | Xu, Xindan |
dc.contributor.author | Guðmundsdóttir, Dagbjört |
dc.contributor.author | Glišić, Isidora |
dc.date.accessioned | 2022-09-29T14:59:07Z |
dc.date.available | 2022-09-29T14:59:07Z |
dc.date.issued | 2022-10-01 |
dc.identifier.uri | http://hdl.handle.net/20.500.12537/281 |
dc.description | The Icelandic Dyslexia Error Corpus (IceDEC) is a collection of texts in modern Icelandic, written by people who have been diagnosed with dyslexia. They have been annotated for mistakes related to spelling, grammar, and other issues. Each mistake is marked according to error type using an error code, of which there are 253. The corpus consists of 35 files with 8,436 categorized error instances. Íslenska lesblinduvillumálheildin er safn texta á nútímaíslensku sem hafa verið skrifaðir af einstaklingum sem greindir hafa verið með dyslexíu. Villur hafa verið merktar, t.d. hvað varðar stafsetningu, málfræði og fleira. Hver villa í textanum er flokkuð með hjálp villukóða, en 253 villukóðar eru notaðir í málheildinni. Málheildin samanstendur af 35 textum með 8.436 flokkuðum villutilvikum. |
dc.language.iso | isl |
dc.publisher | University of Iceland |
dc.relation.replaces | http://hdl.handle.net/20.500.12537/132 |
dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ |
dc.rights.label | PUB |
dc.source.uri | https://github.com/antonkarl/iceErrorCorpusSpecialized |
dc.subject | error corpus |
dc.subject | grammatical errors |
dc.subject | spelling errors |
dc.subject | dyslexia |
dc.title | The Icelandic Dyslexia Error Corpus 1.2 (22.10) |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | Clarin IS Repository |
contact.person | Anton Karl Ingason anton.karl.ingason@gmail.com University of Iceland |
sponsor | Ministry of Education, Science and Culture Specialized error corpora (L2) Language Technology for Icelandic 2019-2023 nationalFunds |
size.info | 8436 other |
files.size | 419490 |
files.count | 1 |
Files in this item
This item is
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution 4.0 International (CC BY 4.0)
- Name
- iceErrorCorpusDyslexia.zip
- Size
- 409.66 KB
- Format
- application/zip
- Description
- A zip file containing the error corpus along with relevant information.
- MD5
- 533defb4e0e9d138661081ae2d181d61
- iceErrorCorpusDyslexia
- txt
- corrected
- texti007.txt2 kB
- texti025.txt5 kB
- texti012.txt5 kB
- texti017.txt2 kB
- texti004.txt1 kB
- texti022.txt4 kB
- texti009.txt6 kB
- texti014.txt1 kB
- texti001.txt2 kB
- texti019.txt20 kB
- texti006.txt4 kB
- texti024.txt4 kB
- texti011.txt1 kB
- texti016.txt2 kB
- texti003.txt2 kB
- texti021.txt23 kB
- texti008.txt17 kB
- texti026.txt977 B
- texti013.txt1 kB
- texti018.txt18 kB
- texti005.txt6 kB
- texti023.txt1 kB
- texti010.txt1 kB
- texti015.txt3 kB
- texti002.txt2 kB
- texti020.txt15 kB
- original
- texti007.txt2 kB
- texti025.txt5 kB
- texti012.txt6 kB
- texti017.txt2 kB
- texti004.txt1 kB
- texti022.txt4 kB
- texti009.txt6 kB
- texti014.txt1 kB
- texti001.txt2 kB
- texti019.txt20 kB
- texti006.txt4 kB
- texti024.txt4 kB
- texti011.txt2 kB
- texti016.txt2 kB
- texti003.txt2 kB
- texti021.txt23 kB
- texti008.txt16 kB
- texti026.txt902 B
- texti013.txt1 kB
- texti018.txt18 kB
- texti005.txt5 kB
- texti023.txt1 kB
- texti010.txt1 kB
- texti015.txt3 kB
- texti002.txt2 kB
- texti020.txt15 kB
- corrected
- README.md3 kB
- data
- texti019.xml131 kB
- texti006.xml21 kB
- texti024.xml63 kB
- texti011.xml22 kB
- texti029.xml44 kB
- texti016.xml13 kB
- texti034.xml15 kB
- texti003.xml27 kB
- texti021.xml153 kB
- texti008.xml122 kB
- texti026.xml15 kB
- texti013.xml13 kB
- texti031.xml8 kB
- texti018.xml104 kB
- texti005.xml42 kB
- texti023.xml12 kB
- texti010.xml15 kB
- texti028.xml52 kB
- texti015.xml39 kB
- texti033.xml201 kB
- texti002.xml16 kB
- texti020.xml90 kB
- texti007.xml7 kB
- texti025.xml85 kB
- texti012.xml55 kB
- texti030.xml24 kB
- texti017.xml13 kB
- texti035.xml111 kB
- texti004.xml10 kB
- texti022.xml24 kB
- texti009.xml48 kB
- texti027.xml93 kB
- texti014.xml12 kB
- texti032.xml10 kB
- texti001.xml11 kB
- errorCodes.tsv30 kB
- txt