dc.contributor.author | Ingason, Anton Karl |
dc.contributor.author | Arnardóttir, Þórunn |
dc.contributor.author | Stefánsdóttir, Lilja Björk |
dc.contributor.author | Xu, Xindan |
dc.date.accessioned | 2021-04-21T15:37:37Z |
dc.date.available | 2021-04-21T15:37:37Z |
dc.date.issued | 2021-04-26 |
dc.identifier.uri | http://hdl.handle.net/20.500.12537/108 |
dc.description | The Icelandic Child Language Error Corpus (IceCLEC) is a collection of texts in modern Icelandic written by native speakers of Icelandic aged 10 to 15. The texts have been annotated for errors in spelling, grammar, and other issues. Each error is classified by type with an error code; 253 error codes are used in the corpus. The corpus consists of 34 files with 2,293 categorized error instances. |
dc.language.iso | isl |
dc.publisher | University of Iceland |
dc.relation.isreplacedby | http://hdl.handle.net/20.500.12537/133 |
dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ |
dc.rights.label | PUB |
dc.source.uri | https://github.com/antonkarl/iceErrorCorpusSpecialized |
dc.subject | error corpus |
dc.subject | grammatical errors |
dc.subject | spelling errors |
dc.subject | child language |
dc.title | The Icelandic Child Language Error Corpus (IceCLEC) Version 1.0 |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | Clarin IS Repository |
contact.person | Anton Karl Ingason; anton.karl.ingason@gmail.com; University of Iceland |
sponsor | Ministry of Education, Science and Culture; Specialized error corpora (L2); Language Technology for Icelandic 2019-2023; nationalFunds |
size.info | 2293 other |
files.size | 285488 |
files.count | 1 |
Files in this item
This item is Publicly Available and licensed under: Creative Commons - Attribution 4.0 International (CC BY 4.0)
- Name: iceErrorCorpusChildLanguage.zip
- Size: 278.8 KB
- Format: application/zip
- Description: A zip file containing the corpus and information on the error codes
- MD5: c8d59969d80cc1f4ca6d2f88c6c13873
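The MD5 value published above can be used to check a downloaded copy of the archive. The following is a minimal sketch, assuming the zip has been saved under its listed name in the current directory; the function names `md5_of_file` and `matches_published_md5` are illustrative and not part of the corpus distribution.

```python
import hashlib
from pathlib import Path

# MD5 digest as published in this record.
EXPECTED_MD5 = "c8d59969d80cc1f4ca6d2f88c6c13873"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of the file at `path`, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_md5(path, expected=EXPECTED_MD5):
    """True if the file's MD5 digest equals the expected hex string."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed download location; adjust to wherever the archive was saved.
    archive = Path("iceErrorCorpusChildLanguage.zip")
    if archive.exists():
        print("checksum OK" if matches_published_md5(archive) else "checksum MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

MD5 is suitable here only as an integrity check against accidental corruption, which is how the repository uses it.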
- iceErrorCorpusChildLanguage
  - README.md
  - data
    - texti015-2006.xml
    - texti008-2006.xml
    - texti021-2006.xml
    - texti030-2008.xml
    - texti005-2008.xml
    - texti029-2008.xml
    - texti014-2006.xml
    - texti001-2010.xml
    - texti018-2006.xml
    - texti024-2006.xml
    - texti027-2008.xml
    - texti012-2006.xml
    - texti033-2008.xml
    - texti016-2006.xml
    - texti010-2006.xml
    - texti022-2006.xml
    - texti009-2006.xml
    - texti031-2008.xml
    - texti006-2008.xml
    - texti002-2010.xml
    - texti026-2006.xml
    - texti007-2006.xml
    - texti020-2006.xml
    - texti019-2006.xml
    - texti025-2006.xml
    - texti004-2008.xml
    - texti013-2006.xml
    - texti028-2008.xml
    - texti034-2008.xml
    - texti017-2006.xml
    - texti011-2006.xml
    - texti023-2006.xml
    - texti003-2009.xml
    - texti032-2008.xml
  - IEC_ErrorCodes.pdf
  - errorCodes.tsv
- __MACOSX (macOS archive metadata: a "._"-prefixed companion entry for each file and directory listed above)
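Because the archive mixes the 34 corpus texts with macOS metadata entries, it can help to filter the member list before processing. The sketch below relies only on the file-name pattern visible in the listing (texti, a three-digit number, a four-digit year, .xml); the helper names and the exact directory prefixes inside the zip are assumptions, and nothing is assumed about the XML schema itself.

```python
import re
import zipfile
from collections import Counter

# Matches a path component such as "texti015-2006.xml"; the leading (?:^|/)
# keeps "._texti015-2006.xml" companion files from matching.
TEXT_FILE = re.compile(r"(?:^|/)texti(\d{3})-(\d{4})\.xml$")


def is_macos_metadata(name):
    """True for members under __MACOSX or with a '._' prefixed component."""
    parts = name.split("/")
    return "__MACOSX" in parts or any(p.startswith("._") for p in parts)


def corpus_texts(names):
    """Return the sorted corpus text members, dropping macOS metadata."""
    return sorted(
        n for n in names if not is_macos_metadata(n) and TEXT_FILE.search(n)
    )


def texts_per_year(names):
    """Count corpus texts by the four-digit year in their file names."""
    return Counter(TEXT_FILE.search(n).group(2) for n in corpus_texts(names))


def list_archive(path):
    """Return all member names of the zip archive at `path`."""
    with zipfile.ZipFile(path) as archive:
        return archive.namelist()
```

A typical use would be `texts_per_year(list_archive("iceErrorCorpusChildLanguage.zip"))`, with the path adjusted to the local download location.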