Show simple item record

 
dc.contributor.author Ingason, Anton Karl
dc.contributor.author Arnardóttir, Þórunn
dc.contributor.author Stefánsdóttir, Lilja Björk
dc.contributor.author Xu, Xindan
dc.date.accessioned 2021-04-21T15:37:37Z
dc.date.available 2021-04-21T15:37:37Z
dc.date.issued 2021-04-26
dc.identifier.uri http://hdl.handle.net/20.500.12537/108
dc.description The Icelandic Child Language Error Corpus (IceCLEC) is a collection of texts in modern Icelandic, written by native speakers of Icelandic of ages 10 to 15. They have been annotated for mistakes related to spelling, grammar, and other issues. Each mistake is marked according to error type using an error code, of which there are 253. The corpus consists of 34 files with 2,293 categorized error instances. Villumálheild íslensks barnamáls er safn texta á nútímaíslensku sem hafa verið skrifaðir af íslenskum móðurmálsmálhöfum á aldrinum 10–15 ára. Villur hafa verið merktar, t.d. hvað varðar stafsetningu, málfræði og fleira. Hver villa í textanum er flokkuð með hjálp villukóða, en 253 villukóðar eru notaðir í málheildinni. Málheildin samanstendur af 34 textum með 2.293 flokkuðum villutilvikum.
dc.language.iso isl
dc.publisher University of Iceland
dc.relation.isreplacedby http://hdl.handle.net/20.500.12537/133
dc.rights Creative Commons - Attribution 4.0 International (CC BY 4.0)
dc.rights.uri https://creativecommons.org/licenses/by/4.0/
dc.rights.label PUB
dc.source.uri https://github.com/antonkarl/iceErrorCorpusSpecialized
dc.subject error corpus
dc.subject grammatical errors
dc.subject spelling errors
dc.subject child language
dc.title The Icelandic Child Language Error Corpus (IceCLEC) Version 1.0
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
has.files yes
branding Clarin IS Repository
contact.person Anton Karl Ingason anton.karl.ingason@gmail.com University of Iceland
sponsor Ministry of Education, Science and Culture Specialized error corpora (L2) Language Technology for Icelandic 2019-2023 nationalFunds
size.info 2293 other
files.size 285488
files.count 1


 Files in this item

This item is
Publicly Available
and licensed under:
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Icon
Name
iceErrorCorpusChildLanguage.zip
Size
278.8 KB
Format
application/zip
Description
A zip file containing the corpus and information on the error codes
MD5
c8d59969d80cc1f4ca6d2f88c6c13873
 Download file  Preview
 File Preview  
  • iceErrorCorpusChildLanguage
    • README.md-1 B
    • data
      • texti015-2006.xml-1 B
      • texti008-2006.xml-1 B
      • texti021-2006.xml-1 B
      • texti030-2008.xml-1 B
      • texti005-2008.xml-1 B
      • texti029-2008.xml-1 B
      • texti014-2006.xml-1 B
      • texti001-2010.xml-1 B
      • texti018-2006.xml-1 B
      • texti024-2006.xml-1 B
      • texti027-2008.xml-1 B
      • texti012-2006.xml-1 B
      • texti033-2008.xml-1 B
      • texti016-2006.xml-1 B
      • texti010-2006.xml-1 B
      • texti022-2006.xml-1 B
      • texti009-2006.xml-1 B
      • texti031-2008.xml-1 B
      • texti006-2008.xml-1 B
      • texti002-2010.xml-1 B
      • texti026-2006.xml-1 B
      • texti007-2006.xml-1 B
      • texti020-2006.xml-1 B
      • texti019-2006.xml-1 B
      • texti025-2006.xml-1 B
      • texti004-2008.xml-1 B
      • texti013-2006.xml-1 B
      • texti028-2008.xml-1 B
      • texti034-2008.xml-1 B
      • texti017-2006.xml-1 B
      • texti011-2006.xml-1 B
      • texti023-2006.xml-1 B
      • texti003-2009.xml-1 B
      • texti032-2008.xml-1 B
    • IEC_ErrorCodes.pdf-1 B
    • errorCodes.tsv-1 B
  • __MACOSX
    • iceErrorCorpusChildLanguage
      • ._README.md-1 B
      • ._IEC_ErrorCodes.pdf-1 B
      • ._data-1 B
      • data
        • ._texti015-2006.xml-1 B
        • ._texti021-2006.xml-1 B
        • ._texti002-2010.xml-1 B
        • ._texti008-2006.xml-1 B
        • ._texti030-2008.xml-1 B
        • ._texti005-2008.xml-1 B
        • ._texti029-2008.xml-1 B
        • ._texti025-2006.xml-1 B
        • ._texti013-2006.xml-1 B
        • ._texti018-2006.xml-1 B
        • ._texti027-2008.xml-1 B
        • ._texti033-2008.xml-1 B
        • ._texti016-2006.xml-1 B
        • ._texti010-2006.xml-1 B
        • ._texti022-2006.xml-1 B
        • ._texti009-2006.xml-1 B
        • ._texti031-2008.xml-1 B
        • ._texti006-2008.xml-1 B
        • ._texti026-2006.xml-1 B
        • ._texti014-2006.xml-1 B
        • ._texti007-2006.xml-1 B
        • ._texti001-2010.xml-1 B
        • ._texti020-2006.xml-1 B
        • ._texti019-2006.xml-1 B
        • ._texti004-2008.xml-1 B
        • ._texti028-2008.xml-1 B
        • ._texti034-2008.xml-1 B
        • ._texti024-2006.xml-1 B
        • ._texti012-2006.xml-1 B
        • ._texti017-2006.xml-1 B
        • ._texti011-2006.xml-1 B
        • ._texti023-2006.xml-1 B
        • ._texti003-2009.xml-1 B
        • ._texti032-2008.xml-1 B
      • ._errorCodes.tsv-1 B
    • ._iceErrorCorpusChildLanguage-1 B

Show simple item record