dc.contributor.author | Xu, Xindan |
dc.contributor.author | Ingason, Anton Karl |
dc.contributor.author | Kolka, Veronika Teresa |
dc.contributor.author | Kovalova, Alesia |
dc.contributor.author | Kristínardóttir, Iðunn |
dc.date.accessioned | 2023-04-11T10:16:08Z |
dc.date.available | 2023-04-11T10:16:08Z |
dc.date.issued | 2023-01-01 |
dc.identifier.uri | http://hdl.handle.net/20.500.12537/308 |
dc.description | IceFlash 4K contains a multilingual dataset with the 4,000 most common Icelandic words according to the Tagged Icelandic Corpus, along with a printable pdf-version and a digital Anki-version of the flashcards. Currently, the flashcards are available in 4 language versions: English, Chinese, Polish, Ukrainian. The dataset contains a variety of information about each vocabulary item, such as its frequency and rank in the corpus, part-of-speech tag, English/Polish/Ukranian/Chinese translation, a sample sentence to show the usage of the word in context, phonetic transcription, and selected conjugation forms in respect to its word category. IceFlash 4K er fjölmála gagnagrunnur með 4.000 algengustu orðum íslenskrar tungu samkvæmt Markaðri íslenskri málheild, ásamt leifturminniskortum bæði á PDF-sniði sem hægt er að prenta út og Anki-sniði. Minniskortin eru til á fjórum tungumálum: ensku, kínversku, pólsku og úkraínsku. Gagnagrunnurinn er samsettur af fjölbreyttum upplýsingum um orðin, t.d. tíðni og tíðnaröð í MÍM, marki (e. tag), enskri/pólskri/úkraínskri/kínverskri þýðingu, setningu sem sýnir notkun orðsins í samhengi, hljóðritun og hljóðskrá, og ákveðnum beygingarmyndum sem fara eftir orðflokkun. |
dc.language.iso | isl |
dc.language.iso | eng |
dc.language.iso | zho |
dc.language.iso | pol |
dc.language.iso | ukr |
dc.publisher | University of Iceland |
dc.relation.isreferencedby | https://aclanthology.org/2021.nlp4call-1.5 |
dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ |
dc.rights.label | PUB |
dc.source.uri | https://github.com/antonkarl/iceFlash4K |
dc.subject | flashcards |
dc.subject | multilingual dataset |
dc.subject | second language learning |
dc.title | Multilingual Flashcards with 4,000 Most Common Icelandic Words (IceFlash4K) |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | Clarin IS Repository |
contact.person | Xindan Xu xindanxu@hi.is University of Iceland |
contact.person | Anton Karl Ingason antoni@hi.is University of Iceland |
sponsor | University of Iceland Áslaug Hafliðadóttir Memorial Fund Developing Flashcards for Icelandic as a Second Language nationalFunds |
size.info | 4000 entries |
files.size | 76700911 |
files.count | 1 |
Files in this item
This item is
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution 4.0 International (CC BY 4.0)
- Name
- IceFlash4K-flashcards.zip
- Size
- 73.15 MB
- Format
- application/zip
- Description
- A zip file containing all relevant files
- MD5
- 4483e3a81fc0ec52b8aa69e38005845d
- IceFlash4K-flashcards
- README.md2 kB
- cc-by-4-0.txt18 kB
- flashcards
- pdf
- en
- flash_4k.pdf623 kB
- flash_4k.tex1015 kB
- flash_3k.pdf569 kB
- flash_3k.tex913 kB
- flash_2k.pdf557 kB
- flash_2k.tex885 kB
- flash_1k.pdf496 kB
- flash_1k.tex790 kB
- ukr
- flash_4K.pdf653 kB
- flash_4K.tex1 MB
- flash_3K.pdf599 kB
- flash_3K.tex949 kB
- flash_2K.pdf588 kB
- flash_2K.tex924 kB
- flash_1K.pdf528 kB
- flash_1K.tex828 kB
- pl
- flash_4k.pdf626 kB
- flash_4k.tex1 MB
- flash_3k.pdf572 kB
- flash_3k.tex923 kB
- flash_2k.pdf558 kB
- flash_2k.tex894 kB
- flash_1k.pdf498 kB
- flash_1k.tex799 kB
- zh
- flash_4k.pdf911 kB
- flash_4k.tex1019 kB
- flash_3k.pdf827 kB
- flash_3k.tex916 kB
- flash_2k.pdf803 kB
- flash_2k.tex888 kB
- flash_1k.pdf703 kB
- flash_1k.tex793 kB
- en
- anki
- IceFlash4k_pl_v2.apkg20 MB
- IceFlash4k_en_v2.apkg20 MB
- IceFlash4k_zh_v2.apkg20 MB
- IceFlash4k_ukr_v2.apkg20 MB
- pdf
- data
- list_4k_zh.tsv1 MB
- list_4k_ukr.tsv1 MB
- list_4k_pl.tsv1 MB
- list_4k_en.tsv1 MB
- document description.md3 kB