What's New

corpus
Description:
A question answering dataset intended to measure a large language model's knowledge of Icelandic culture and history and its ability to answer questions correctly. The dataset is split into two parts, a gold corpus and ...
 This item contains 2 files (12.06 MB).
 
Publicly Available
toolService
Description:
Icelandic Gigaword Corpus JSONL Converter is a tool for converting the unannotated version of the Icelandic Gigaword Corpus (IGC; http://hdl.handle.net/20.500.12537/253) to JSONL format. The converter takes in original XML ...
 This item contains 2 files (12.83 KB).
 
Publicly Available
corpus
Description:
This package contains those subcorpora of the Icelandic Gigaword Corpus, version 22.10 (http://hdl.handle.net/20.500.12537/253), that have been published with a restricted licence, in JSONL format, which is suitable ...
 This item contains 2 files (2.91 GB).
 
Publicly Available