What's New
corpus
Description:
A question answering dataset intended to measure a large language model's knowledge of Icelandic culture and history and its ability to answer questions correctly.
The dataset is split into two parts, a gold corpus and ...
This item contains 2 files (12.06
MB).
Publicly Available
toolService
Description:
Icelandic Gigaword Corpus JSONL Converter is a tool for converting the unannotated version of the Icelandic Gigaword Corpus (IGC; http://hdl.handle.net/20.500.12537/253) to JSONL format. The converter takes in original XML ...
This item contains 2 files (12.83
KB).
Publicly Available
corpus
Description:
This package contains those subcorpora of the Icelandic Gigaword Corpus, version
22.10 (http://hdl.handle.net/20.500.12537/253), that have been published with an
restricted licence, in a jsonl format, which is suitable ...
This item contains 2 files (2.91
GB).
Publicly Available