ENGLISH:
This package contains questions and answers from the corpus 'Texts from the Icelandic Web of Science and the European Web' (http://hdl.handle.net/20.500.12537/361) in JSONL format, which is suitable for LLM training. The dataset is also available on Hugging Face: https://huggingface.co/datasets/arnastofnun/VV_EV.
dc.description
ICELANDIC (translated):
The package contains questions and answers from the corpus 'Textar af Vísindavefnum og Evrópuvefnum' (http://hdl.handle.net/20.500.12537/361) in JSONL format, which is suitable for, among other things, training language models. The dataset is also available on Hugging Face: https://huggingface.co/datasets/arnastofnun/VV_EV.
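Since the package is distributed as JSONL (one JSON object per line), the following is a minimal sketch of how such a file could be read with the Python standard library. The helper name read_jsonl is hypothetical, and no field names are assumed, because the record above does not list them.

```python
import json
from typing import Iterator


def read_jsonl(path: str) -> Iterator[dict]:
    """Yield one parsed JSON object per non-empty line of a JSONL file."""
    # UTF-8 is assumed here so that Icelandic characters decode correctly.
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # skip blank lines between records
                yield json.loads(line)
```

Calling list(read_jsonl("data.jsonl")), where data.jsonl is a placeholder for a file from the package, would return the records as dictionaries. Inspecting the keys of the first record is a reasonable way to discover the actual field names.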
dc.language.iso
isl
dc.publisher
The Árni Magnússon Institute for Icelandic Studies