# NQiI - SQuAD format v. 1.0.0

This is a subset of the NQiI dataset in the same format as the Stanford
Question Answering Dataset. The data is provided both tokenized and not
tokenized. The data is included in json format and jsonl format (one 
json object per line).

The dataset is further described in 
Vésteinn Snæbjarnarson, 2021, Automated methods for Question-Answering in Icelandic, M.Sc. thesis, Faculty of Industrial Engineering, MechanicalEngineering and Computer Science, University of Iceland.

Contact vesteinnsnaebjarnarson@gmail.com for questions about the
dataset.

