###################################################################################### ### Icelandic Gigaword Corpus: Parliamentary corpus, IGC-Parla-2410ext [IGC-Parla] ### ### http://hdl.handle.net/20.500.12537/354 ### ###################################################################################### [DESCRIPTION] IGC-Parla is a part of the IGC-project (Icelandic Gigaword corpus) that aims to collect as much as possible of Icelandic texts that can be published under an open licence. IGC-Parla contains parliamentary speeches that have been encoded according to the Parla-CLARIN recommendations [https://github.com/clarin-eric/parla-clarin]. The corpus is published in two versions. IGC-Parla contains plain text while IGC-Parla.ana is a linguastically marked-up version. This version, 2410ext, is an extension to the version 22.10 and only contains texts from the years 2021 to 2023. This corpus contains the untokenized and unannotated version of IGC-Parla, where each paragraph is contained inside of a tag. The annotated version can be found here: http://hdl.handle.net/20.500.12537/355. Further information about IGC is available at http://igc.arnastofnun.is. [LICENCE] http://creativecommons.org/licenses/by/4.0/ [PUBLISHER] Árni Magnússon Institute for Icelandic Studies. [STATISTICS] 345 TEI-files 100709 paragraphs