dc.contributor.author | Arnardóttir, Þórunn |
dc.contributor.author | Ingason, Anton Karl |
dc.date.accessioned | 2020-04-22T20:25:52Z |
dc.date.available | 2020-04-22T20:25:52Z |
dc.date.issued | 2020-04-22 |
dc.identifier.uri | http://hdl.handle.net/20.500.12537/18 |
dc.description | NeuralMIcePaHC (Neural Machine-Parsed IcePaHC) is a machine-parsed treebank of Icelandic sagas, with texts dating from the 13th to the 19th century. The texts were parsed with the IceNeuralParsingPipeline, a pipeline that combines an Icelandic model of the Berkeley Neural Parser with pre- and postprocessing steps. Because the parser was trained on IcePaHC, the treebank follows the IcePaHC annotation scheme, except that it does not include empty phrases or lemmas. The treebank comprises 43 sagas, with a total of 1,835,693 words and 188,426 matrix clauses. |
dc.language.iso | isl |
dc.publisher | Háskóli Íslands |
dc.relation.isreplacedby | http://hdl.handle.net/20.500.12537/20 |
dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ |
dc.rights.label | PUB |
dc.source.uri | https://github.com/antonkarl/micepahc |
dc.subject | treebank |
dc.subject | machine parsing |
dc.subject | phrase structure grammar |
dc.subject | parsing |
dc.subject | neural parsing |
dc.subject | historical corpus |
dc.subject | parsed corpus |
dc.subject | icepahc |
dc.title | NeuralMIcePaHC 20.04 |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | Clarin IS Repository |
contact.person | Þórunn Arnardóttir tha86@hi.is Háskóli Íslands |
size.info | 1835693 tokens |
files.size | 13161899 |
files.count | 1 |
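The word and matrix-clause totals in the description can be approximated directly from the .psd files. The following is a minimal sketch, not the authors' counting procedure. It assumes that the files use Penn-style labelled bracketing as in IcePaHC, that matrix clauses carry the label IP-MAT (optionally with dash extensions), and that ID nodes and the punctuation tags "." and "," should not be counted as words. The function names and the ignore list are hypothetical, so the results need not match the reported figures exactly.

```python
import re
from pathlib import Path

# Split labelled bracketing into "(", ")" and bare symbols (labels or words).
TOKEN = re.compile(r"[()]|[^\s()]+")


def count_words_and_clauses(psd_text, clause_label="IP-MAT",
                            ignore_tags=("ID", ".", ",")):
    """Return (word_count, clause_count) for one labelled-bracketing string."""
    tokens = TOKEN.findall(psd_text)
    words = clauses = 0
    for i, tok in enumerate(tokens):
        if i == 0 or tok in ("(", ")"):
            continue
        prev = tokens[i - 1]
        if prev == "(":
            # A symbol right after "(" is a node label.
            if tok == clause_label or tok.startswith(clause_label + "-"):
                clauses += 1
        elif (i >= 2 and tokens[i - 2] == "(" and prev != ")"
              and i + 1 < len(tokens) and tokens[i + 1] == ")"):
            # "( TAG word )" is a terminal; skip tags assumed not to be words.
            if prev not in ignore_tags:
                words += 1
    return words, clauses


def corpus_totals(psd_dir):
    """Sum the counts over every visible .psd file in a directory (UTF-8 assumed)."""
    total_words = total_clauses = 0
    for path in sorted(Path(psd_dir).glob("*.psd")):
        if path.name.startswith("."):
            continue  # skip macOS metadata entries such as ._*.psd
        w, c = count_words_and_clauses(path.read_text(encoding="utf-8"))
        total_words += w
        total_clauses += c
    return total_words, total_clauses
```

After unpacking the archive, corpus_totals("NeuralMIcePaHC/psd") would return the two totals under these assumptions. Passing ignore_tags=() counts every terminal, including punctuation.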
Files in this item
This item is Publicly Available and licensed under: Creative Commons - Attribution 4.0 International (CC BY 4.0)
Name | NeuralMIcePaHC.zip |
Size | 12.55 MB |
Format | application/zip |
Description | zip file containing the parsed files and the raw text files |
MD5 | 45e4c5dd24874a5b8a9f042ca4339b83 |
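The MD5 value listed for NeuralMIcePaHC.zip can be used to check that a downloaded copy is intact. The following is a minimal sketch using Python's standard hashlib module. The function names and the local file path are hypothetical, and only the expected digest comes from this record.

```python
import hashlib

# Digest listed in this record for NeuralMIcePaHC.zip.
EXPECTED_MD5 = "45e4c5dd24874a5b8a9f042ca4339b83"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_record(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest equals the expected value."""
    return md5_of_file(path).lower() == expected.lower()
```

For example, matches_record("NeuralMIcePaHC.zip") should return True for an uncorrupted download, assuming the file was saved under that name in the working directory.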
- NeuralMIcePaHC
  - psd
    - 1390.graenlendinga.nar-sag.psd (1 B)
    - 1400.vatnsdaela.nar-sag.psd (1 B)
    - 1400.gisla.nar-sag.psd (1 B)
    - 1400.reykdaela.nar-sag.psd (1 B)
    - 1639.thorsteinshvita.nar-sag.psd (1 B)
    - 1400.ljosvetninga.nar-sag.psd (1 B)
    - 1425.vopnfirdinga.nar-sag.psd (1 B)
    - 1275.laxdaela.nar-sag.psd (1 B)
    - 1300.heidarviga.nar-sag.psd (1 B)
    - 1824.hrana.nar-sag.psd (1 B)
    - 1350.finnboga.nar-sag.psd (1 B)
    - 1325.gunnlaugs.nar-sag.psd (1 B)
    - 1300.eyrbyggja.nar-sag.psd (1 B)
    - 1375.bjarnar.nar-sag.psd (1 B)
    - 1400.hardar.nar-sag.psd (1 B)
    - 1250.egils.nar-sag.psd (1 B)
    - 1350.viga.nar-sag.psd (1 B)
    - 1350.kormaks.nar-sag.psd (1 B)
    - 1350.hallfredarmodruvallabok.nar-sag.psd (1 B)
    - 1350.bandamanna.nar-sag.psd (1 B)
    - 1390.hallfredarolafs.nar-sag.psd (1 B)
    - 1500.viglundar.nar-sag.psd (1 B)
    - 1450.svarfdaela.nar-sag.psd (1 B)
    - 1390.faereyinga.nar-sag.psd (1 B)
    - 1305.fostbraedra.nar-sag.psd (1 B)
    - 1400.floamanna.nar-sag.psd (1 B)
    - 1400.thordar.nar-sag.psd (1 B)
    - 1640.valla.nar-sag.psd (1 B)
    - 1625.fljotsdaela.nar-sag.psd (1 B)
    - 1650.havardar.nar-sag.psd (1 B)
    - 1500.haensna.nar-sag.psd (1 B)
    - 1300.njals.nar-sag.psd (1 B)
    - 1500.hrafnkels.nar-sag.psd (1 B)
    - 1350.droplaugarsona.nar-sag.psd (1 B)
    - 1400.bardar.nar-sag.psd (1 B)
    - 1650.gunnars.nar-sag.psd (1 B)
    - 1305.eiriks.nar-sag.psd (1 B)
    - 1400.gull.nar-sag.psd (1 B)
    - 1700.thorsteins.nar-sag.psd (1 B)
    - 1500.grettis.nar-sag.psd (1 B)
    - 1390.graenlendingath.nar-sag.psd (1 B)
    - 1475.kroka.nar-sag.psd (1 B)
    - 1475.kjalnesinga.nar-sag.psd (1 B)
  - txt
    - 1325.gunnlaugs.nar-sag.txt (1 B)
    - 1300.eyrbyggja.nar-sag.txt (1 B)
    - 1375.bjarnar.nar-sag.txt (1 B)
    - 1400.hardar.nar-sag.txt (1 B)
    - 1250.egils.nar-sag.txt (1 B)
    - 1350.kormaks.nar-sag.txt (1 B)
    - 1350.viga.nar-sag.txt (1 B)
    - 1350.hallfredarmodruvallabok.nar-sag.txt (1 B)
    - 1350.bandamanna.nar-sag.txt (1 B)
    - 1390.hallfredarolafs.nar-sag.txt (1 B)
    - 1500.viglundar.nar-sag.txt (1 B)
    - 1450.svarfdaela.nar-sag.txt (1 B)
    - 1390.faereyinga.nar-sag.txt (1 B)
    - 1305.fostbraedra.nar-sag.txt (1 B)
    - 1400.floamanna.nar-sag.txt (1 B)
    - 1400.thordar.nar-sag.txt (1 B)
    - 1625.fljotsdaela.nar-sag.txt (1 B)
    - 1640.valla.nar-sag.txt (1 B)
    - 1650.havardar.nar-sag.txt (1 B)
    - 1300.njals.nar-sag.txt (1 B)
    - 1500.haensna.nar-sag.txt (1 B)
    - 1500.hrafnkels.nar-sag.txt (1 B)
    - 1350.droplaugarsona.nar-sag.txt (1 B)
    - 1400.bardar.nar-sag.txt (1 B)
    - 1650.gunnars.nar-sag.txt (1 B)
    - 1305.eiriks.nar-sag.txt (1 B)
    - 1400.gull.nar-sag.txt (1 B)
    - 1700.thorsteins.nar-sag.txt (1 B)
    - 1500.grettis.nar-sag.txt (1 B)
    - 1390.graenlendingath.nar-sag.txt (1 B)
    - 1475.kroka.nar-sag.txt (1 B)
    - 1475.kjalnesinga.nar-sag.txt (1 B)
    - 1390.graenlendinga.nar-sag.txt (1 B)
    - 1400.vatnsdaela.nar-sag.txt (1 B)
    - 1400.gisla.nar-sag.txt (1 B)
    - 1400.reykdaela.nar-sag.txt (1 B)
    - 1639.thorsteinshvita.nar-sag.txt (1 B)
    - 1400.ljosvetninga.nar-sag.txt (1 B)
    - 1425.vopnfirdinga.nar-sag.txt (1 B)
    - 1275.laxdaela.nar-sag.txt (1 B)
    - 1300.heidarviga.nar-sag.txt (1 B)
    - 1824.hrana.nar-sag.txt (1 B)
    - 1350.finnboga.nar-sag.txt (1 B)
- macOS metadata entries: .DS_Store and AppleDouble ._* companions of the files and folders listed above (1 B each), which carry no corpus content
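The file names in the archive appear to follow the pattern year.title.genre-subgenre.extension, for example 1250.egils.nar-sag.psd. The following is a minimal sketch for splitting such names into fields and counting files per hundred-year period while skipping macOS metadata entries. The field names, the regular expression and the interpretation of the leading number as a year associated with the text are assumptions inferred from the listing, not a documented specification.

```python
import re
from collections import Counter
from typing import NamedTuple, Optional

# Assumed pattern: <year>.<title>.<genre>-<subgenre>.<psd|txt>
NAME = re.compile(
    r"^(?P<year>\d{4})\.(?P<title>[a-z]+)\."
    r"(?P<genre>[a-z]+)-(?P<subgenre>[a-z]+)\.(?P<ext>psd|txt)$"
)


class SagaFile(NamedTuple):
    year: int
    title: str
    genre: str
    subgenre: str
    ext: str


def parse_name(filename: str) -> Optional[SagaFile]:
    """Parse one file name; return None for names that do not fit the pattern."""
    match = NAME.match(filename)
    if match is None:
        return None  # e.g. .DS_Store or AppleDouble "._" entries
    return SagaFile(int(match["year"]), match["title"],
                    match["genre"], match["subgenre"], match["ext"])


def files_per_period(filenames, ext="psd"):
    """Count files with the given extension per hundred-year period (1200, 1300, ...)."""
    counts = Counter()
    for name in filenames:
        info = parse_name(name)
        if info is not None and info.ext == ext:
            counts[info.year // 100 * 100] += 1
    return dict(counts)
```

Applied to the member names of the unpacked archive, files_per_period gives a quick overview of how the sagas are distributed over time under the assumed naming pattern.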