dc.contributor.author | Steingrímsson, Steinþór |
dc.contributor.author | Barkarson, Starkaður |
dc.date.accessioned | 2021-09-30T14:14:08Z |
dc.date.available | 2021-09-30T14:14:08Z |
dc.date.issued | 2021-10-01 |
dc.identifier.uri | http://hdl.handle.net/20.500.12537/145 |
dc.description | A realigned and refiltered version of the ParIce corpus, with additional material. |
dc.language.iso | eng |
dc.language.iso | isl |
dc.publisher | The Árni Magnússon Institute for Icelandic Studies |
dc.relation.isreferencedby | https://aclanthology.org/W19-6115/ |
dc.relation.replaces | http://hdl.handle.net/20.500.12537/16 |
dc.rights | Creative Commons - Attribution 4.0 International (CC BY 4.0) |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ |
dc.rights.label | PUB |
dc.source.uri | http://parice.arnastofnun.is/ |
dc.subject | machine translation |
dc.subject | parallel corpus |
dc.subject | aligned sentence pairs |
dc.subject | sentence alignments |
dc.subject | parallel |
dc.subject | parallel data |
dc.title | ParIce: English-Icelandic parallel corpus (21.10) |
dc.type | corpus |
metashare.ResourceInfo#ContentInfo.mediaType | text |
has.files | yes |
branding | Clarin IS Repository |
contact.person | Steinþór Steingrímsson steinthor.steingrimsson@arnastofnun.is The Árni Magnússon Institute for Icelandic Studies |
sponsor | Ministry of Education, Science and Culture (Mennta- og menningamálaráðuneytið) Language Technology for Icelandic 2019-2023 Parallel source material (V2) nationalFunds |
size.info | 5329681 sentences |
files.size | 689674721 |
files.count | 1 |
Files in this item
This item is
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution 4.0 International (CC BY 4.0)
- Name
- ParIce-train.21.10.zip
- Size
- 657.73 MB
- Format
- application/zip
- Description
- Parice-train.21.10
- MD5
- 4369272ac5b4f0888cece5dbd7edf13f
- ParIce-train.21.10
- README4 kB
- paragraphs
- eea.tsv535 MB
- norden.tsv3 MB
- eso.tsv2 MB
- tmx
- bible.tmx20 MB
- opensubtitles.tmx627 MB
- tatoeba.tmx4 MB
- norden.tmx8 MB
- eea.tmx2 GB
- ubuntu.tmx3 MB
- kde4.tmx9 MB
- eso.tmx8 MB
- ted2020.tmx1 MB
- statisl.tmx1 MB
- ema.tmx232 MB
- tsv
- bible.tsv7 MB
- opensubtitles.tsv92 MB
- tatoeba.tsv696 kB
- ted.tsv434 kB
- norden.tsv2 MB
- eea.tsv367 MB
- ubuntu.tsv449 kB
- kde4.tsv2 MB
- eso.tsv2 MB
- statisl.tsv425 kB
- ema.tsv72 MB