Files in this item

This item is
Publicly Available
and licensed under:
Icelandic Gigaword Corpus Part1
Icon
Name
eng-isl-synthetic-corpus-v1.0.tar.gz
Size
7.01 GB
Format
application/gzip
Description
Backtranslated synthetic En--Is corpus for NMT
MD5
541612f84c6502f721f2ca6e417436b7
 Download file  Preview
 File Preview  
  • eng-isl-synthetic-corpus-v1.0
    • monolingual-eng
      • newscrawl.multi-year.en-is.tsv7 GB
      • enwiki-20161221.en-is.tsv2 GB
      • europarl-v9-en.en-is.tsv609 MB
    • monolingual-isl
      • rmh2018-2
        • visir-rmh2018-2.tsv1 GB
        • ras1_og_ras2-rmh2018-2.tsv254 MB
        • althingi-rmh2018-2.tsv904 MB
        • haestirettur-rmh2018-2.tsv610 MB
        • iswiki-rmh2018-2.tsv87 MB
        • domstolar-rmh2018-2.tsv533 MB
        • sjonvarpid-rmh2018-2.tsv167 MB
      • rmh2018-1
        • morgunbladid-rmh2018-1.tsv2 GB
        • visindavefur-rmh2018-1.tsv60 MB
    • README2 kB