Files in this item

 Download all files in item (3.81 GB)
This item is
Publicly Available
and licensed under:
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Icon
Name
doc_distil_is_en.zip
Size
1.91 GB
Format
application/zip
Description
Unknown
MD5
56fc4841f6e314ee426a77972dfe6b6e
 Download file  Preview
 File Preview  
  • doc_distil_is_en
    • fairseq_user_dir
      • watch_sync_ada.sh403 B
      • document_utils.py20 kB
      • make_merged_sentence_testset.py2 kB
      • document_dataset.py874 B
      • fragment_noise.py5 kB
      • sentencepiece_bpe_sampling.py1 kB
      • check_parallel.py3 kB
      • scratch_load.py4 kB
      • document_translation_from_pretrained_bart.py10 kB
      • noised_translation_from_pretrained_bart.py10 kB
      • check_pos_dist.py604 B
      • indexed_parallel_documents_dataset.py19 kB
      • batch_sampler.py1022 B
      • indexed_parallel_bt_documents_dataset.py8 kB
      • noised_sequence.py147 B
      • cached_mmap_jsonl_dataset.py2 kB
      • word_noise.py5 kB
      • __pycache__
        • __init__.cpython-38.pyc288 B
        • sentencepiece_bpe_sampling.cpython-38.pyc1 kB
        • document_translation_from_pretrained_bart.cpython-38.pyc6 kB
      • spm_segmentation_noise.py2 kB
      • check_align.py5 kB
      • check_domain.py1 kB
      • encoders.py1 kB
      • __init__.py143 B
      • noiser.py87 B
    • README.md2 kB
    • dict.en_XX.txt3 MB
    • fairseq_model.pt3 GB
    • requirements.txt52 B
    • dict.is_IS.txt3 MB
    • interactive.sh477 B
    • sentencepiece.bpe.model4 MB
Icon
Name
doc_distil_en_is.zip
Size
1.9 GB
Format
application/zip
Description
Unknown
MD5
f7c1fab1632b2371451bb11edc958d72
 Download file  Preview
 File Preview  
  • doc_distil_en_is
    • fairseq_user_dir
      • watch_sync_ada.sh403 B
      • document_utils.py20 kB
      • make_merged_sentence_testset.py2 kB
      • document_dataset.py874 B
      • fragment_noise.py5 kB
      • sentencepiece_bpe_sampling.py1 kB
      • check_parallel.py3 kB
      • scratch_load.py4 kB
      • document_translation_from_pretrained_bart.py10 kB
      • noised_translation_from_pretrained_bart.py10 kB
      • check_pos_dist.py604 B
      • indexed_parallel_documents_dataset.py19 kB
      • batch_sampler.py1022 B
      • indexed_parallel_bt_documents_dataset.py8 kB
      • noised_sequence.py147 B
      • cached_mmap_jsonl_dataset.py2 kB
      • word_noise.py5 kB
      • __pycache__
        • __init__.cpython-38.pyc292 B
        • sentencepiece_bpe_sampling.cpython-38.pyc1 kB
        • document_translation_from_pretrained_bart.cpython-38.pyc6 kB
      • spm_segmentation_noise.py2 kB
      • check_domain.py1 kB
      • check_align.py5 kB
      • encoders.py1 kB
      • __init__.py143 B
      • noiser.py87 B
    • README.md2 kB
    • fairseq_model.pt3 GB
    • dict.en_XX.txt3 MB
    • requirements.txt52 B
    • interactive.sh477 B
    • dict.is_IS.txt3 MB
    • sentencepiece.bpe.model4 MB