Files in this item
Download all files in item (3.81 GB)This item is
Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution-ShareAlike 4.0 International (CC BY-SA 4.0)
- Name
- doc_distil_is_en.zip
- Size
- 1.91 GB
- Format
- application/zip
- Description
- Unknown
- MD5
- 56fc4841f6e314ee426a77972dfe6b6e
- doc_distil_is_en
- fairseq_user_dir
- watch_sync_ada.sh403 B
- document_utils.py20 kB
- make_merged_sentence_testset.py2 kB
- document_dataset.py874 B
- fragment_noise.py5 kB
- sentencepiece_bpe_sampling.py1 kB
- check_parallel.py3 kB
- scratch_load.py4 kB
- document_translation_from_pretrained_bart.py10 kB
- noised_translation_from_pretrained_bart.py10 kB
- check_pos_dist.py604 B
- indexed_parallel_documents_dataset.py19 kB
- batch_sampler.py1022 B
- indexed_parallel_bt_documents_dataset.py8 kB
- noised_sequence.py147 B
- cached_mmap_jsonl_dataset.py2 kB
- word_noise.py5 kB
- __pycache__
- __init__.cpython-38.pyc288 B
- sentencepiece_bpe_sampling.cpython-38.pyc1 kB
- document_translation_from_pretrained_bart.cpython-38.pyc6 kB
- spm_segmentation_noise.py2 kB
- check_align.py5 kB
- check_domain.py1 kB
- encoders.py1 kB
- __init__.py143 B
- noiser.py87 B
- README.md2 kB
- dict.en_XX.txt3 MB
- fairseq_model.pt3 GB
- requirements.txt52 B
- dict.is_IS.txt3 MB
- interactive.sh477 B
- sentencepiece.bpe.model4 MB
- fairseq_user_dir
- Name
- doc_distil_en_is.zip
- Size
- 1.9 GB
- Format
- application/zip
- Description
- Unknown
- MD5
- f7c1fab1632b2371451bb11edc958d72
- doc_distil_en_is
- fairseq_user_dir
- watch_sync_ada.sh403 B
- document_utils.py20 kB
- make_merged_sentence_testset.py2 kB
- document_dataset.py874 B
- fragment_noise.py5 kB
- sentencepiece_bpe_sampling.py1 kB
- check_parallel.py3 kB
- scratch_load.py4 kB
- document_translation_from_pretrained_bart.py10 kB
- noised_translation_from_pretrained_bart.py10 kB
- check_pos_dist.py604 B
- indexed_parallel_documents_dataset.py19 kB
- batch_sampler.py1022 B
- indexed_parallel_bt_documents_dataset.py8 kB
- noised_sequence.py147 B
- cached_mmap_jsonl_dataset.py2 kB
- word_noise.py5 kB
- __pycache__
- __init__.cpython-38.pyc292 B
- sentencepiece_bpe_sampling.cpython-38.pyc1 kB
- document_translation_from_pretrained_bart.cpython-38.pyc6 kB
- spm_segmentation_noise.py2 kB
- check_domain.py1 kB
- check_align.py5 kB
- encoders.py1 kB
- __init__.py143 B
- noiser.py87 B
- README.md2 kB
- fairseq_model.pt3 GB
- dict.en_XX.txt3 MB
- requirements.txt52 B
- interactive.sh477 B
- dict.is_IS.txt3 MB
- sentencepiece.bpe.model4 MB
- fairseq_user_dir