Files in this item
Download all files in item (560.28 KB)
- Name: Tokenizer-3.5.5.zip
- Size: 289.69 KB
- Format: application/zip
- Description: Unknown
- MD5: 0bd11287ef446264fe392eda81ee755e
- Tokenizer-3.5.5
  - src
    - tokenizer
      - abbrev.py (13 kB)
      - main.py (9 kB)
      - definitions.py (28 kB)
      - tokenizer.py (135 kB)
      - __init__.py (2 kB)
      - Abbrev.conf (46 kB)
  - README.md (37 kB)
  - .gitignore (1 kB)
  - pyproject.toml (3 kB)
  - CLAUDE.md (3 kB)
  - test
    - toktest_edgecases.txt (6 kB)
    - toktest_large_gold_perfect.txt (576 kB)
    - test_composite_glyphs.py (8 kB)
    - test_helper_functions.py (2 kB)
    - test_dashes.py (14 kB)
    - Overview.txt (34 kB)
    - toktest_large_gold_acceptable.txt (583 kB)
    - toktest_edgecases_gold_expected.txt (6 kB)
    - test_index_calculation.py (22 kB)
    - toktest_edgecases_diff.txt (751 B)
    - test_cli.py (7 kB)
    - toktest_sentences.txt (21 kB)
    - test_abbrev.py (1 kB)
    - toktest_large.txt (559 kB)
    - test_tokenizer.py (103 kB)
    - toktest_normal.txt (12 kB)
    - toktest_normal_gold_expected.txt (13 kB)
    - example.txt (3 kB)
    - test_detokenize.py (2 kB)
    - test_tokenizer_tok.py (18 kB)
  - .github
    - workflows
      - python-package.yml (1 kB)
  - LICENSE.txt (1 kB)
  - perf.py (977 B)
  - MANIFEST.in (104 B)
- Name: Tokenizer-3.5.5.tar.gz
- Size: 270.59 KB
- Format: application/gzip
- Description: Unknown
- MD5: 3cba60656f8c51fa219b051930fd8e33
- Tokenizer-3.5.5
  - src
    - tokenizer
      - abbrev.py (13 kB)
      - main.py (9 kB)
      - definitions.py (28 kB)
      - tokenizer.py (135 kB)
      - __init__.py (2 kB)
      - Abbrev.conf (46 kB)
  - README.md (37 kB)
  - .gitignore (1 kB)
  - pyproject.toml (3 kB)
  - CLAUDE.md (3 kB)
  - test
    - toktest_edgecases.txt (6 kB)
    - toktest_large_gold_perfect.txt (576 kB)
    - test_composite_glyphs.py (8 kB)
    - test_helper_functions.py (2 kB)
    - test_dashes.py (14 kB)
    - Overview.txt (34 kB)
    - toktest_large_gold_acceptable.txt (583 kB)
    - toktest_edgecases_gold_expected.txt (6 kB)
    - test_index_calculation.py (22 kB)
    - toktest_edgecases_diff.txt (751 B)
    - test_cli.py (7 kB)
    - toktest_sentences.txt (21 kB)
    - test_abbrev.py (1 kB)
    - toktest_large.txt (559 kB)
    - test_tokenizer.py (103 kB)
    - toktest_normal.txt (12 kB)
    - toktest_normal_gold_expected.txt (13 kB)
    - example.txt (3 kB)
    - test_detokenize.py (2 kB)
    - test_tokenizer_tok.py (18 kB)
  - .github
    - workflows
      - python-package.yml (1 kB)
  - LICENSE.txt (1 kB)
  - perf.py (977 B)
  - MANIFEST.in (104 B)
- pax_global_header (52 B)