Files in this item
Download all files in item (549.61 KB)
- Name: Tokenizer-3.5.3.tar.gz
- Size: 265.82 KB
- Format: application/gzip
- Description: Unknown
- MD5: 502d504f47732f7089b736f817978496
- Tokenizer-3.5.3
  - src
    - tokenizer
      - abbrev.py (13 kB)
      - main.py (9 kB)
      - definitions.py (28 kB)
      - tokenizer.py (128 kB)
      - __init__.py (2 kB)
      - Abbrev.conf (46 kB)
  - README.md (34 kB)
  - .gitignore (1 kB)
  - pyproject.toml (2 kB)
  - CLAUDE.md (3 kB)
  - test
    - toktest_edgecases.txt (6 kB)
    - toktest_large_gold_perfect.txt (576 kB)
    - test_composite_glyphs.py (8 kB)
    - test_helper_functions.py (2 kB)
    - Overview.txt (34 kB)
    - toktest_large_gold_acceptable.txt (583 kB)
    - toktest_edgecases_gold_expected.txt (6 kB)
    - test_index_calculation.py (22 kB)
    - toktest_edgecases_diff.txt (751 B)
    - test_cli.py (7 kB)
    - toktest_sentences.txt (21 kB)
    - test_abbrev.py (984 B)
    - toktest_large.txt (559 kB)
    - test_tokenizer.py (103 kB)
    - toktest_normal.txt (12 kB)
    - toktest_normal_gold_expected.txt (13 kB)
    - example.txt (3 kB)
    - test_detokenize.py (2 kB)
    - test_tokenizer_tok.py (18 kB)
  - .github
    - workflows
      - python-package.yml (1 kB)
  - LICENSE.txt (1 kB)
  - perf.py (977 B)
  - MANIFEST.in (104 B)
- pax_global_header (52 B)

- Name: Tokenizer-3.5.3.zip
- Size: 283.79 KB
- Format: application/zip
- Description: Unknown
- MD5: 1199c654b1a8e4fe437c9a2e03f48577
- Tokenizer-3.5.3
  - src
    - tokenizer
      - abbrev.py (13 kB)
      - main.py (9 kB)
      - definitions.py (28 kB)
      - tokenizer.py (128 kB)
      - __init__.py (2 kB)
      - Abbrev.conf (46 kB)
  - README.md (34 kB)
  - .gitignore (1 kB)
  - pyproject.toml (2 kB)
  - CLAUDE.md (3 kB)
  - test
    - toktest_edgecases.txt (6 kB)
    - toktest_large_gold_perfect.txt (576 kB)
    - test_composite_glyphs.py (8 kB)
    - test_helper_functions.py (2 kB)
    - Overview.txt (34 kB)
    - toktest_large_gold_acceptable.txt (583 kB)
    - toktest_edgecases_gold_expected.txt (6 kB)
    - test_index_calculation.py (22 kB)
    - toktest_edgecases_diff.txt (751 B)
    - test_cli.py (7 kB)
    - toktest_sentences.txt (21 kB)
    - test_abbrev.py (984 B)
    - toktest_large.txt (559 kB)
    - test_tokenizer.py (103 kB)
    - toktest_normal.txt (12 kB)
    - toktest_normal_gold_expected.txt (13 kB)
    - example.txt (3 kB)
    - test_detokenize.py (2 kB)
    - test_tokenizer_tok.py (18 kB)
  - .github
    - workflows
      - python-package.yml (1 kB)
  - LICENSE.txt (1 kB)
  - perf.py (977 B)
  - MANIFEST.in (104 B)