Files in this item

 Download all files in item (549.51 KB)
This item is publicly available and licensed under the MIT License (MIT).
Name: Tokenizer-3.5.1.tar.gz
Size: 265.82 KB
Format: application/gzip
Description: Tokenizer source code
MD5: ddcd40021d8f18831e13e5046a7d7a16
File preview:
  • Tokenizer-3.5.1
    • src
      • tokenizer
        • abbrev.py (13 kB)
        • main.py (9 kB)
        • definitions.py (28 kB)
        • tokenizer.py (128 kB)
        • __init__.py (2 kB)
        • Abbrev.conf (46 kB)
    • .gitignore (1 kB)
    • README.rst (41 kB)
    • pyproject.toml (2 kB)
    • CLAUDE.md (3 kB)
    • test
      • toktest_edgecases.txt (6 kB)
      • toktest_large_gold_perfect.txt (576 kB)
      • test_composite_glyphs.py (8 kB)
      • test_helper_functions.py (2 kB)
      • Overview.txt (34 kB)
      • toktest_large_gold_acceptable.txt (583 kB)
      • toktest_edgecases_gold_expected.txt (6 kB)
      • test_index_calculation.py (22 kB)
      • toktest_edgecases_diff.txt (751 B)
      • test_cli.py (7 kB)
      • toktest_sentences.txt (21 kB)
      • test_abbrev.py (984 B)
      • toktest_large.txt (559 kB)
      • test_tokenizer.py (103 kB)
      • toktest_normal.txt (12 kB)
      • toktest_normal_gold_expected.txt (13 kB)
      • example.txt (3 kB)
      • test_detokenize.py (2 kB)
      • test_tokenizer_tok.py (18 kB)
    • .github
    • LICENSE.txt (1 kB)
    • perf.py (977 B)
    • MANIFEST.in (104 B)
    • pax_global_header (52 B)
Name: Tokenizer-3.5.1.zip
Size: 283.69 KB
Format: application/zip
Description: Tokenizer source code
MD5: 9896d654bb1fccaa8e5760fb34b49843
File preview:
  • Tokenizer-3.5.1
    • src
      • tokenizer
        • abbrev.py (13 kB)
        • main.py (9 kB)
        • definitions.py (28 kB)
        • tokenizer.py (128 kB)
        • __init__.py (2 kB)
        • Abbrev.conf (46 kB)
    • .gitignore (1 kB)
    • README.rst (41 kB)
    • pyproject.toml (2 kB)
    • CLAUDE.md (3 kB)
    • test
      • toktest_edgecases.txt (6 kB)
      • toktest_large_gold_perfect.txt (576 kB)
      • test_composite_glyphs.py (8 kB)
      • test_helper_functions.py (2 kB)
      • Overview.txt (34 kB)
      • toktest_large_gold_acceptable.txt (583 kB)
      • toktest_edgecases_gold_expected.txt (6 kB)
      • test_index_calculation.py (22 kB)
      • toktest_edgecases_diff.txt (751 B)
      • test_cli.py (7 kB)
      • toktest_sentences.txt (21 kB)
      • test_abbrev.py (984 B)
      • toktest_large.txt (559 kB)
      • test_tokenizer.py (103 kB)
      • toktest_normal.txt (12 kB)
      • toktest_normal_gold_expected.txt (13 kB)
      • example.txt (3 kB)
      • test_detokenize.py (2 kB)
      • test_tokenizer_tok.py (18 kB)
    • .github
    • LICENSE.txt (1 kB)
    • perf.py (977 B)
    • MANIFEST.in (104 B)
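
The MD5 values listed for the two archives can be used to check a download for corruption. The following is a minimal sketch, not part of the item itself: it assumes the archives were saved under their listed names, and the helper names `md5_of_file` and `verify_downloads` are hypothetical, introduced only for illustration.

```python
import hashlib
from pathlib import Path

# Checksums copied from the listing above.
EXPECTED_MD5 = {
    "Tokenizer-3.5.1.tar.gz": "ddcd40021d8f18831e13e5046a7d7a16",
    "Tokenizer-3.5.1.zip": "9896d654bb1fccaa8e5760fb34b49843",
}


def md5_of_file(path, chunk_size=65536):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_downloads(directory="."):
    """Map each listed archive found in `directory` to True if its MD5 matches."""
    results = {}
    for name, expected in EXPECTED_MD5.items():
        path = Path(directory) / name
        if path.is_file():  # archives that were not downloaded are skipped
            results[name] = md5_of_file(path) == expected
    return results


if __name__ == "__main__":
    for name, ok in verify_downloads().items():
        print(f"{name}: {'OK' if ok else 'MISMATCH'}")
```

Run from the download directory, this prints one line per archive it finds. MD5 only detects accidental corruption here; it should not be treated as proof of authenticity.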