tok0.2.1 package

Fast Text Tokenization

Interfaces with the 'Hugging Face' tokenizers library to provide implementations of today's most used tokenizers such as the 'Byte-Pair Encoding' algorithm <https://huggingface.co/docs/tokenizers/index>. It's extremely fast for both training new vocabularies and tokenizing texts.

  • Maintainer: Daniel Falbel
  • License: MIT + file LICENSE
  • Last published: 2025-09-30