#tokenizer #token #words #nlp #tokenize #text

nipah_tokenizer

A powerful yet simple text tokenizer for your everyday needs!

1 unstable release

0.1.0 Jan 14, 2023

#1671 in Text processing

Custom license

29KB
730 lines

How To Use

  1. Obtain the default options (or create your own configuration):
let options = nipah_tokenizer::options::default();
  2. Get some text:
let text = "Hello, World!".to_string();
  3. Tokenize your text!
use nipah_tokenizer::tokenizer::tokenize;

let tokens = tokenize(text, &options);

The output will look like:

["Hello": Id] [Comma] ["World": Id] [Exclamation]
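
To make the shape of that output concrete, here is a minimal, self-contained sketch of a tokenizer that produces the same sequence for "Hello, World!". It is only an illustration and not how nipah_tokenizer is implemented: the `Token` enum, its variants, and the `sketch_tokenize` function are hypothetical names invented for this example, and the real crate's token type and options may differ.

```rust
// Hypothetical token type for illustration only; not the crate's actual type.
#[derive(Debug, Clone, PartialEq)]
enum Token {
    Id(String),
    Comma,
    Exclamation,
    Other(char),
}

// Hypothetical helper: groups runs of alphanumeric characters (and '_') into
// Id tokens, emits one token per punctuation character, and skips whitespace.
fn sketch_tokenize(text: &str) -> Vec<Token> {
    let mut tokens = Vec::new();
    let mut current = String::new();

    for ch in text.chars() {
        if ch.is_alphanumeric() || ch == '_' {
            // Still inside an identifier: keep accumulating.
            current.push(ch);
            continue;
        }
        // Any other character ends the identifier being built, if there is one.
        if !current.is_empty() {
            tokens.push(Token::Id(std::mem::take(&mut current)));
        }
        match ch {
            ',' => tokens.push(Token::Comma),
            '!' => tokens.push(Token::Exclamation),
            c if c.is_whitespace() => {}
            c => tokens.push(Token::Other(c)),
        }
    }
    // Flush an identifier that runs to the end of the input.
    if !current.is_empty() {
        tokens.push(Token::Id(current));
    }
    tokens
}
```

Calling `sketch_tokenize("Hello, World!")` yields `Id("Hello")`, `Comma`, `Id("World")`, `Exclamation`, which mirrors the sequence shown above.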

You can use it as your heart desires, and it is pretty versatile.

Happy coding!

Dependencies

~66KB