#tantivy #tokenizer #japanese

tantivy-tokenizer-tiny-segmenter

A Japanese tokenizer for Tantivy, based on TinySegmenter

3 releases (breaking)

0.3.0 Nov 7, 2019
0.2.0 Mar 20, 2019
0.1.0 Feb 14, 2019

#38 in #tantivy

MIT license

15KB
57 lines

tantivy-tokenizer-tiny-segmenter

A Japanese tokenizer for Tantivy based on TinySegmenter. Compatible with Tantivy 0.10.

See examples/basic.rs for basic usage.

Dependencies

~19–31MB
~403K SLoC