#tokenizer #tantivy #vibrato

tantivy-vibrato

A Tantivy tokenizer using Vibrato

1 unstable release

0.3.0 Jan 19, 2023

#44 in #tantivy

Custom license

7KB
154 lines

tantivy-vibrato

A Tantivy tokenizer using Vibrato.

Usage

let tokenizer = VibratoTokenizer::new("/path/to/dictionary")?;
let analyzer = TextAnalyzer::from(tokenizer).filter(LowerCaser);
index.tokenizers().register("lang_ja", analyzer);

You need to specify a path to the Vibrato's dictionary file.

Dependencies

~21MB
~354K SLoC