8 releases (5 breaking)
0.7.0 | Jan 5, 2023 |
---|---|
0.6.0 | Jun 5, 2022 |
0.5.1 | Aug 5, 2021 |
0.5.0 | Jun 14, 2021 |
0.1.1 | Feb 13, 2019 |
#4 in #tantivy
86 downloads per month
8KB
107 lines
tantivy-jieba
An adapter that bridges between tantivy and jieba-rs.
Usage
Add dependency tantivy-jieba
to your Cargo.toml
.
Example
use tantivy::tokenizer::*;
let tokenizer = tantivy_jieba::JiebaTokenizer {};
let mut token_stream = tokenizer.token_stream("测试");
assert_eq!(token_stream.next().unwrap().text, "测试");
assert!(token_stream.next().is_none());
Register tantivy tokenizer
use tantivy::schema::Schema;
use tantivy::tokenizer::*;
use tantivy::Index;
let tokenizer = tantivy_jieba::JiebaTokenizer {};
let index = Index::create_in_ram(schema);
index.tokenizers()
.register("jieba", tokenizer);
License
Dependencies
~19–48MB
~715K SLoC