#tantivy #jieba

tantivy-jieba

A library that bridges between tantivy and jieba-rs

8 releases (5 breaking)

0.7.0 Jan 5, 2023
0.6.0 Jun 5, 2022
0.5.1 Aug 5, 2021
0.5.0 Jun 14, 2021
0.1.1 Feb 13, 2019

#4 in #tantivy

Download history 13/week @ 2023-02-04 31/week @ 2023-02-11 42/week @ 2023-02-18 3/week @ 2023-02-25 13/week @ 2023-03-04 4/week @ 2023-03-11 30/week @ 2023-03-18 19/week @ 2023-03-25 28/week @ 2023-04-01 30/week @ 2023-04-08 28/week @ 2023-04-15 18/week @ 2023-04-22 22/week @ 2023-04-29 31/week @ 2023-05-06 9/week @ 2023-05-13 22/week @ 2023-05-20

86 downloads per month

MIT license

8KB
107 lines

tantivy-jieba

Crates.io version docs.rs Changelog FOSSA Status

An adapter that bridges between tantivy and jieba-rs.

Usage

Add dependency tantivy-jieba to your Cargo.toml.

Example

use tantivy::tokenizer::*;
let tokenizer = tantivy_jieba::JiebaTokenizer {};
let mut token_stream = tokenizer.token_stream("测试");
assert_eq!(token_stream.next().unwrap().text, "测试");
assert!(token_stream.next().is_none());

Register tantivy tokenizer

use tantivy::schema::Schema;
use tantivy::tokenizer::*;
use tantivy::Index;
let tokenizer = tantivy_jieba::JiebaTokenizer {};
let index = Index::create_in_ram(schema);
index.tokenizers()
     .register("jieba", tokenizer);

License

FOSSA Status

Dependencies

~19–48MB
~715K SLoC