#tantivy #jieba

tantivy-jieba

A library that bridges between tantivy and jieba-rs

18 breaking releases

new 0.20.0 May 23, 2026
0.18.0 Dec 31, 2025
0.17.0 Aug 26, 2025
0.15.0 Jun 18, 2025
0.1.1 Feb 13, 2019

#1 in #tantivy

Download history 6343/week @ 2026-02-04 3309/week @ 2026-02-11 2748/week @ 2026-02-18 5802/week @ 2026-02-25 9246/week @ 2026-03-04 7727/week @ 2026-03-11 7811/week @ 2026-03-18 9629/week @ 2026-03-25 9168/week @ 2026-04-01 11278/week @ 2026-04-08 8625/week @ 2026-04-15 8458/week @ 2026-04-22 6566/week @ 2026-04-29 14764/week @ 2026-05-06 31497/week @ 2026-05-13 13312/week @ 2026-05-20

67,220 downloads per month
Used in 17 crates (5 directly)

MIT license

350KB
234 lines

tantivy-jieba

Crates.io version docs.rs Changelog FOSSA Status

An adapter that bridges between tantivy and jieba-rs.

Usage

Add dependency tantivy-jieba to your Cargo.toml.

Example

use tantivy::tokenizer::*;
let mut tokenizer = tantivy_jieba::JiebaTokenizer {};
let mut token_stream = tokenizer.token_stream("测试");
assert_eq!(token_stream.next().unwrap().text, "测试");
assert!(token_stream.next().is_none());

Register tantivy tokenizer

use tantivy::schema::Schema;
use tantivy::tokenizer::*;
use tantivy::Index;
let tokenizer = tantivy_jieba::JiebaTokenizer {};
let index = Index::create_in_ram(schema);
index.tokenizers()
     .register("jieba", tokenizer);

See examples/mod.rs for detailed example.

License

FOSSA Status

Dependencies

~9MB
~77K SLoC