9 releases

Uses old Rust 2015

0.4.0 May 2, 2022
0.3.5 Feb 4, 2022
0.3.4 Dec 28, 2021
0.3.2 Aug 18, 2020
0.2.1 Nov 6, 2017

#615 in Text processing

Download history 42/week @ 2024-06-17 41/week @ 2024-06-24 6/week @ 2024-07-01 20/week @ 2024-07-08 23/week @ 2024-07-15 47/week @ 2024-07-22 34/week @ 2024-07-29 29/week @ 2024-08-05 51/week @ 2024-08-12 17/week @ 2024-08-19 62/week @ 2024-08-26 37/week @ 2024-09-02 25/week @ 2024-09-09 29/week @ 2024-09-16 94/week @ 2024-09-23 33/week @ 2024-09-30

183 downloads per month
Used in 4 crates

MIT license

38KB
583 lines

This crate provides fuzzy search/string matching using N-grams.

This implementation is character-based, rather than word based, matching solely based on string similarity.

Licensed under the MIT license.

Documentation

https://docs.rs/ngrammatic/latest/ngrammatic/

Installation

This crate is published on crates.io.

To use it, add this to your Cargo.toml:

[dependencies]
ngrammatic = "0.3.4"

Usage

To do fuzzy matching, build up your corpus of valid symbols like this:

use ngrammatic::{CorpusBuilder, Pad};

let mut corpus = CorpusBuilder::new()
    .arity(2)
    .pad_full(Pad::Auto)
    .finish();

// Build up the list of known words
corpus.add_text("pie");
corpus.add_text("animal");
corpus.add_text("tomato");
corpus.add_text("seven");
corpus.add_text("carbon");

// Now we can try an unknown/misspelled word, and find a similar match
// in the corpus
let word = String::from("tomacco");
if let Some(top_result) = corpus.search(word, 0.25).first() {
    if top_result.similarity > 0.99 {
        println!("{}", top_result.text);
    } else {
        println!("{} (did you mean {}? [{:.0}% match])",
                 word,
                 top_result.text,
                 top_result.similarity * 100.0);
    }
} else {
    println!("🗙 {}", word);
}

No runtime deps