9 releases

Uses old Rust 2015

0.4.0 May 2, 2022
0.3.5 Feb 4, 2022
0.3.4 Dec 28, 2021
0.3.2 Aug 18, 2020
0.2.1 Nov 6, 2017

#860 in Text processing

Download history 42/week @ 2024-07-21 40/week @ 2024-07-28 30/week @ 2024-08-04 48/week @ 2024-08-11 17/week @ 2024-08-18 50/week @ 2024-08-25 51/week @ 2024-09-01 25/week @ 2024-09-08 26/week @ 2024-09-15 90/week @ 2024-09-22 41/week @ 2024-09-29 23/week @ 2024-10-06 69/week @ 2024-10-13 186/week @ 2024-10-20 114/week @ 2024-10-27 35/week @ 2024-11-03

407 downloads per month
Used in 4 crates

MIT license

38KB
583 lines

This crate provides fuzzy search/string matching using N-grams.

This implementation is character-based, rather than word based, matching solely based on string similarity.

Licensed under the MIT license.

Documentation

https://docs.rs/ngrammatic/latest/ngrammatic/

Installation

This crate is published on crates.io.

To use it, add this to your Cargo.toml:

[dependencies]
ngrammatic = "0.3.4"

Usage

To do fuzzy matching, build up your corpus of valid symbols like this:

use ngrammatic::{CorpusBuilder, Pad};

let mut corpus = CorpusBuilder::new()
    .arity(2)
    .pad_full(Pad::Auto)
    .finish();

// Build up the list of known words
corpus.add_text("pie");
corpus.add_text("animal");
corpus.add_text("tomato");
corpus.add_text("seven");
corpus.add_text("carbon");

// Now we can try an unknown/misspelled word, and find a similar match
// in the corpus
let word = String::from("tomacco");
if let Some(top_result) = corpus.search(word, 0.25).first() {
    if top_result.similarity > 0.99 {
        println!("{}", top_result.text);
    } else {
        println!("{} (did you mean {}? [{:.0}% match])",
                 word,
                 top_result.text,
                 top_result.similarity * 100.0);
    }
} else {
    println!("🗙 {}", word);
}

No runtime deps