#fuzzy-search #fuzzy-string #search-engine #string-search #search #edit-distance #search-pattern

simsearch

A simple and lightweight fuzzy search engine that works in memory, searching for similar strings (a pun here)

9 releases

0.2.5 Apr 19, 2024
0.2.4 Feb 6, 2023
0.2.3 Apr 11, 2021
0.2.2 Dec 3, 2020
0.1.4 Apr 16, 2019

#307 in Algorithms

Download history 1713/week @ 2024-08-07 1794/week @ 2024-08-14 1102/week @ 2024-08-21 1142/week @ 2024-08-28 1340/week @ 2024-09-04 978/week @ 2024-09-11 1260/week @ 2024-09-18 999/week @ 2024-09-25 411/week @ 2024-10-02 1163/week @ 2024-10-09 966/week @ 2024-10-16 1619/week @ 2024-10-23 1722/week @ 2024-10-30 1175/week @ 2024-11-06 1317/week @ 2024-11-13 1331/week @ 2024-11-20

5,860 downloads per month
Used in 6 crates

MIT/Apache

19KB
208 lines

simsearch

Build Status crates.io docs.rs

A simple and lightweight fuzzy search engine that works in memory, searching for similar strings (a pun here).

Documentation

Usage

Add the following to your Cargo.toml:

[dependencies]
simsearch = "0.2"

Example

use simsearch::SimSearch;

let mut engine: SimSearch<u32> = SimSearch::new();

engine.insert(1, "Things Fall Apart");
engine.insert(2, "The Old Man and the Sea");
engine.insert(3, "James Joyce");

let results: Vec<u32> = engine.search("thngs");

assert_eq!(results, &[1]);

By default, Jaro-Winkler distance is used. An alternative Levenshtein distance, which is SIMD-accelerated but only works for ASCII byte strings, can be specified with custom SearchOptions:

use simsearch::{SimSearch, SearchOptions};

let options = SearchOptions::new().levenshtein(true);
let mut engine: SimSearch<u32> = SimSearch::new_with(options);

Also try the interactive demo by:

$ cargo run --release --example books

Contribution

All kinds of contribution are welcomed.

  • Issus. Feel free to open an issue when you find typos, bugs, or have any question.
  • Pull requests. New collection, better implementation, more tests, more documents and typo fixes are all welcomed.

License

Licensed under MIT license (LICENSE-MIT or http://opensource.org/licenses/MIT)

Dependencies

~270–435KB