#similarity #search #lsh #all-pairs

all-pairs-hamming

All pairs similarity search on binary sketches in the Hamming space

1 unstable release

0.1.0 Sep 25, 2022

#7 in #lsh

Download history 203/week @ 2024-01-31 76/week @ 2024-02-07 402/week @ 2024-02-14 308/week @ 2024-02-21 293/week @ 2024-02-28 367/week @ 2024-03-06 307/week @ 2024-03-13 404/week @ 2024-03-20 324/week @ 2024-03-27 321/week @ 2024-04-03 162/week @ 2024-04-10 294/week @ 2024-04-17 206/week @ 2024-04-24 83/week @ 2024-05-01 255/week @ 2024-05-08 168/week @ 2024-05-15

758 downloads per month
Used in find-simdoc

MIT/Apache

32KB
794 lines

All pairs similarity search on binary sketches in the Hamming space

This library provides a fast and compact all pairs similarity search (or similarity self-join) on binary sketches in the Hamming space. The algorithm employs a modified variant of the sketch sorting approach, a combination of the multiple sorting and the multi-index approach.

This library is a part of find-simdoc.


lib.rs:

This library provides a fast and compact all pairs similarity search (or similarity self-join) on binary sketches in the Hamming space. The algorithm employs a modified variant of the sketch sorting approach, a combination of the multiple sorting and the multi-index approach.

Dependencies

~1MB
~17K SLoC