#similarity #search #lsh #all-pairs

all-pairs-hamming

All pairs similarity search on binary sketches in the Hamming space

1 unstable release

0.1.0 Sep 25, 2022

#7 in #lsh

Download history 173/week @ 2023-12-07 282/week @ 2023-12-14 291/week @ 2023-12-21 56/week @ 2023-12-28 75/week @ 2024-01-04 175/week @ 2024-01-11 201/week @ 2024-01-18 199/week @ 2024-01-25 184/week @ 2024-02-01 89/week @ 2024-02-08 411/week @ 2024-02-15 333/week @ 2024-02-22 257/week @ 2024-02-29 376/week @ 2024-03-07 288/week @ 2024-03-14 483/week @ 2024-03-21

1,464 downloads per month
Used in find-simdoc

MIT/Apache

32KB
794 lines

All pairs similarity search on binary sketches in the Hamming space

This library provides a fast and compact all pairs similarity search (or similarity self-join) on binary sketches in the Hamming space. The algorithm employs a modified variant of the sketch sorting approach, a combination of the multiple sorting and the multi-index approach.

This library is a part of find-simdoc.


lib.rs:

This library provides a fast and compact all pairs similarity search (or similarity self-join) on binary sketches in the Hamming space. The algorithm employs a modified variant of the sketch sorting approach, a combination of the multiple sorting and the multi-index approach.

Dependencies

~1MB
~17K SLoC