#similarity #search #all-pairs #lsh

all-pairs-hamming

All pairs similarity search on binary sketches in the Hamming space

1 unstable release

0.1.0 Sep 25, 2022

#8 in #lsh


932 downloads per month
Used in find-simdoc

MIT/Apache

32KB
794 lines

All pairs similarity search on binary sketches in the Hamming space

This library provides fast and compact all-pairs similarity search (also known as similarity self-join) on binary sketches in the Hamming space. The algorithm employs a modified variant of the sketch sorting approach, a combination of the multiple sorting and multi-index approaches.

This library is part of find-simdoc.
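To make the problem concrete, here is a minimal, standalone sketch of all-pairs search over 64-bit sketches. It does not use this crate's API, and the names `hamming`, `all_pairs_naive`, and `all_pairs_multi_index` are hypothetical. The first function is the quadratic baseline; the second illustrates the general multi-index idea (split each sketch into `radius + 1` blocks, so any two sketches within `radius` must agree exactly on at least one block) using a hash map per block. It is an illustration under those assumptions, not the crate's actual implementation, which may differ in how it sorts, blocks, and verifies candidates.

```rust
use std::collections::HashMap;

/// Hamming distance between two 64-bit binary sketches.
fn hamming(a: u64, b: u64) -> u32 {
    (a ^ b).count_ones()
}

/// Baseline: verify every pair (i, j) with i < j.
fn all_pairs_naive(sketches: &[u64], radius: u32) -> Vec<(usize, usize)> {
    let mut pairs = Vec::new();
    for i in 0..sketches.len() {
        for j in (i + 1)..sketches.len() {
            if hamming(sketches[i], sketches[j]) <= radius {
                pairs.push((i, j));
            }
        }
    }
    pairs
}

/// Multi-index filtering (illustrative, hypothetical helper).
/// With `radius + 1` blocks and at most `radius` differing bits, at least
/// one block is identical (pigeonhole), so only sketches that share a
/// block value need to be verified.
fn all_pairs_multi_index(sketches: &[u64], radius: u32) -> Vec<(usize, usize)> {
    let num_blocks = radius as usize + 1;
    let mut pairs = Vec::new();
    for b in 0..num_blocks {
        // Block b covers bits [lo, hi); widths differ by at most one.
        let lo = b * 64 / num_blocks;
        let hi = (b + 1) * 64 / num_blocks;
        let width = hi - lo;
        let mask = if width == 64 { u64::MAX } else { (1u64 << width) - 1 };

        // Group sketch ids by the value of this block.
        let mut buckets: HashMap<u64, Vec<usize>> = HashMap::new();
        for (i, &s) in sketches.iter().enumerate() {
            buckets.entry((s >> lo) & mask).or_default().push(i);
        }

        // Verify candidates inside each bucket with the full distance.
        for ids in buckets.values() {
            for (k, &i) in ids.iter().enumerate() {
                for &j in &ids[k + 1..] {
                    if hamming(sketches[i], sketches[j]) <= radius {
                        pairs.push((i, j));
                    }
                }
            }
        }
    }
    // A pair may match in several blocks; report it once.
    pairs.sort_unstable();
    pairs.dedup();
    pairs
}
```

Both functions return index pairs `(i, j)` with `i < j`. Sorting the output of `all_pairs_naive` should give the same list as `all_pairs_multi_index` for any input, which makes the baseline a convenient correctness check for the filtered version.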



Dependencies

~1MB
~17K SLoC