#digest #hash #similarity #tlsh #hashing

tlsh2

A rust implementation of the TLSH algorithm

4 releases (2 breaking)

0.3.0 Jul 30, 2023
0.2.1 May 4, 2023
0.2.0 Jan 1, 2023
0.1.0 Dec 30, 2022

#1404 in Algorithms

Download history 72/week @ 2023-12-15 67/week @ 2023-12-22 395/week @ 2023-12-29 42/week @ 2024-01-05 41/week @ 2024-01-12 71/week @ 2024-01-19 205/week @ 2024-01-26 109/week @ 2024-02-02 127/week @ 2024-02-09 147/week @ 2024-02-16 59/week @ 2024-02-23 204/week @ 2024-03-01 78/week @ 2024-03-08 87/week @ 2024-03-15 90/week @ 2024-03-22 105/week @ 2024-03-29

362 downloads per month
Used in boreal

Apache-2.0 OR BSD-3-Clause

520KB
3.5K SLoC

TLSH2

Build status Crates.io Documentation

Rust port of the TLSH library. The code is kept close to the original C++ version, to limit bugs and help maintainability

This crate is no_std and different configurations of bucket numbers and checksum length are handled as generics, making every configuration properly optimized.

// The default builder uses 128 buckets and a 1-byte checksum.
// Other builders are also available.
let mut builder = tlsh2::TlshDefaultBuilder::new();
builder.update(b"Sed ut perspiciatis unde omnis iste natus");
builder.update(b"error sit voluptatem accusantium");
let tlsh = builder.build()
    .ok_or_else(|| "could not generate TLSH from payload")?;

// Alternatively, a TLSH object can be generated directly from
// a byte slice.
let tlsh2 = tlsh2::TlshDefaultBuilder::build_from(
    b"odit aut fugit, sed quia consequuntur magni dolores"
).ok_or_else(|| "could not generate TLSH from second payload")?;

// Then, the TLSH object can be used to generated a hash or compute
// distances
assert_eq!(
    tlsh.hash(),
    b"T184A022B383C2A2A20ACB0830880CF0202CCAC080033A023800338\
      A30B0880AA8E0BE38".as_slice(),
);
// The `diff` feature is required for this computation.
assert_eq!(tlsh.diff(&tlsh2, true), 209);

Those configurations are available:

  • 128 buckets and 1-byte checksum (default).
  • 128 buckets and 3-byte checksum.
  • 256 buckets and 1-byte checksum.
  • 256 buckets and 3-byte checksum.
  • 48 buckets and 1-byte checksum.

The threaded and private options that exists in the original TLSH version are not yet implemented.

No runtime deps