#adler32 #checksum #simd #performance #adler

no-std adler32fast

Fast, SIMD-accelerated Adler-32 checksum computation

6 stable releases

1.1.0 Apr 11, 2021
1.0.4 Apr 10, 2021

#2666 in Algorithms

MIT/Apache

27KB
558 lines

adler32fast

Fast, SIMD-accelerated Adler-32 checksum computation

Usage

use adler32fast::Adler32;

let mut adler32 = Adler32::new();
adler32.update(b"foo bar baz");
let checksum = adler32.as_u32();

Performance

This crate contains multiple Adler-32 implementations:

  • A fast baseline implementation which processes up to 16 bytes per iteration
  • An optimized implementation for modern x86/x86_64 using either avx2 or ssse3 instructions

Calling the Adler32::new or Adler32::from constructor at runtime will perform a feature detection to select the most optimal implementation for the current CPU feature set.

The following was tested on a Razer Book 13 RZ09-0357, i7-1165G7 @ 2.80 GHz with 100kb of data.

crate version variant us/iter GiB/s
adler32 1.2.0 n/a 23.50 3.96
adler32fast (this crate) 1.1.0 baseline 22.69 4.10
adler32fast (this crate) 1.1.0 ssse3 3.09 30.17
adler32fast (this crate) 1.1.0 avx2 2.36 39.43

Benchmarks using criterion can be run on stable Rust with cargo bench.

Contributions are welcomed for more SIMD variants!

Memory Safety

Due to the use of SIMD intrinsics for the optimized implementations, this crate contains some amount of unsafe code.

adler32fast is fuzz-tested with cargo-fuzz. (NOTE: cargo-fuzz does not work on Windows, but can run under WSL.)

cargo-fuzz currently requires the nightly toolchain.

$ cargo install cargo-fuzz
$ rustup toolchain install nightly

$ cargo +nightly fuzz run ssse3

Credits

This work is based on crc32fast as inspiration.

The AVX2 implementation was derived from simd-adler32

The SSSE3 implementation has been derived from Google's Wuffs implementation.

License

Licensed under either of

at your option.

Contribution

Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.

Dependencies