#lz4 #decompression

lz-fear

A fast pure-rust no-unsafe implementation of LZ4 compression and decompression

3 unstable releases

0.2.0 Feb 8, 2024
0.1.1 May 21, 2020
0.1.0 Apr 16, 2020

#247 in Compression

2,018 downloads per month
Used in 14 crates (5 directly)

MIT license

145KB
896 lines

lz-fear

This crate aims to implement the LZ4 compression and decompression algorithm, as well as the framing format used for LZ4 files, in pure Rust with no unsafe code. The output perfectly matches the C reference implementation byte for byte. At the time of writing, this is also the fastest no-unsafe implementation that I'm aware of.

The lz4 crate calls into the C library. The compress crate has an implementation that relies on unsafe. And the redox implementation (more on that below) is slower, in some cases substantially.

Decompressor status: Beta. Works well, and is blazingly fast (at least as fast as the official C implementation in my tests).

Compressor status: Alpha. You can expect it to produce perfect output (i.e. identical to what the C library produces) for most configurations. There are three known exceptions: non-default block sizes (I am not entirely sure what the problem is), dictionaries (currently implemented slightly differently), and one other edge case of unknown cause where the output differs slightly. All of these cases still produce valid and correct output; it is just encoded slightly differently than by the C implementation, so the compression ratio may be slightly worse. The API may still change a little. The example named "dolz4" is a compressor but currently lacks a CLI, so you have to configure it by changing the code. Performance is good, but compression takes ~2-3x as long as the C implementation. The current bottleneck appears to be an abundance of range checks when writing output (~25% of cycles are spent there), which also cause the compiler to trip over itself and sometimes emit a sequence of copy_from_slice calls for 1-byte and 4-byte writes to the output array. Help wanted.
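To make the range-check issue concrete, here is a minimal sketch of the kind of small write involved. It is not code from this crate: the function names put_u32_le_naive and put_u32_le are hypothetical, and whether the second form actually compiles to better machine code depends on the compiler version and the surrounding code. It only shows the difference between checking every byte and checking the whole 4-byte window once.

```rust
/// Hypothetical helper: writes `value` little-endian at `pos`, one byte at a time.
/// Every `out[pos + i]` carries its own range check unless the optimizer
/// can prove it redundant. Panics if the write does not fit.
fn put_u32_le_naive(out: &mut [u8], pos: usize, value: u32) {
    let bytes = value.to_le_bytes();
    for (i, b) in bytes.iter().enumerate() {
        out[pos + i] = *b;
    }
}

/// Hypothetical helper: performs a single range check for the whole window,
/// then copies 4 bytes into a slice whose length is already known.
/// Returns the position after the write, or None if it does not fit.
fn put_u32_le(out: &mut [u8], pos: usize, value: u32) -> Option<usize> {
    let end = pos.checked_add(4)?;
    // The only range check: fails cleanly instead of panicking.
    let window = out.get_mut(pos..end)?;
    window.copy_from_slice(&value.to_le_bytes());
    Some(end)
}
```

Both helpers are fully safe Rust. The second returns an Option so that the caller decides how to handle a full output buffer, rather than relying on a panic.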

If you take a look at the git history, this is, strictly speaking, a fork of @johannesvollmer's work. He took redox-os' lz4 compression and reworked it to be usable as a library crate.

However, after noticing performance issues, I gradually rewrote both the compressor and the decompressor from scratch to closely resemble the 2400-line mess that is the official C implementation (and that's just the raw format without framing!). Admittedly, they pack plenty of optimizations in there (lots of intentionally reading beyond buffer boundaries for the sake of performance), but I'm proud to say that I achieved similar performance in just 400 lines of completely safe Rust.
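For readers unfamiliar with the raw block format, the following is a minimal sketch of an LZ4 block decoder in safe Rust. It is an illustration only, not this crate's implementation: the names DecodeError, read_length and decompress_block are hypothetical, it makes no attempt at speed, and it does not enforce the format's end-of-block restrictions. It shows how malformed input turns into an error instead of an out-of-bounds read when every access goes through checked slicing.

```rust
/// Errors this sketch reports for malformed input (hypothetical type).
#[derive(Debug, PartialEq)]
enum DecodeError {
    UnexpectedEof,
    InvalidOffset,
}

/// Reads an LZ4 length: a 4-bit nibble, extended by extra bytes
/// for as long as each extra byte equals 255.
fn read_length(input: &[u8], pos: &mut usize, nibble: u8) -> Result<usize, DecodeError> {
    let mut len = nibble as usize;
    if nibble == 15 {
        loop {
            let b = *input.get(*pos).ok_or(DecodeError::UnexpectedEof)?;
            *pos += 1;
            len += b as usize;
            if b != 255 {
                break;
            }
        }
    }
    Ok(len)
}

/// Decodes one raw LZ4 block (no framing) into a new Vec.
fn decompress_block(input: &[u8]) -> Result<Vec<u8>, DecodeError> {
    let mut out = Vec::new();
    let mut pos = 0;
    while pos < input.len() {
        // Token: high nibble = literal length, low nibble = match length - 4.
        let token = input[pos];
        pos += 1;

        // Copy the literals through a checked sub-slice.
        let lit_len = read_length(input, &mut pos, token >> 4)?;
        let lit_end = pos.checked_add(lit_len).ok_or(DecodeError::UnexpectedEof)?;
        let literals = input.get(pos..lit_end).ok_or(DecodeError::UnexpectedEof)?;
        out.extend_from_slice(literals);
        pos = lit_end;

        // The last sequence of a block consists of literals only.
        if pos == input.len() {
            break;
        }

        // 2-byte little-endian offset pointing back into the output.
        let off = input.get(pos..pos + 2).ok_or(DecodeError::UnexpectedEof)?;
        let offset = u16::from_le_bytes([off[0], off[1]]) as usize;
        pos += 2;
        if offset == 0 || offset > out.len() {
            return Err(DecodeError::InvalidOffset);
        }

        // Copy the match byte by byte, because it may overlap its own output.
        let match_len = read_length(input, &mut pos, token & 0x0F)? + 4;
        let start = out.len() - offset;
        for i in 0..match_len {
            let b = out[start + i];
            out.push(b);
        }
    }
    Ok(out)
}
```

As a usage example, the 10-byte block [0x11, b'a', 0x01, 0x00, 0x50, b'b', b'c', b'd', b'e', b'f'] holds one literal 'a', a 5-byte match at offset 1, and five trailing literals, so decompress_block returns the bytes of "aaaaaabcdef". The byte-wise match copy is the simplest correct choice for overlapping matches; a tuned decoder would copy larger chunks where the offset allows it.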

Code Fuzzing

Fuzzing requires a nightly toolchain. Fuzzing for this project is currently confirmed to work with:

rustc 1.44.0-nightly (f509b26a7 2020-03-18)

Running

cargo install cargo-fuzz
cargo +nightly fuzz run roundtrip_fuzz

Dependencies

~0.6–1.1MB
~24K SLoC