bin+lib zarrs_tools

Tools for creating and manipulating Zarr V3 data

14 releases (4 breaking)

new 0.5.3 Jul 24, 2024
0.4.2 May 16, 2024
0.3.0 Feb 22, 2024
0.2.0 Dec 26, 2023

MIT/Apache

zarrs_tools

Various tools for creating and manipulating Zarr V3 data with the zarrs Rust crate.

A changelog is available in the project repository.

Tools

All tools support Zarr V3 data as input and output. Some tools additionally accept a V3-compatible subset of Zarr V2 as input.

  • zarrs_reencode: reencode an array. Manipulate the chunk size, shard size, codecs, fill value, chunk key encoding separator, and attributes.
  • zarrs_filter (feature filter): apply simple image filters (transformations) to an array.
  • zarrs_ome (feature ome): convert an array to an OME-Zarr multi-scale image.
    • Supports OME-Zarr 0.5-dev (as Zarr V3) and 0.5-dev1. The first is recognised by Neuroglancer.
  • zarrs_info (feature info): return metadata related info or the range/histogram of an array.
  • zarrs_binary2zarr (feature binary2zarr): create an array from piped binary data.
  • zarrs_ncvar2zarr (feature ncvar2zarr): convert a netCDF variable to an array.

See docs/ for tool documentation.

zarrs Benchmarking

  • zarrs_reencode: suitable for round trip benchmarking.
  • zarrs_benchmark_read_sync (feature benchmark): benchmark the zarrs sync API.
  • zarrs_benchmark_read_async (feature benchmark): benchmark the zarrs async API.

See docs/benchmarks.md for some benchmark measurements.

Install

From crates.io

cargo install --all-features zarrs_tools
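If only some of the tools are needed, their features can be named individually instead of using --all-features. The following is a minimal sketch, assuming the feature names shown in the Tools list (filter, ome, info, binary2zarr, ncvar2zarr, benchmark) map directly to cargo features; the variable name features is only illustrative.

```shell
# Feature names are taken from the Tools list above (assumed to be cargo features).
features="filter,info"

# Print the command for review; remove `echo` to actually run the install.
echo cargo install zarrs_tools --features "$features"
```

This should build zarrs_filter and zarrs_info alongside any tools that are not feature gated, such as zarrs_reencode.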

From source

cargo install --all-features --path .
# cargo install --all-features --git https://github.com/LDeakin/zarrs_tools

Enabling SIMD intrinsics

Encoding and decoding performance may improve when the avx2/sse2 target features are enabled, provided the CPU supports them.

Enable them by compiling with either of:

  • RUSTFLAGS="-C target-cpu=native"
  • RUSTFLAGS="-C target-feature=+avx2,+sse2"
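RUSTFLAGS is an environment variable read at build time, so it is set on the same command line as the install. As a rough sketch of choosing the second form only for features the CPU reports, the helper below builds the flag string from a list of CPU flags. The name simd_rustflags is hypothetical and not part of zarrs_tools, and the detection step assumes a Linux host where /proc/cpuinfo has a "flags" line.

```shell
# Hypothetical helper (not part of zarrs_tools): build a RUSTFLAGS value from
# a space-separated list of CPU flags, keeping only avx2 and sse2.
simd_rustflags() {
  cpu_flags=" $1 "
  features=""
  for f in avx2 sse2; do
    case "$cpu_flags" in
      *" $f "*) features="${features:+$features,}+$f" ;;
    esac
  done
  if [ -n "$features" ]; then
    printf '%s' "-C target-feature=$features"
  fi
}

# Assumption: Linux, where /proc/cpuinfo lists CPU flags on a "flags" line.
if [ -r /proc/cpuinfo ]; then
  detected="$(grep -m1 '^flags' /proc/cpuinfo | cut -d: -f2 || true)"
  # Print the resulting command for review instead of running it.
  echo "RUSTFLAGS=\"$(simd_rustflags "$detected")\" cargo install --all-features zarrs_tools"
fi
```

On other platforms, or when building for the local machine only, RUSTFLAGS="-C target-cpu=native" cargo install --all-features zarrs_tools is the simpler choice.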

Enabling non-default zarrs codecs

Non-default zarrs codecs (see zarrs crate features) can be enabled by passing them as feature flags. For example:

cargo install zarrs_tools --all-features --features zarrs/bitround,zarrs/zfp,zarrs/bz2,zarrs/pcodec

Licence

zarrs_tools is licensed under either of the Apache License, Version 2.0 or the MIT license, at your option.

Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.
