zarrs
zarrs is a Rust library for the Zarr storage format for multidimensional arrays and metadata.
[!TIP] If you are a Python user, check out zarrs-python. It includes a high-performance codec pipeline for the reference zarr-python implementation.
zarrs supports Zarr V3 and a V3 compatible subset of Zarr V2.
It is fully up-to-date and conformant with the Zarr 3.1 specification with support for:
- all core extensions (data types, codecs, chunk grids, chunk key encodings, storage transformers),
- all accepted Zarr Enhancement Proposals (ZEPs) and several draft ZEPs:
  - ZEP 0003: Variable chunking
  - ZEP 0007: Strings
  - ZEP 0009: Zarr Extension Naming
- various registered extensions from zarr-developers/zarr-extensions/,
- experimental codecs and data types intended for future registration, and
- user-defined custom extensions and stores.
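To make one of the extension points listed above concrete, the following is a minimal, std-only sketch of two chunk key encodings as described in the Zarr V3 specification: `default` (prefix `c`, then the chunk indices) and `v2` (indices only). The function names `default_chunk_key` and `v2_chunk_key` are hypothetical and are not part of the zarrs API. The sketch only illustrates how chunk grid indices could map to store keys.

```rust
/// Encode chunk grid indices in the style of the Zarr V3 `default` chunk key
/// encoding: the prefix `c` followed by each index, joined by `separator`.
/// (Hypothetical helper for illustration, not a zarrs function.)
fn default_chunk_key(chunk_indices: &[u64], separator: char) -> String {
    let mut key = String::from("c");
    for index in chunk_indices {
        key.push(separator);
        key.push_str(&index.to_string());
    }
    key
}

/// Encode chunk grid indices in the style of the `v2` chunk key encoding:
/// the indices joined by `separator`, with `0` for a zero-dimensional array.
/// (Hypothetical helper for illustration, not a zarrs function.)
fn v2_chunk_key(chunk_indices: &[u64], separator: char) -> String {
    if chunk_indices.is_empty() {
        return String::from("0");
    }
    chunk_indices
        .iter()
        .map(|index| index.to_string())
        .collect::<Vec<_>>()
        .join(&separator.to_string())
}
```

Under these assumptions, chunk `[0, 1]` maps to `c/0/1` with the `default` encoding and a `/` separator, and to `0.1` with the `v2` encoding and a `.` separator.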
A changelog can be found here. Correctness issues with past versions are detailed here.
Developed at the Department of Materials Physics, Australian National University, Canberra, Australia.
Getting Started
- Review the implementation status, which summarises Zarr version support, array support (codecs, data types, etc.), and storage support.
- Read The zarrs Book.
- View the examples and the example below.
- Read the documentation.
- Check out the zarrs ecosystem.
Example
```rust
use std::path::PathBuf;
use std::sync::Arc;

use zarrs::group::GroupBuilder;
use zarrs::array::{ArrayBuilder, DataType, FillValue, ZARR_NAN_F32};
use zarrs::array::codec::GzipCodec; // requires gzip feature
use zarrs::array_subset::ArraySubset;
use zarrs::storage::ReadableWritableListableStorage;
use zarrs::filesystem::FilesystemStore; // requires filesystem feature

// Create a filesystem store
let store_path: PathBuf = "/path/to/hierarchy.zarr".into();
let store: ReadableWritableListableStorage =
    Arc::new(FilesystemStore::new(&store_path)?);

// Write the root group metadata
GroupBuilder::new()
    .build(store.clone(), "/")?
    // .attributes(...)
    .store_metadata()?;

// Create a new V3 array using the array builder
let array = ArrayBuilder::new(
    vec![3, 4], // array shape
    DataType::Float32,
    vec![2, 2].try_into()?, // regular chunk shape (non-zero elements)
    FillValue::from(ZARR_NAN_F32),
)
.bytes_to_bytes_codecs(vec![
    Arc::new(GzipCodec::new(5)?),
])
.dimension_names(["y", "x"].into())
.attributes(serde_json::json!({"Zarr V3": "is great"}).as_object().unwrap().clone())
.build(store.clone(), "/array")?; // /path/to/hierarchy.zarr/array

// Store the array metadata
array.store_metadata()?;
println!("{}", array.metadata().to_string_pretty());
// {
//     "zarr_format": 3,
//     "node_type": "array",
//     ...
// }

// Perform some operations on the chunks
array.store_chunk_elements::<f32>(
    &[0, 1], // chunk index
    &[0.2, 0.3, 1.2, 1.3]
)?;
array.store_array_subset_ndarray::<f32, _>(
    &[1, 1], // array index (start of subset)
    ndarray::array![[-1.1, -1.2], [-2.1, -2.2]]
)?;
array.erase_chunk(&[1, 1])?;

// Retrieve all array elements as an ndarray
let array_ndarray = array.retrieve_array_subset_ndarray::<f32>(&array.subset_all())?;
println!("{array_ndarray:4}");
// [[ NaN,  NaN,  0.2,  0.3],
//  [ NaN, -1.1, -1.2,  1.3],
//  [ NaN, -2.1,  NaN,  NaN]]
```
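The output of the example follows from how a regular chunk grid partitions the array. Below is a minimal, std-only sketch of that arithmetic. The names `chunk_of` and `chunks_overlapping` are hypothetical helpers written for illustration. They are not zarrs functions, and the sketch assumes a regular grid with non-zero chunk dimensions.

```rust
/// Index of the chunk containing `element` in a regular grid of `chunk_shape`.
/// Assumes every entry of `chunk_shape` is non-zero.
fn chunk_of(element: &[u64], chunk_shape: &[u64]) -> Vec<u64> {
    element.iter().zip(chunk_shape).map(|(e, c)| e / c).collect()
}

/// Indices of all chunks overlapped by the subset that starts at `start` and
/// has the given `shape`, listed in row-major order.
fn chunks_overlapping(start: &[u64], shape: &[u64], chunk_shape: &[u64]) -> Vec<Vec<u64>> {
    // An empty subset overlaps no chunks.
    if shape.iter().any(|&s| s == 0) {
        return Vec::new();
    }
    // First and last chunk touched along each dimension.
    let first = chunk_of(start, chunk_shape);
    let last_element: Vec<u64> = start.iter().zip(shape).map(|(s, n)| s + n - 1).collect();
    let last = chunk_of(&last_element, chunk_shape);

    // Cartesian product of the per-dimension chunk ranges.
    let mut chunks: Vec<Vec<u64>> = vec![Vec::new()];
    for (lo, hi) in first.iter().zip(&last) {
        let mut next = Vec::new();
        for prefix in &chunks {
            for i in *lo..=*hi {
                let mut chunk = prefix.clone();
                chunk.push(i);
                next.push(chunk);
            }
        }
        chunks = next;
    }
    chunks
}
```

With the 2 x 2 chunk shape from the example, `chunks_overlapping(&[1, 1], &[2, 2], &[2, 2])` yields the four chunks `[0, 0]`, `[0, 1]`, `[1, 0]` and `[1, 1]`, so the subset write touches every chunk of the 3 x 4 array. Erasing chunk `[1, 1]` afterwards resets only the elements in row 2, columns 2 and 3 to the fill value, which is why `-2.2` is absent from the printed result.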
zarrs Ecosystem
The Zarr specification is inherently unstable. It is under active development and new extensions are continually being introduced.
The zarrs crate has been split into multiple crates to:
- allow external implementations of stores and extension points to target a relatively stable API compatible with a range of zarrs versions,
- enable automatic backporting of metadata compatibility fixes and changes due to standardisation,
- stay up-to-date with unstable public dependencies (e.g. opendal, object_store, icechunk, etc.) without impacting the release cycle of zarrs, and
- improve compilation times.
A hierarchical overview of these crates can be found in The zarrs Book.
Core
- zarrs: The core library for manipulating Zarr hierarchies.
- zarrs_metadata: Zarr metadata support (re-exported as zarrs::metadata).
- zarrs_metadata_ext: Zarr extensions metadata support (re-exported as zarrs::metadata_ext).
- zarrs_data_type: The data type extension API for zarrs (re-exported in zarrs::array::data_type).
- zarrs_storage: The storage API for zarrs (re-exported as zarrs::storage).
- zarrs_plugin: The plugin API for zarrs (re-exported as zarrs::plugin).
- zarrs_registry: The Zarr extension point registry for zarrs (re-exported as zarrs::registry).
Stores
- zarrs_filesystem: A filesystem store (re-exported as zarrs::filesystem).
- zarrs_object_store: object_store store support.
- zarrs_opendal: opendal store support.
- zarrs_http: A synchronous HTTP store.
- zarrs_zip: A storage adapter for zip files.
- zarrs_icechunk: icechunk store support.
  - git-like version control for Zarr hierarchies.
  - Read "virtual Zarr datacubes" of archival formats (e.g., netCDF4, HDF5, etc.) created by VirtualiZarr and backed by icechunk.
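The stores listed above all play the same role: they expose a hierarchy as keys mapped to byte values, and that is what lets one array API sit on top of a filesystem, an object store, or a zip file. The following is a minimal, std-only sketch of that idea. `KeyValueStore`, `MemoryStore` and `SharedStore` are hypothetical names chosen for illustration. They are not the traits or types defined by zarrs_storage, whose actual interface should be taken from its documentation.

```rust
use std::collections::BTreeMap;
use std::sync::{Arc, RwLock};

/// Hypothetical minimal store interface (NOT the zarrs_storage traits).
trait KeyValueStore: Send + Sync {
    fn get(&self, key: &str) -> Option<Vec<u8>>;
    fn set(&self, key: &str, value: &[u8]);
    fn erase(&self, key: &str);
    fn list_prefix(&self, prefix: &str) -> Vec<String>;
}

/// Hypothetical in-memory store backed by a sorted map.
#[derive(Default)]
struct MemoryStore {
    entries: RwLock<BTreeMap<String, Vec<u8>>>,
}

impl KeyValueStore for MemoryStore {
    fn get(&self, key: &str) -> Option<Vec<u8>> {
        self.entries.read().unwrap().get(key).cloned()
    }

    fn set(&self, key: &str, value: &[u8]) {
        self.entries
            .write()
            .unwrap()
            .insert(key.to_string(), value.to_vec());
    }

    fn erase(&self, key: &str) {
        self.entries.write().unwrap().remove(key);
    }

    /// Keys starting with `prefix`, in sorted order.
    fn list_prefix(&self, prefix: &str) -> Vec<String> {
        self.entries
            .read()
            .unwrap()
            .keys()
            .filter(|key| key.starts_with(prefix))
            .cloned()
            .collect()
    }
}

/// A shared, type-erased store handle that can be cloned cheaply.
type SharedStore = Arc<dyn KeyValueStore>;
```

A handle could then be created with `let store: SharedStore = Arc::new(MemoryStore::default());` and cloned wherever a group or array needs access, much as the example above passes `store.clone()` to each builder.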
Bindings
- zarrs-python: A high-performance codec pipeline for zarr-python.
- zarrs_ffi: A subset of zarrs exposed as a C/C++ API.
Zarr Metadata Conventions
- ome_zarr_metadata: A library for OME-Zarr (previously OME-NGFF) metadata.
Tools
- zarrs_tools: Various tools for creating and manipulating Zarr V3 data with the zarrs Rust crate.
  - A reencoder that can change codecs, chunk shape, convert Zarr V2 to V3, etc.
  - Create an OME-Zarr hierarchy from a Zarr array.
  - Transform arrays: crop, rescale, downsample, gradient magnitude, gaussian, noise filtering, etc.
Benchmarks
- zarr_benchmarks: Benchmarks of various Zarr V3 implementations: zarrs, zarr-python, tensorstore.
Licence
zarrs is licensed under either of
- the Apache License, Version 2.0 LICENSE-APACHE or http://www.apache.org/licenses/LICENSE-2.0 or
- the MIT license LICENSE-MIT or http://opensource.org/licenses/MIT,

at your option.
Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.