91 releases (53 major breaking)

53.3.0 Nov 20, 2024
53.0.0 Sep 3, 2024
52.2.0 Jul 28, 2024
51.0.0 Mar 18, 2024
0.3.0 May 5, 2018

#7 in Compression

Download history 189940/week @ 2024-08-23 166963/week @ 2024-08-30 169486/week @ 2024-09-06 165814/week @ 2024-09-13 170262/week @ 2024-09-20 173379/week @ 2024-09-27 177505/week @ 2024-10-04 175076/week @ 2024-10-11 151977/week @ 2024-10-18 151986/week @ 2024-10-25 135239/week @ 2024-11-01 146644/week @ 2024-11-08 151888/week @ 2024-11-15 131035/week @ 2024-11-22 126582/week @ 2024-11-29 118436/week @ 2024-12-06

555,744 downloads per month
Used in 302 crates (115 directly)

Apache-2.0

4MB
80K SLoC

Apache Parquet Official Native Rust Implementation

crates.io docs.rs

This crate contains the official Native Rust implementation of Apache Parquet, which is part of the Apache Arrow project.

See the API documentation for examples and the full API.

The API documentation for most recent, unreleased code is available here.

Rust Version Compatibility

This crate is tested with the latest stable version of Rust. We do not currently test against other, older versions of the Rust compiler.

Versioning / Releases

The parquet crate follows the SemVer standard defined by Cargo and works well within the Rust crate ecosystem. See the repository README for more details on the release schedule and version.

Note that for historical reasons, this crate uses versions with major numbers greater than 0.x (e.g. 19.0.0), unlike many other crates in the Rust ecosystem which spend extended time releasing versions 0.x to signal planned ongoing API changes. Minor arrow releases contain only compatible changes, while major releases may contain breaking API changes.

Feature Flags

The parquet crate provides the following features which may be enabled in your Cargo.toml:

  • arrow (default) - support for reading / writing arrow arrays to / from parquet
  • async - support async APIs for reading parquet
  • json - support for reading / writing json data to / from parquet
  • brotli (default) - support for parquet using brotli compression
  • flate2 (default) - support for parquet using gzip compression
  • lz4 (default) - support for parquet using lz4 compression
  • zstd (default) - support for parquet using zstd compression
  • snap (default) - support for parquet using snappy compression
  • cli - parquet CLI tools
  • crc - enables functionality to automatically verify checksums of each page (if present) when decoding
  • experimental - Experimental APIs which may change, even between minor releases

Parquet Feature Status

  • All encodings supported
  • All compression codecs supported
  • Read support
    • Primitive column value readers
    • Row record reader
    • Arrow record reader
    • Async support (to Arrow)
  • Statistics support
  • Write support
    • Primitive column value writers
    • Row record writer
    • Arrow record writer
    • Async support
  • Predicate pushdown
  • Parquet format 4.0.0 support

License

Licensed under the Apache License, Version 2.0: http://www.apache.org/licenses/LICENSE-2.0.

Dependencies

~13–43MB
~824K SLoC