|Jan 12, 2024
|Nov 13, 2023
|Nov 13, 2023
|Sep 22, 2023
|Oct 17, 2022
#60 in Testing
1,526 downloads per month
Used in chewdata
Native Rust implementation of Apache Arrow and Parquet
Welcome to the implementation of Arrow, the popular in-memory columnar format, in Rust.
This repo contains the following main components:
|Latest API Docs
|Core functionality (memory layout, arrays, low level computations)
|Support for Parquet columnar file format
|Support for Arrow-Flight IPC protocol
|Support for object store interactions (aws, azure, gcp, local, in-memory)
The current development version the API documentation in this repo can be found here.
There are two related crates in a different repository
|In-memory query engine with SQL support
|Distributed query execution
Collectively, these crates support a vast array of functionality for analytic computations in Rust.
For example, you can write an SQL query or a
DataFrame (using the
datafusion crate), run it against a parquet file (using the
parquet crate), evaluate it in-memory using Arrow's columnar format (using the
arrow crate), and send to another process (using the
Generally speaking, the
arrow crate offers functionality for using Arrow arrays, and
datafusion offers most operations typically found in SQL, including
joins and window functions.
You can find more details about each crate in their respective READMEs.
Arrow Rust Community
firstname.lastname@example.org mailing list serves as the core communication channel for the Arrow community. Instructions for signing up and links to the archives can be found at the Arrow Community page. All major announcements and communications happen there.
The Rust Arrow community also uses the official ASF Slack for informal discussions and coordination. This is
a great place to meet other contributors and get guidance on where to contribute. Join us in the
#arrow-rust channel and feel free to ask for an invite via:
The Rust implementation uses GitHub issues as the system of record for new features and bug fixes and this plays a critical role in the release process.
For design discussions we generally collaborate on Google documents and file a GitHub issue linking to the document.
There is more information in the contributing guide.
Support for the Apache Arrow JSON test data format
These utilities define structs that read the integration JSON format for integration testing purposes.
This is not a canonical format, but provides a human-readable way of verifying language implementations