#apache-arrow #query-execution #data-access #arrow #query #sql-query #sql

datafusion-data-access

General data access layer currently mainly based on the object store interfaces

3 stable releases

10.0.0 Jul 15, 2022
9.0.0 Jun 13, 2022
8.0.0 May 16, 2022

#16 in #query-execution

Download history 36/week @ 2024-03-11 9/week @ 2024-03-18 16/week @ 2024-03-25 84/week @ 2024-04-01 4/week @ 2024-04-08 8/week @ 2024-04-15 29/week @ 2024-04-22 8/week @ 2024-04-29 5/week @ 2024-05-06 15/week @ 2024-05-13 2/week @ 2024-05-20 7/week @ 2024-05-27 21/week @ 2024-06-03 11/week @ 2024-06-10 7/week @ 2024-06-17 13/week @ 2024-06-24

53 downloads per month
Used in 3 crates (via datafusion-objectstore-s3)

Apache-2.0

22KB
361 lines

DataFusion Data Access Layer

DataFusion is an extensible query execution framework, written in Rust, that uses Apache Arrow as its in-memory format.

This crate is a submodule of DataFusion that provides an async API for accessing data, either remotely or locally. Currently, it is based on the object store interfaces. In the future, this module may include interfaces for accessing databases, or streaming data.

Dependencies

~6–17MB
~220K SLoC