#data-fusion #orc #arrow-rs #arrow

datafusion-orc

Implementation of Apache ORC file format using Apache Arrow in-memory format

3 releases (breaking)

new 0.7.0 Jan 13, 2026
0.6.0 Nov 28, 2025
0.5.0 Oct 20, 2025

#2182 in Database interfaces

Download history 65/week @ 2025-10-14 935/week @ 2025-10-21 1342/week @ 2025-10-28 1344/week @ 2025-11-04 1015/week @ 2025-11-11 2337/week @ 2025-11-18 4484/week @ 2025-11-25 2960/week @ 2025-12-02 4590/week @ 2025-12-09 3779/week @ 2025-12-16 3510/week @ 2025-12-23 2391/week @ 2025-12-30 4096/week @ 2026-01-06

14,598 downloads per month

Apache-2.0

27KB
405 lines

Integration with Apache DataFusion query engine to allow querying ORC files with a SQL/DataFrame API.

Example usage

let ctx = SessionContext::new();
ctx.register_orc(
    "table1",
    "/path/to/file.orc",
    OrcReadOptions::default(),
)
.await?;

ctx.sql("select a, b from table1")
    .await?
    .show()
    .await?;

test

datafusion-orc

Experimental ORC file reader for DataFusion based on orc-rust.

Dependencies

~94MB
~1.5M SLoC