33 releases

new 0.18.1 Jun 8, 2024
0.17.10 Feb 5, 2024
0.17.8 Dec 20, 2023
0.17.3 Nov 20, 2023
0.2.2 Mar 5, 2021


JSON to Arrow


Convert JSON files to Apache Arrow. This package is part of Arrow CLI tools.


Download prebuilt binaries

You can get the latest releases from https://github.com/domoritz/arrow-tools/releases.

With Homebrew

brew install domoritz/homebrew-tap/json2arrow

With Cargo

cargo install json2arrow

With Cargo B(inary)Install

To avoid re-compilation and speed up installation, you can install this tool with cargo binstall:

cargo binstall json2arrow


Usage: json2arrow [OPTIONS] <JSON> [ARROW]

  <JSON>   Input JSON file, stdin if not present
  [ARROW]  Output file, stdout if not present

  -s, --schema-file <SCHEMA_FILE>
          File with Arrow schema in JSON format
  -m, --max-read-records <MAX_READ_RECORDS>
          The number of records to infer the schema from. All rows if not present. Setting max-read-records to zero will stop schema inference and all columns will be string typed
  -p, --print-schema
          Print the schema to stderr
  -n, --dry
          Only print the schema
  -h, --help
          Print help
  -V, --version
          Print version

The --schema-file option uses the same file format as --dry and --print-schema.
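
When json2arrow is driven from a script, it can help to assemble the command line in one place. The following is a minimal Python sketch, not part of json2arrow itself: the helper name `build_json2arrow_args` and its parameters are hypothetical, and it uses only the flags listed in the help text above. It builds the argument list without running anything.

```python
def build_json2arrow_args(json_path, arrow_path=None, schema_file=None,
                          max_read_records=None, print_schema=False, dry=False):
    """Build an argv list for `json2arrow [OPTIONS] <JSON> [ARROW]`.

    Hypothetical helper: it only mirrors the options shown in the help text.
    """
    args = ["json2arrow"]
    if schema_file is not None:
        args += ["--schema-file", str(schema_file)]
    if max_read_records is not None:
        # 0 is a meaningful value (it disables schema inference), so test against None.
        args += ["--max-read-records", str(max_read_records)]
    if print_schema:
        args.append("--print-schema")
    if dry:
        args.append("--dry")
    args.append(str(json_path))
    if arrow_path is not None:
        args.append(str(arrow_path))
    return args


if __name__ == "__main__":
    # Print the command that would only report the inferred schema.
    print(" ".join(build_json2arrow_args("data.json", dry=True)))
```

The resulting list can be passed to `subprocess.run` once json2arrow is installed; the file names here are placeholders.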


For usage examples, see the examples for csv2parquet, which has a similar interface.


Since we use the Arrow JSON loader, we are limited to what it supports. Right now, it supports line-delimited JSON files, with one JSON object per line:

{ "a": 42, "b": true }
{ "a": 12, "b": false }
{ "a": 7, "b": true }
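
If your data is held as a list of records (for example, parsed from a single JSON array), it has to be rewritten as one object per line before conversion. A minimal Python sketch using only the standard library; the function name `to_json_lines` is hypothetical and not part of json2arrow:

```python
import json


def to_json_lines(records):
    """Serialize an iterable of dicts as line-delimited JSON, one object per line."""
    # json.dumps with default settings never emits a raw newline,
    # so each record is guaranteed to stay on a single line.
    return "".join(json.dumps(record) + "\n" for record in records)


if __name__ == "__main__":
    rows = [{"a": 42, "b": True}, {"a": 12, "b": False}, {"a": 7, "b": True}]
    print(to_json_lines(rows), end="")
```

The output can be written to a file and passed as the `<JSON>` argument.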

