rsonpath-lib

Blazing fast JSONPath query engine powered by SIMD. Core library of rsonpath.

24 releases (8 breaking)

0.9.1 Apr 3, 2024
0.9.0 Mar 28, 2024
0.8.7 Feb 29, 2024
0.8.4 Oct 30, 2023
0.1.2 Nov 19, 2022

#337 in Text processing

28 downloads per month
Used in rsonpath

MIT license

740KB
15K SLoC

rsonpath-lib – SIMD-powered JSONPath, as a library 🚀

Library for rsonpath, the JSONPath engine for querying massive streamed datasets.

The main target of this crate is the rsonpath CLI tool. Note that this API is unstable until we reach v1.0.0. This is going to happen (we have a roadmap), but our dev resources are quite limited. Contributions are welcome and appreciated.

Unsafe

The library uses unsafe for SIMD operations, because it has to, at least until portable-simd gets stabilized. Because of this, a compiled library is not portable – if you build on a platform supporting AVX2 and then run the same compiled code on a platform without it (an ARM machine, for example), it will crash. We take special care not to use unsafe code anywhere else – in fact, the crate uses #[forbid(unsafe_code)] when compiled without the default simd feature.
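To make the portability caveat concrete, here is a minimal std-only sketch of how a program could compare what its binary was compiled for against what the current CPU supports. The names `Backend`, `detect_backend` and `compiled_for_avx2` are hypothetical and are not part of the rsonpath-lib API; this is not a description of how the library itself selects its SIMD code.

```rust
/// Illustrative only: the two code paths a build might choose between.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum Backend {
    Avx2,
    Scalar,
}

/// Ask the CPU at runtime whether AVX2 instructions may be executed.
/// On non-x86 targets the check is compiled out and the answer is `Scalar`.
fn detect_backend() -> Backend {
    #[cfg(any(target_arch = "x86", target_arch = "x86_64"))]
    {
        if std::arch::is_x86_feature_detected!("avx2") {
            return Backend::Avx2;
        }
    }
    Backend::Scalar
}

/// True if this binary was compiled on the assumption that AVX2 is always
/// present (for example with `-C target-cpu=native` on an AVX2 machine).
fn compiled_for_avx2() -> bool {
    cfg!(target_feature = "avx2")
}

fn main() {
    let backend = detect_backend();
    println!(
        "runtime backend: {:?}, compiled for AVX2: {}",
        backend,
        compiled_for_avx2()
    );
    // A binary compiled for AVX2 is only valid on CPUs that report AVX2.
    if compiled_for_avx2() {
        assert_eq!(backend, Backend::Avx2);
    }
}
```

A check of this kind only tells you whether a mismatch exists; it does not make an AVX2 build safe to run elsewhere.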

Build & test

The dev workflow uses just, driven by the included Justfile. If it detects no Cargo in your environment, it will automatically install Rust for you using the rustup tool.

just build
just test

Architecture diagram

Below is a simplified overview of the module interactions and interfaces, and how data flows from the user's input (query, document) through the pipeline to produce results.

[Figure: architecture diagram]

Optional features

The simd feature is enabled by default. Keeping it enabled is recommended, since the project's performance benefits come from SIMD.

The arbitrary feature is optional and enables the arbitrary dependency, which provides an implementation of Arbitrary for the query struct.
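As a rough illustration of what a feature toggle means at compile time, the sketch below gates two versions of a function on a `simd` feature. The function `count_quotes` and its chunked "SIMD stand-in" are assumptions made for the example, not the crate's code; the Cargo.toml lines in the leading comment use standard Cargo syntax with the feature names listed above.

```rust
// Hypothetical downstream Cargo.toml, shown as a comment:
//
//   [dependencies]
//   # default build, `simd` enabled:
//   rsonpath-lib = "0.9.1"
//   # or: opt out of `simd` and opt in to `arbitrary`:
//   rsonpath-lib = { version = "0.9.1", default-features = false, features = ["arbitrary"] }

/// Portable fallback, compiled only when the `simd` feature is off.
#[cfg(not(feature = "simd"))]
fn count_quotes(bytes: &[u8]) -> usize {
    bytes.iter().filter(|&&b| b == b'"').count()
}

/// Stand-in for a vectorised path, compiled only when `simd` is on.
/// It walks 32-byte chunks the way a 256-bit register would; a real
/// implementation would use SIMD intrinsics here.
#[cfg(feature = "simd")]
fn count_quotes(bytes: &[u8]) -> usize {
    bytes
        .chunks(32)
        .map(|chunk| chunk.iter().filter(|&&b| b == b'"').count())
        .sum()
}

fn main() {
    // Exactly one `count_quotes` exists in any given build.
    let doc = br#"{"a": "b"}"#;
    assert_eq!(count_quotes(doc), 4);
}
```

Both versions return the same result, so callers do not need to know which one was compiled in.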

Dependencies

Showing direct dependencies.

cargo tree --package rsonpath-lib --edges normal --depth 1
rsonpath-lib v0.9.1 (/home/mat/src/rsonpath/crates/rsonpath-lib)
├── arbitrary v1.3.2
├── cfg-if v1.0.0
├── log v0.4.21
├── memmap2 v0.9.4
├── nom v7.1.3
├── rsonpath-syntax v0.3.1 (/home/mat/src/rsonpath/crates/rsonpath-syntax)
├── smallvec v1.13.2
├── static_assertions v1.1.0
├── thiserror v1.0.58
└── vector-map v1.0.1

Justification

  • cfg-if – used to support SIMD and no-SIMD versions.
  • memchr – rapid, SIMDified substring search for fast-forwarding to labels.
  • memmap2 – for fast reading of source files via a memory map instead of buffered copies.
  • nom – for parser implementation.
  • replace_with – for safe handling of internal classifier state when switching classifiers.
  • smallvec – crucial for small-stack performance.
  • static_assertions – extra reliability, by validating some constant assumptions at compile time.
  • thiserror – idiomatic Error implementations.
  • vector-map – used in the query compiler for measurably better performance.
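The "fast-forwarding to labels" idea mentioned for memchr can be sketched with the standard library alone. The function `find_label` below is a hypothetical, naive stand-in: it scans for the quoted label byte by byte, whereas a SIMD substring search does the same job much faster. It is not the engine's actual algorithm and it ignores escapes and labels that merely appear inside string values.

```rust
/// Return the offset of the first occurrence of `"label"` (quotes included)
/// at or after byte offset `from`, or `None` if there is none.
/// Naive scan: no escape handling, no distinction between keys and values.
fn find_label(doc: &[u8], label: &str, from: usize) -> Option<usize> {
    let quoted = format!("\"{label}\"");
    let needle = quoted.as_bytes();
    doc.get(from..)? // `None` if `from` is past the end of the document
        .windows(needle.len())
        .position(|window| window == needle)
        .map(|offset| offset + from)
}

fn main() {
    let doc = br#"{"a":{"name":1},"name":2}"#;
    // Jump straight to the first candidate, skipping everything before it.
    let first = find_label(doc, "name", 0);
    assert_eq!(first, Some(6));
    // Resume the search just past the previous hit.
    let second = find_label(doc, "name", 7);
    assert_eq!(second, Some(16));
    assert_eq!(find_label(doc, "missing", 0), None);
}
```

The point of such a search is that the bytes between candidate positions never need to be parsed.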
