#regex #bindings #text #ffi #native-bindings

sys vectorscan-rs-sys

Native bindings to the Vectorscan high-performance regex library

5 releases

0.0.5 Dec 9, 2024
0.0.4 Nov 7, 2024
0.0.3 Aug 22, 2024
0.0.2 Apr 18, 2024
0.0.1 Apr 5, 2024

#2370 in Parser implementations

Download history 21/week @ 2024-09-06 25/week @ 2024-09-13 51/week @ 2024-09-20 189/week @ 2024-09-27 136/week @ 2024-10-04 223/week @ 2024-10-11 170/week @ 2024-10-18 119/week @ 2024-10-25 166/week @ 2024-11-01 123/week @ 2024-11-08 164/week @ 2024-11-15 38/week @ 2024-11-22 127/week @ 2024-11-29 288/week @ 2024-12-06 155/week @ 2024-12-13 96/week @ 2024-12-20

666 downloads per month
Used in vectorscan-rs

Apache-2.0 OR MIT

2MB
807 lines

vectorscan-rs-sys

Overview

This crate implements minimal Rust bindings to the Vectorscan fork of Hyperscan, the high-performance regular expression engine. This crate builds a vendored copy of Vectorscan from source.

Dependencies

  • Boost >= 1.57
  • CMake
  • patch
  • Optional: Clang, when building with the bindgen feature

This has been tested on x86_64 Linux, x86_64 macOS, and aarch64 macOS.

Implementation Notes

This crate was originally written as part of Nosey Parker. It was adapted from the pyperscan project, which uses Rust to expose Hyperscan to Python. (That project is released under either the Apache 2.0 or MIT license.)

The only bindings exposed at present are for Vectorscan's block-based matching APIs. The various other APIs such as stream- and vector-based matching are not exposed. Other features, such as the Chimera PCRE library, test code, benchmark code, and supporting utilities are disabled.

The source of Vectorscan 5.4.11 is included here in the 5.4.11.tar.gz file.

At build time, a patch is applied to the Vectorscan sources. This patch does a few things:

  • The CMake-based build system is modified to eliminate the build-time dependency on ragel
  • The precompiled version of 4 Ragel .rl files are added to the source tree
  • The CMake-based build system is modified to allow disabling several components that are not used in this crate

License

This project is licensed under either of

at your option.

This project contains a vendored copy of Vectorscan, which is released under a 3-clause BSD license (LICENSE-VECTORSCAN).

Contributing

Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in vectorscan-rs-sys by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.

Dependencies