4 releases (2 breaking)
0.3.0 | Apr 1, 2023 |
---|---|
0.2.0 | Jan 30, 2023 |
0.1.1 | Sep 12, 2022 |
0.1.0 | Sep 12, 2022 |
#1751 in Text processing
63 downloads per month
78KB
187 lines
wildflower
wildflower is a Rust library that performs wildcard matching against strings.
It's fast, ergonomic, zero-copy, and works on no_std
.
Usage
The wildcard matching grammar contains the following special characters:
?
matches a single character.*
matches zero or more characters.\
escapes these special characters.
A pattern is constructed from a UTF-8-encoded string which may contain these special characters. When a pattern is created, the given source string is parsed and compiled into an optimized internal form. Since no internal state is maintained between matches, it is recommended that you reuse patterns for best results.
Alternatives
wildmatch is the closest alternative at the time of writing. Unfortunately, it explicitly does not support escaping special characters, which I found to be a significant limitation to warrant an alternative. wildflower also performs certain optimizations that make it more performant when matching, in many cases by an order of magnitude (see benchmarks).
Several other crates exist for pattern matching, namely regex (for regular expressions) and glob (for Unix shell patterns).
Benchmarking
Using a benchmark similar to the one found in wildmatch (source), I obtained the following results on my machine:
Benchmark | wildflower | wildmatch | regex | glob |
---|---|---|---|---|
compiling/text | 362 ns | 390 ns | 131,770 ns | 2,041 ns |
compiling/complex | 218 ns | 47 ns | 84,236 µs | 165 ns |
matching/text | 7 ns | 416 ns | 415 ns | 832 ns |
matching/complex | 104 ns | 494 ns | 409 ns | 2,222 ns |
In this benchmark run, wildflower is shown to be 76x and 4x as fast as wildmatch in the simple and complex case of matching respectfully. It could certainly stand to see performance improvements in compiling, but even in the worst case of a single-use compilation, it still outperforms wildmatch.
Credits
Credit to Armin Becher for the benchmarking code and table format from wildmatch, and to Ilona Ilyés of Pixabay for the original of the cat image featured above.
Dependencies
~0.4–0.9MB
~18K SLoC