3 unstable releases
0.2.1 | Feb 7, 2023 |
---|---|
0.2.0 | Jan 22, 2023 |
0.1.0 | Jan 16, 2023 |
#49 in Text processing
101,384 downloads per month
Used in 57 crates
(27 directly)
87KB
2K
SLoC
glob-match
An extremely fast glob matching library with support for wildcards, character classes, and brace expansion.
- Linear time matching. No exponential backtracking.
- Zero allocations.
- No regex compilation. Matching occurs on the glob pattern in place.
- Support for capturing matched ranges of wildcards.
- Thousands of tests based on Bash and micromatch.
Example
use glob_match::glob_match;
assert!(glob_match("some/**/{a,b,c}/**/needle.txt", "some/path/a/to/the/needle.txt"));
Wildcard values can also be captured using the glob_match_with_captures
function. This returns a Vec
containing ranges within the path string that matched dynamic parts of the glob pattern. You can use these ranges to get slices from the original path string.
use glob_match::glob_match_with_captures;
let glob = "some/**/{a,b,c}/**/needle.txt";
let path = "some/path/a/to/the/needle.txt";
let result = glob_match_with_captures(glob, path)
.map(|v| v.into_iter().map(|capture| &path[capture]).collect());
assert_eq!(result, vec!["path", "a", "to/the"]);
Syntax
Syntax | Meaning |
---|---|
? |
Matches any single character. |
* |
Matches zero or more characters, except for path separators (e.g. / ). |
** |
Matches zero or more characters, including path separators. Must match a complete path segment (i.e. followed by a / or the end of the pattern). |
[ab] |
Matches one of the characters contained in the brackets. Character ranges, e.g. [a-z] are also supported. Use [!ab] or [^ab] to match any character except those contained in the brackets. |
{a,b} |
Matches one of the patterns contained in the braces. Any of the wildcard characters can be used in the sub-patterns. Braces may be nested up to 10 levels deep. |
! |
When at the start of the glob, this negates the result. Multiple ! characters negate the glob multiple times. |
\ |
A backslash character may be used to escape any of the above special characters. |
Benchmarks
globset time: [35.176 µs 35.200 µs 35.235 µs]
glob time: [339.77 ns 339.94 ns 340.13 ns]
glob_match time: [179.76 ns 179.96 ns 180.27 ns]
Fuzzing
You can fuzz glob-match
itself using cargo fuzz
. See the
Rust Fuzz Book for
guidance on setup and installation. Follow the Rust Fuzz Book for information on
how to configure and run Fuzz steps.
After discovering artifacts, use cargo fuzz fmt [target] [artifact-path]
to
get the original input back.
$ cargo fuzz fmt both_fuzz fuzz/artifacts/both_fuzz/slow-unit-LONG_HASH
Output of `std::fmt::Debug`:
Data {
pat: "some pattern",
input: "some input",
}