3 releases
| Version | Released |
|---|---|
| 0.1.2 | Dec 2, 2024 |
| 0.1.1 | Dec 2, 2024 |
| 0.1.0 | Dec 2, 2024 |
#664 in Development tools
157 downloads per month
Used in liberty-db
13KB
182 lines
hashmatch
More efficient static &str matching when the number of match arms exceeds 30.
Usage
use hashmatch::{hash_arm, hash_str};
// Deny unreachable patterns so that a hash collision between two arms becomes a compile error
#[deny(unreachable_patterns)]
let res = match hash_str("ABC") {
hash_arm!("ABC") => 1,
hash_arm!("AAA") | hash_arm!("BBB") => 2,
_ => 3,
};
assert_eq!(res, 1);
The hash_arm! macro computes each arm's hash at compile time, so the match above expands to:
let res = match hash_str("ABC") {
5661343534983258464u64 => 1,
8040774132769170485u64 | 15932663093540935610u64 => 2,
_ => 3,
};
Why hashmatch?
With current rustc, matching a string against a large number of arms has linear complexity, O(n), whereas matching a hash (u64) is O(log(n)).
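To illustrate why integer keys allow a logarithmic lookup, here is a minimal sketch that binary-searches a sorted table of the three hash values from the expansion shown under Usage. It is only a stand-in for the idea: it is not the code rustc emits for an integer match, and the names ARMS and lookup are hypothetical.

```rust
/// (hash, result) pairs taken from the expanded match under Usage,
/// kept sorted by hash so that binary search applies.
static ARMS: [(u64, u32); 3] = [
    (5661343534983258464, 1),
    (8040774132769170485, 2),
    (15932663093540935610, 2),
];

/// Hypothetical helper: finds the result for `hash` in O(log(n)) comparisons,
/// falling back to 3 like the `_` arm of the original match.
fn lookup(hash: u64) -> u32 {
    match ARMS.binary_search_by_key(&hash, |&(h, _)| h) {
        Ok(i) => ARMS[i].1,
        Err(_) => 3,
    }
}
```

Each comparison here is a single u64 comparison, whereas a chain of string comparisons has to inspect the bytes of every candidate in turn.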
hashmatch matches on the string's hash from two sides:
- For each match arm, hashmatch::hash_arm! computes the string's hash at compile time.
- For the value being matched, hashmatch::hash_str computes the string's hash at runtime, consistent with hashmatch::hash_arm!.
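The two sides can be sketched with a plain const fn, assuming FNV-1a as a stand-in hash. This is not hashmatch's implementation: the function fnv1a, the constants, and classify are all hypothetical, and the real crate uses a macro rather than named constants.

```rust
/// FNV-1a written as a `const fn`, so the same code can run at compile time
/// and at runtime. Illustrative stand-in, not the hash used by hashmatch.
const fn fnv1a(s: &str) -> u64 {
    let bytes = s.as_bytes();
    let mut hash: u64 = 0xcbf29ce484222325; // FNV-1a 64-bit offset basis
    let mut i = 0;
    while i < bytes.len() {
        hash ^= bytes[i] as u64;
        hash = hash.wrapping_mul(0x100000001b3); // FNV-1a 64-bit prime
        i += 1;
    }
    hash
}

// Arm side: these hashes are evaluated by the compiler.
const ABC: u64 = fnv1a("ABC");
const AAA: u64 = fnv1a("AAA");
const BBB: u64 = fnv1a("BBB");

/// Value side: the input is hashed at runtime with the same function,
/// so it agrees with the compile-time constants.
fn classify(s: &str) -> u32 {
    match fnv1a(s) {
        ABC => 1,
        AAA | BBB => 2,
        _ => 3,
    }
}
```

Because both sides call the same function, classify("ABC") takes the first arm and an unlisted string such as "zzz" falls through to the default.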
However, there is a potential risk when the input is attacker-controlled, for example under DoS attacks.
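The text does not spell out the risk. One plausible reading is that the expanded match compares only 64-bit hashes, so an input crafted to collide with an arm's hash would take that arm. A possible mitigation, sketched below under that assumption, is to confirm the string in a match guard. The hash function fnv1a and the function classify_checked are hypothetical and not part of hashmatch.

```rust
/// Hypothetical stand-in hash (FNV-1a), usable at compile time and runtime.
const fn fnv1a(s: &str) -> u64 {
    let bytes = s.as_bytes();
    let mut hash: u64 = 0xcbf29ce484222325;
    let mut i = 0;
    while i < bytes.len() {
        hash ^= bytes[i] as u64;
        hash = hash.wrapping_mul(0x100000001b3);
        i += 1;
    }
    hash
}

const ABC: u64 = fnv1a("ABC");
const AAA: u64 = fnv1a("AAA");
const BBB: u64 = fnv1a("BBB");

/// Matches on the hash first, then confirms the actual string in a guard,
/// so a colliding input falls through to the default arm.
fn classify_checked(s: &str) -> u32 {
    match fnv1a(s) {
        ABC if s == "ABC" => 1,
        AAA if s == "AAA" => 2,
        BBB if s == "BBB" => 2,
        _ => 3,
    }
}
```

The guard adds one string comparison on the matching arm only, so the hash still does the work of narrowing the candidates.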
Benchmark
You can run the benchmark in your own environment with cargo bench, which takes about 20 minutes.
The benchmark compares four methods:
match_str: Match the string directly. O(n)
match s {
    "xrxeclxu" => Some(0),
    "vukddz" => Some(1),
    "qwhkdyjog" => Some(2),
    "dpesutax" => Some(3),
    "tqgzzfcblp" => Some(4),
    _ => None,
}
match_hash: (this crate) Compute the string's hash, then match on the u64. O(log(n))
match hash_str(s) {
    hash_arm!("xrxeclxu") => Some(0),
    hash_arm!("vukddz") => Some(1),
    hash_arm!("qwhkdyjog") => Some(2),
    hash_arm!("dpesutax") => Some(3),
    hash_arm!("tqgzzfcblp") => Some(4),
    _ => None,
}
lookup_phf: Create a const hashmap at compile time. O(1)
const STRING_MAP: phf::Map<&str, usize> = phf::phf_map! {
    "xrxeclxu" => 0,
    "vukddz" => 1,
    "qwhkdyjog" => 2,
    "dpesutax" => 3,
    "tqgzzfcblp" => 4
};
STRING_MAP.get(s).copied()
lookup_lazy: Create a lazily initialized hashmap. O(1)
static STRING_MAP: std::sync::LazyLock<foldhash::HashMap<&str, usize>> =
    std::sync::LazyLock::new(|| {
        use foldhash::HashMapExt;
        let mut map = foldhash::HashMap::with_capacity(5);
        map.insert("xrxeclxu", 0);
        map.insert("vukddz", 1);
        map.insert("qwhkdyjog", 2);
        map.insert("dpesutax", 3);
        map.insert("tqgzzfcblp", 4);
        map
    });
STRING_MAP.get(s).copied()
Dependencies
~1.5MB
~39K SLoC