#hash-map #map #hybrid #small-vec #small

hybridmap

Hybrid map using smallvec and the std hashmap

2 releases

0.1.1 Jan 30, 2024
0.1.0 Jan 29, 2024

#6 in #hybrid

37 downloads per month

Apache-2.0

26KB
445 lines

HybridMap

HybridMap is a Rust™ hybrid map implementation that uses a vector on the memory stack for small maps and a hash map overwise.

As with most hybrid technologies, including two components instead of one is one too many. However, the hybrid solution can provide some value for specific use cases.

HybridMap can be slightly faster for tiny maps, especially short-lived ones living on the memory stack, usually up to 16 entries and without too many lookups.

Example

HybridMap can be used like most other maps.

use hybridmap::HybridMap;

let mut map = HybridMap::<i32, &str>::new();
map.insert(1, "one");
map.insert(2, "two");

assert_eq!(map.get(&1), Some(&"one"));
assert_eq!(map.len(), 2);

Benchmarks

The benchmark is unlikely to be representative of your use cases. You might see some of the gains shown below if you create many short-lived small maps. You may also get worse performances than a standard hash map.

You could adapt the benchmarks to your use cases. If you don't know whether you should use this hybrid map or a hashmap, you should go with a hashmap. As the numbers show, the performance gain is not that great.

Results on a Macbook Pro M1:

Type Map Size Median Time (ns) Performance Gain
i64 HashMap 1 248
i64 HybridMap 1 194 x1.28
i64 HashMap 4 1 117
i64 HybridMap 4 822 x1.36
i64 HashMap 16 4 581
i64 HybridMap 16 3 241 x1.41
i64 HashMap 128 36 593
i64 HybridMap 128 36 629 x1.0
uuid HashMap 1 335
uuid HybridMap 1 235 x1.43
uuid HashMap 4 1 610
uuid HybridMap 4 941 x1.71
uuid HashMap 16 6 346
uuid HybridMap 16 6 424 x0.99
uuid HashMap 128 49 799
uuid HybridMap 128 49 841 x1.0
string HashMap 1 1 176
string HybridMap 1 1 113 x1.06
string HashMap 4 5 313
string HybridMap 4 4 695 x1.13
string HashMap 16 21 626
string HybridMap 16 21 009 x1.03
string HashMap 128 156 010
string HybridMap 128 156 880 x0.99

In this benchmark, the HybridMap switches to a HashMap internally once it has more than 16 entries. This benchmark is not a very robust benchmark. Benchmarking HybridMap correctly is hard and requires more effort than implementing the crate. As the license says, use at your own risk.

However for tiny maps, that are short-lived, the performance gain could be more interesting:

Type Len Median Time (ns) Performance Gain
HashMap<Uuid,i64> 1 130
HashMap<Uuid,i64> 2 173
HybridMap<Uuid,i64,1> 1 50 x2.61
HybridMap<Uuid,i64,1> 2 174 x0.99
HybridMap<Uuid,i64,4> 1 53 x2.45
HybridMap<Uuid,i64,4> 2 80 x2.17
# Run the benchmarks
cargo bench --bench=hybridmap_bench -- --quick --quiet

# Run this command instead if you have more patience
cargo bench --bench=hybridmap_bench

# Open the results in a browser
open target/criterion/report/index.html
# or
xdg-open target/criterion/report/index.html

Memory Usage

HybridMap has a small memory overhead, the enum variant between the vector and the hashmap and a vector pre-allocated on the stack.

The default vector size on the stack is 8 entries. You may save a tiny bit of memory by adapting the vector size to the number of entries you expect to store in the maps. But a large vector will very quickly be a waste of resources. Consider staying below 20.

For maps containing very few entries, one or two, memory usage can be one order of magnitude smaller than a hashmap. Otherwise, the memory usage is similar to a normal hashmap.

You can adapt the benches/hybridmap_memory.rs file to your use case.

# Run the memory benchmark
# You will probably have to run it many times without things in the background
# to get a coherent result.
cargo bench --bench=hybridmap_memory

Why ?

I started benchmarking tiny maps to check whether I should switch from HashMap to BTreeMap for my use case. I also had a naive Vec implementation that was faster despite for small maps. Thus, I made this crate for fun.

The energy savings this crate may bring probably do not compensate for the energy I used to boil water for my tea while implementing this crate. But it was fun.

License

This project is licensed under the Apache License, Version 2.0 - see the LICENSE file for details.

Acknowledgements

Dependencies

~410KB