#merkle-tree #tree #tree-structure #merkle #binary #sparse-matrix #patricia

bin+lib starling

This tree structure is a binary merkle tree with branch compression via split indexes

37 stable releases (3 major)

4.0.0 Jul 11, 2022
3.2.3 Sep 7, 2020
3.2.2 Jun 17, 2020
3.1.4 Mar 11, 2020
1.0.5 Dec 21, 2018

#316 in Data structures

MIT/Apache

130KB
3K SLoC

GitHub release Crates.io Crates.io GitHub last commit dependency status GitHub issues Codecov branch
Crates.io Gitter Donate

Merkle Binary Indexed Tree (Merkle-BIT)

This tree structure is a binary merkle tree with branch compression via split indexes. This structure can be used to store multiple versions of tree state without any duplication of the stored data, either in memory or on disk. See here and here for a basic explanation of its purpose.

Basic Usage

To quickly get started and get a feel for the Merkle-BIT, you can use the already implemented HashTree structure.

    use std::error::Error;
    use starling::hash_tree::HashTree;
    
    fn main() -> Result<Ok(), Error> {
        let tree = HashTree::new(8)?;
        
        // Keys must be of fixed size
        let mut key: Array<32> = [0xFF; 32].into();
        
        // Value to be put into the tree
        let value: Vec<u8> = vec![0xDDu8];
        
        // Inserting an element changes the root node
        let root = tree.insert(None, &mut [&key], &[value])?;
        
        let retrieved_value = tree.get(&root, &mut [&key])?;
        
        // Removing a root only deletes elements that are referenced only by that root
        tree.remove(&root)?;
        Ok(())
    }

This structure can be used for small amounts of data, but all the data in the tree will persist in memory unless explicitly pruned.

For larger numbers of items to store in the tree, it is recommended to connect the structure to a database by implementing the Database trait for your database. This structure will also take advantage of batch writes if your database supports it.

Benchmarks

Below are the benchmarks when using starling on an in-memory database on a reasonably fast machine:

Operation Num. Entries Is Tree Empty? Measured Benchmark
insertion 1 yes 0.407μs
insertion 10 yes 5.136μs
insertion 100 yes 46.796μs
insertion 1000 yes 480.060μs
insertion 10000 yes 7,219.300μs
insertion 1 no 6.315μs
insertion 10 no 19.400μs
insertion 100 no 149.710μs
insertion 1000 no 1,517.700μs
insertion 10000 no 15,043.000μs
retrieval 4096 no 2,889.100μs
retrieval 10000 no 9,437.100μs
removal 4096 no 0.070μs
removal 10000 no 0.071μs

Features

Starling supports a number of serialization and hashing schemes for use in the tree, which should be selected based on your performance and application needs.

Currently, integrated serialization schemes include:

  • bincode
  • serde-json
  • serde-cbor
  • serde-yaml
  • serde-pickle
  • ron

It should be noted that any serialization scheme will work with starling, provided you implement the Encode and Decode traits for the node types.

Currently, integrated tree hashing schemes include:

  • Blake2b via blake2_rfc
  • Groestl via groestl
  • SHA2 via openssl
  • SHA3 via tiny-keccak
  • Keccak via tiny-keccak
  • SeaHash via seahash
  • FxHash via fxhash
  • and most updated hashes from RustCrypto

You may also use the default Rust hasher, or implement the Hasher trait for your own hashing scheme (unless using a hash from RustCrypto, then you will want to enable the digest feature, which implements Hasher for Digest).

You can also use RocksDB to handle storing and loading from disk. You can use the RocksTree with a serialization scheme via the --features="rocksdb bincode" command line flags or by enabling the features in your Cargo.toml manifest.

Some enabled features must be used in combination, or you must implement the required traits yourself (E.g. using the rocksdb feature alone will generate a compiler error, you must also select a serialization scheme, such as bincode or implement it for your data).

Finally, you can take advantage of the hashbrown to use the hasbrown crate instead of the standard library HashMap.

Full Customization

To use the full power of the Merkle-BIT structure, you should customize the structures stored in the tree to match your needs.

If you provide your own implementation of the traits for each component of the tree structure, the tree can utilize them over the default implementation.

    use starling::merkle_bit::MerkleBIT;
    use starling::Array;
    use std::path::Path;
    use std::error::Error;
    
    fn main() -> Result<Ok, Error> {
        // A path to a database to be opened
        let path = Path::new("some path");
        
        // Your own database library
        let db = YourDB::open(&path);
        
        // These type annotations are required to specialize the Merkle BIT
        // Check the documentation for the required trait bounds for each of these types.
        pub struct MyTree;
        
        impl MerkleTree for MyTree {
            type Database = MyDatabase;
            type Branch = MyBranch;
            type Leaf = MyLeaf;
            type Data = MyData;
            type Node = MyNode;
            type Hasher = MyHasher;
            type Value = Myvalue;
        }
        let mbit = MerkleBIT<MyTree, 32>::from_db(db, depth);
                             
        // Keys must be of fixed size
        let key: Array<32> = [0xFF; 32].into();
        
        // An example value created from `MyValue`.  
        let value: MyValue = MyValue::new("Some value");
        
        // You can specify a previous root to add to, in this case there is no previous root
        let root: Array<32> = mbit.insert(None, &mut [key], &[value])?;

        // Every time an element is added or removed a new root is created.
        let new_key: Array<32> = [0xEE; 32].into();
        let new_value: ValueType = MyValue::new("Some new value");
        let new_root: Array<32> = mbit.insert(&root, &mut [key], &[value])?;
        
        // Retrieving the inserted value
        let inserted_values: HashMap<&Array<32>, Option<MyValue>> = mbit.get(&root, &mut [key])?;

        // You must ensure that the root you supply matches a root where the key existed when retrieving items
        // This line will fail to find the `new_value`
        let empty_map = mbit.get(&root, &mut [new_key])?;

        // This line will succeed in finding values for both `key` and `new_key`
        let inhabited_map = mbit.get(&new_root, &mut [key, new_key])?;

        
        // Removing a tree root
        mbit.remove(&root)?;

        // This line will fail to find a value for `key` but will succeed in finding the value for `new_key`
        let partially_inhabited_map = mbit.get(&new_root, &mut [key, new_key])?;
        Ok(())
    }

Verification

The MerkleBIT also supports generating and verifying merkle inclusion proofs, and may be used like below:

    use starling::hash_tree::HashTree;
    use std::error::Error;
    
    fn main() -> Result<Ok, Error> {
        let tree = HashTree::new(8)?;
        
        let mut key: Array<32> = [0xFF; 32].into();
        let value: Vec<u8> = vec![0xDDu8];
        
        let root: Array<32> = tree.insert(None, &mut [&key], &[value])?;
        
        // An inclusion proof that proves membership of a key in the tree
        let proof: Vec<(Array<32>, bool)> = tree.generate_inclusion_proof(&root, key)?;
        
        // If the proof is valid, it will return Ok(())
        HashTree::verify_inclusion_proof(&root, key, &value, &proof)?;
        Ok(())
    }

License

Licensed under either of

at your option.

Contribution

Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.

Reporting

The project is currently undergoing rapid development and it should be noted that minor releases may include breaking changes to the API. These changes will be noted in the Changelog of each release, but if we broke something or forgot to mention such a change, please file an issue or submit a pull request and we will review it at our earliest convenience.

Support

Do you use this crate and would like to ensure continued support? Please consider supporting me via Github Sponsors at my sponsor page.

Acknowledgments

Special thanks to Niall Moore and Owen Delahoy for assistance with the early phases of this project.

Dependencies

~0–6MB
~110K SLoC