#k-mer #sequencing #graph #dna

debruijn

Tools for DNA sequences: efficient k-mer manipulation, De Bruijn graph construction and compaction and handling of DNA strings

3 releases

0.3.4 Sep 9, 2021
0.3.3 Aug 27, 2021
0.3.2 Jun 25, 2020

#274 in Biology

Download history 84/week @ 2024-12-11 31/week @ 2024-12-18 1/week @ 2024-12-25 32/week @ 2025-01-01 78/week @ 2025-01-08 56/week @ 2025-01-15 36/week @ 2025-01-22 91/week @ 2025-01-29 182/week @ 2025-02-05 37/week @ 2025-02-12 50/week @ 2025-02-19 42/week @ 2025-02-26 35/week @ 2025-03-05 60/week @ 2025-03-12 103/week @ 2025-03-19 40/week @ 2025-03-26

243 downloads per month
Used in 6 crates

MIT license

215KB
5K SLoC

rust-debruijn

De Bruijn graph construction & path compression libraries.

Docs

Key features

  • 2-bit packed fixed-length (Kmer) and variable-length (DnaString) sequence containers
  • Statically compiled code paths for different K values
  • Ability to track arbitrary auxiliary data through the DeBruijn graph
  • Customizable kmer counting & filtering schemes supporting a variety of use cases
  • DeBruijn graph compression
  • Minimum-substring partitioning to shard kmers for memory efficient counting and DeBruijn graph compression
  • Configurable for stranded and non-stranded input sequence
  • Extensive unit test suite
  • In production use in Supernova, Long Ranger, Cell Ranger, and Cell Ranger VDJ pipelines from 10x Genomics.

Dependencies

~2.6–4MB
~77K SLoC