na_seq

DNA and RNA sequence types and functions

3 unstable releases

new 0.2.1 Oct 16, 2024
0.2.0 Oct 6, 2024
0.1.0 Oct 6, 2024

#78 in Biology

262 downloads per month
Used in plascad

MIT license

30KB
618 lines

Nucleic Acid Sequence tools

This small library provides types and functions for operating on DNA sequences. Its most fundamental type is the Nucleotide enum, which represents a single DNA nucleotide. The library is general-purpose and intended for any program or library that works with DNA sequences.

It includes functions to convert between &[Nucleotide] and strings, and to and from u8 integer representations. It also includes functions to serialize and deserialize sequences in a compact binary format that uses 2 bits per nucleotide.
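To illustrate how a 2-bits-per-nucleotide format can work, here is a minimal, self-contained sketch. It does not use the crate's API: the Nt enum, the bit assignments, the byte layout, and every function name below are assumptions made for illustration, and the crate's actual encoding may differ.

```rust
/// Hypothetical stand-in for a nucleotide type (not the crate's definition).
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum Nt {
    A,
    C,
    G,
    T,
}

impl Nt {
    /// 2-bit code for this nucleotide. The mapping is an arbitrary choice.
    pub fn to_bits(self) -> u8 {
        match self {
            Nt::A => 0b00,
            Nt::C => 0b01,
            Nt::G => 0b10,
            Nt::T => 0b11,
        }
    }

    /// Inverse of `to_bits`; only the two lowest bits are inspected.
    pub fn from_bits(bits: u8) -> Nt {
        match bits & 0b11 {
            0b00 => Nt::A,
            0b01 => Nt::C,
            0b10 => Nt::G,
            _ => Nt::T,
        }
    }

    /// Parse one letter, case-insensitively. Returns None for other input.
    pub fn from_char(c: char) -> Option<Nt> {
        match c.to_ascii_uppercase() {
            'A' => Some(Nt::A),
            'C' => Some(Nt::C),
            'G' => Some(Nt::G),
            'T' => Some(Nt::T),
            _ => None,
        }
    }

    pub fn to_char(self) -> char {
        match self {
            Nt::A => 'A',
            Nt::C => 'C',
            Nt::G => 'G',
            Nt::T => 'T',
        }
    }
}

/// Parse a string into a sequence; None if any character is not A, C, G, or T.
pub fn seq_from_str(s: &str) -> Option<Vec<Nt>> {
    s.chars().map(Nt::from_char).collect()
}

pub fn seq_to_string(seq: &[Nt]) -> String {
    seq.iter().map(|nt| nt.to_char()).collect()
}

/// Pack four nucleotides per byte, first nucleotide in the highest two bits.
/// The sequence length is not stored here and must be kept separately.
pub fn pack(seq: &[Nt]) -> Vec<u8> {
    let mut out = vec![0u8; (seq.len() + 3) / 4];
    for (i, nt) in seq.iter().enumerate() {
        let shift = 6 - 2 * (i % 4);
        out[i / 4] |= nt.to_bits() << shift;
    }
    out
}

/// Unpack `len` nucleotides. Panics if `bytes` holds fewer than `len` entries.
pub fn unpack(bytes: &[u8], len: usize) -> Vec<Nt> {
    (0..len)
        .map(|i| {
            let shift = 6 - 2 * (i % 4);
            Nt::from_bits(bytes[i / 4] >> shift)
        })
        .collect()
}
```

With this layout a sequence of n nucleotides takes ceil(n / 4) bytes. Because the padding bits in the last byte are indistinguishable from the code for A, the length has to be stored alongside the packed bytes.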

See the docs for details on data structures and functions available.

Utility functionality

  • Sequence and nucleotide complements
  • Sequence and nucleotide weight
  • GC content
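As a rough idea of what the complement and GC-content utilities listed above compute, here is a small, self-contained sketch. The Nt enum and the function names are hypothetical and are not taken from the crate; in particular, whether the crate's sequence complement also reverses the sequence is not stated here, so both variants are shown.

```rust
/// Hypothetical stand-in for a nucleotide type (not the crate's definition).
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum Nt {
    A,
    C,
    G,
    T,
}

impl Nt {
    /// Watson-Crick complement: A pairs with T, C pairs with G.
    pub fn complement(self) -> Nt {
        match self {
            Nt::A => Nt::T,
            Nt::T => Nt::A,
            Nt::C => Nt::G,
            Nt::G => Nt::C,
        }
    }
}

/// Complement each position, keeping the original order.
pub fn complement(seq: &[Nt]) -> Vec<Nt> {
    seq.iter().map(|nt| nt.complement()).collect()
}

/// Complement read in the opposite direction, i.e. the other strand
/// written 5' to 3'.
pub fn reverse_complement(seq: &[Nt]) -> Vec<Nt> {
    seq.iter().rev().map(|nt| nt.complement()).collect()
}

/// Fraction of positions that are G or C, in the range 0.0..=1.0.
/// An empty sequence is reported as 0.0 by convention in this sketch.
pub fn gc_content(seq: &[Nt]) -> f64 {
    if seq.is_empty() {
        return 0.0;
    }
    let gc = seq
        .iter()
        .filter(|&&nt| matches!(nt, Nt::G | Nt::C))
        .count();
    gc as f64 / seq.len() as f64
}
```

For the sequence AACG, this sketch gives TTGC as the complement, CGTT as the reverse complement, and a GC content of 0.5.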

We may add alignment, sequence searches, and other utility features in the future.

It also includes basic restriction enzyme functionality.

This library is used by the PlasCAD plasmid editor.

Dependencies

~0.5–1MB
~23K SLoC