#sequence-alignment #nlp #dna #needleman-wunsch #smith-waterman

seal

Implementation of Needleman-Wunsch & Smith-Waterman sequence alignment

4 releases

0.1.5 May 12, 2022
0.1.4 Apr 5, 2022
0.1.3 Mar 6, 2022
0.1.2 Dec 12, 2021
0.1.0 May 3, 2017

#79 in Biology

529 downloads per month
Used in 2 crates (via stam-tools)

MPL-2.0 license

29KB
761 lines

Synopsis

A Rust implementation of Needleman-Wunsch & Smith-Waterman sequence alignment.

Motivation

The aim of this crate is to provide a memory- and time-efficient implementation of Needleman-Wunsch as well as Smith-Waterman sequence alignment using a unified API.

Getting Started

Add the most recent version of seal to your dependencies in your project's Cargo.toml.
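
A minimal sketch of the dependency entry, assuming 0.1.5 (the newest release listed above) is still the latest; check the registry for the current version before copying it:

```toml
[dependencies]
seal = "0.1.5"
```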

If your crate uses the Rust 2015 edition, also add …

extern crate seal;

… to your crate's root file (e.g. lib.rs, main.rs). On the 2018 edition and later this line is optional.

Once that's done you're ready to play!

Example

extern crate seal;

use seal::pair::{
    Alignment, AlignmentSet, InMemoryAlignmentMatrix, NeedlemanWunsch, SmithWaterman, Step,
};

fn main() {
    let str_x = "The quick brown fox jumps over the lazy dog.";
    let str_y = "The brown dog jumps over the very lazy snail.";

    let strategy = NeedlemanWunsch::new(1, -1, -1, -1);
    // Alternatively:
    // let strategy = SmithWaterman::new(2, -1, -1, -1);

    let sequence_x: Vec<char> = str_x.chars().collect();
    let sequence_y: Vec<char> = str_y.chars().collect();
    let set: AlignmentSet<InMemoryAlignmentMatrix> =
        AlignmentSet::new(sequence_x.len(), sequence_y.len(), strategy, |x, y| {
            sequence_x[x] == sequence_y[y]
        })
        .unwrap();

    let print_alignment = |alignment: Alignment| {
        for step in alignment.steps() {
            match step {
                Step::Align { x, y } => {
                    if sequence_x[x] == sequence_y[y] {
                        print!("=")
                    } else {
                        print!("!")
                    }
                }
                Step::Delete { .. } => print!("-"),
                Step::Insert { .. } => print!("+"),
            }
        }
        println!("\n");
    };

    println!("Local alignment:");
    let local_alignment = set.local_alignment();
    print_alignment(local_alignment);

    println!("Global alignment:");
    let global_alignment = set.global_alignment();
    print_alignment(global_alignment);

    // Local alignment:
    // ====------======!=!================+++++=====
    //
    // Global alignment:
    // ====------======!=!================+++++=====!!!++=
}

See the examples directory for more in-depth examples.
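
To make the difference between the two alignment kinds concrete, here is a minimal, self-contained sketch of the underlying dynamic-programming recurrence. It is not the crate's implementation and does not use its API: the `Scoring` struct, its field names, and `alignment_score` are hypothetical names introduced for illustration, and the three scores do not necessarily correspond to the four arguments of `NeedlemanWunsch::new`. The sketch only computes the optimal score, keeping two matrix rows in memory, and performs no traceback.

```rust
/// Hypothetical scoring parameters for this sketch (not the crate's API).
struct Scoring {
    matched: i32,
    mismatched: i32,
    gap: i32,
}

/// Returns the optimal alignment score of `x` against `y`.
/// `local == false` uses the Needleman-Wunsch (global) recurrence,
/// `local == true` uses the Smith-Waterman (local) recurrence.
fn alignment_score(x: &[char], y: &[char], s: &Scoring, local: bool) -> i32 {
    let (n, m) = (x.len(), y.len());

    // First row: global alignment pays for leading gaps, local does not.
    let mut prev: Vec<i32> = (0..=m as i32)
        .map(|j| if local { 0 } else { j * s.gap })
        .collect();
    let mut best: i32 = 0;

    for i in 1..=n {
        let mut curr: Vec<i32> = vec![0; m + 1];
        curr[0] = if local { 0 } else { i as i32 * s.gap };

        for j in 1..=m {
            let sub = if x[i - 1] == y[j - 1] { s.matched } else { s.mismatched };
            // Best of: align x[i-1] with y[j-1], gap in y, gap in x.
            let mut cell = (prev[j - 1] + sub)
                .max(prev[j] + s.gap)
                .max(curr[j - 1] + s.gap);
            if local {
                // Local alignment may restart anywhere, so scores never drop below 0.
                cell = cell.max(0);
                best = best.max(cell);
            }
            curr[j] = cell;
        }
        prev = curr;
    }

    // Global: score of the bottom-right cell. Local: best cell anywhere.
    if local { best } else { prev[m] }
}
```

With `matched = 1`, `mismatched = -1` and `gap = -1`, aligning "xxabcxx" against "abc" scores -1 globally (three matches, four gaps) and 3 locally (only the shared "abc" is aligned).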

API Reference

An AlignmentSet contains all optimal alignments for a given pair of sequences.

Retrieving a single locally/globally optimal alignment

let alignment = alignment_set.local_alignment();
let alignment = alignment_set.global_alignment();

Enumerating all locally/globally optimal alignments

for alignment in alignment_set.local_alignments() {
    // process each locally optimal alignment
}
for alignment in alignment_set.global_alignments() {
    // process each globally optimal alignment
}
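
Once an alignment has been retrieved, its steps can be turned into a human-readable form. The following sketch renders the two sequences on top of each other, with `-` marking gaps. To stay self-contained it defines a stand-in `Step` enum instead of importing the crate's type: the variant names come from the example above, but the fields of `Delete` and `Insert` are an assumption, and `render_rows` is a hypothetical helper, not part of the crate.

```rust
/// Stand-in for the crate's `Step` type so this sketch compiles on its own.
/// The fields of `Delete` and `Insert` are assumed, not taken from the crate.
#[derive(Clone, Copy, Debug, PartialEq)]
enum Step {
    Align { x: usize, y: usize },
    Delete { x: usize },
    Insert { y: usize },
}

/// Builds one output row per sequence; a '-' marks a gap in that row.
fn render_rows(steps: &[Step], seq_x: &[char], seq_y: &[char]) -> (String, String) {
    let mut row_x = String::new();
    let mut row_y = String::new();
    for step in steps {
        match *step {
            // Both sequences advance.
            Step::Align { x, y } => {
                row_x.push(seq_x[x]);
                row_y.push(seq_y[y]);
            }
            // Element of x has no counterpart in y.
            Step::Delete { x } => {
                row_x.push(seq_x[x]);
                row_y.push('-');
            }
            // Element of y has no counterpart in x.
            Step::Insert { y } => {
                row_x.push('-');
                row_y.push(seq_y[y]);
            }
        }
    }
    (row_x, row_y)
}
```

For "cat" against "cart" with the steps align, align, insert, align, this yields the rows "ca-t" and "cart".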

Contributing

Please read CONTRIBUTING.md for details on our code of conduct and the process for submitting pull requests.

License

This project is licensed under the MPL-2.0 – see the LICENSE.md file for details.
