12 releases
0.1.11 | Jul 22, 2024 |
---|---|
0.1.10 | Jun 15, 2023 |
0.1.9 | Oct 12, 2022 |
#15 in #sequencing
Used in 4 crates
24KB
533 lines
disambiseq
Creates unambiguous nucleotide mismatch libraries for for a set of nucleotide sequences.
Usage
I've rewritten this functionality a few times for different use cases and put it into a standalone crate since it might be useful to others.
This is used to generate unambiguous one-off mismatch libraries for a set of DNA sequences.
Creating a new unambiguous set
use disambiseq::Disambiseq;
let sequences = vec![
"ACT".to_string(),
"AGT".to_string()
];
let dsq = Disambiseq::from_slice(&sequences);
println!("{:#?}", dsq);
Visualizing the set
Disambiseq {
unambiguous: {
"TCT": "ACT",
"ACA": "ACT",
"CCT": "ACT",
"ACC": "ACT",
"CGT": "AGT",
"GGT": "AGT",
"AGA": "AGT",
"GCT": "ACT",
"ACG": "ACT",
"TGT": "AGT",
"AGC": "AGT",
"AGG": "AGT",
},
parents: {
"AGT",
"ACT",
},
ambiguous: {
"ATT",
"AAT",
},
}
Querying the Set
use disambiseq::Disambiseq;
let sequences = vec![
"ACT".to_string(),
"AGT".to_string()
];
let dsq = Disambiseq::from_slice(&sequences);
// retrieve a parental sequence
assert_eq!(dsq.get_parent("ACT"), Some(&"ACT".to_string()));
// retrieve a mutation sequence's parent
assert_eq!(dsq.get_parent("TCT"), Some(&"ACT".to_string()));
// exclude sequences with ambiguous parents
assert_eq!(dsq.get_parent("AAT"), None);
assert_eq!(dsq.get_parent("ATT"), None);
Dependencies
~2MB
~27K SLoC