-
needletail
FASTX parsing and k-mer methods
-
crispr_screen
A fast and configurable differential expression analysis tool for CRISPR screens
-
proseg
Probabilistic cell segmentation for in situ spatial transcriptomics
-
rasusa
Randomly subsample reads to a specified coverage
-
atg
Convert transcripts between different file formats
-
rust-bio-tools
A set of fast and robust command line utilities for bioinformatics tasks based on Rust-Bio
-
minimap2
Bindings to libminimap2
-
simpleaf
framework to make using alevin-fry even simpler
-
noodles
Bioinformatics I/O libraries
-
light_phylogeny
Methods and functions for phylogeny
-
rust-htslib
HTSlib bindings and a high level Rust API for reading and writing BAM files
-
finch
min-wise independent permutation locality sensitive hashing ('MinHashing') for genomic data and command-line utility for manipulation
-
bigtools
associated tools for reading and writing bigwigs and bigbeds
-
alevin-fry
A suite of tools for the rapid, accurate and memory-frugal processing single-cell and single-nucleus sequencing data
-
fasten
A set of scripts to run basic analysis on fastq files
-
fqkit
cross-platform program for fastq file manipulation
-
alignoth
creating alignment plots from bam files
-
lightmotif
A lightweight platform-accelerated library for biological motif scanning using position weight matrices
-
exon
A platform for scientific data processing and analysis
-
wham
weighted histogram analysis method
-
bio-seq
Bit packed and well-typed biological sequences
-
intspan
Command line tools for IntSpan related bioinformatics operations
-
rust-lapper
A fast and easy interval overlap library
-
align-cli
A command line interface for easily aligning sequences
-
stats_on_gff3_ncbi
Calculate statistics such as CDS GC3 ratio, intron GC ratio, flanking gene region GC ratio, first intron length, number of introns, CpG ratio, etc
-
sigalign
A Similarity-Guided Alignment Algorithm
-
bed-reader
Read and write the PLINK BED format, simply and efficiently
-
block-aligner
SIMD-accelerated library for computing global and X-drop affine gap penalty sequence-to-sequence or sequence-to-profile alignments using an adaptive block-based algorithm
-
galah
Microbial genome dereplicator
-
fastobo
Faultless AST for Open Biomedical Ontologies
-
rnapkin
CLI utility for drawing RNA secondary structure
-
coverm
Read coverage calculator for metagenomics
-
gfatk
command line tool for manipulating small to medium sized GFA files, and specifically for output from the genome assembler MBG
-
nail
alignment inference tool
-
omnitigs
Omnitig-related algorithms
-
verify-same-kmer-content
Verify that an SPSS has the same kmer content as a set of unitigs
-
coprosize
coprolite research (paleontology and archaeology): estimate the producer's body mass based on coprolite diameter by the use of regression models
-
ambigviz
Identify and plot ambiguous nucleotide bases at given positions from a BAM file
-
pipspeak
converting PIPSeq files to 10X Genomics compatible FASTQ files
-
give_a_sheet
Toolkit for generating input samplesheets for a variety of nf-core pipelines
-
consalifold
Consensus Secondary Structure Predictor Engaging Structural Alignment-based Error Correction
-
webgestalt
CLI for computing enrichment for different analytes using ORA, GSEA, or NTA
-
seqsizzle
A pager for viewing FASTQ files with fuzzy matching, allowing different adaptors to be colored differently
-
scidataflow
A command-line tool to manage scientific research project data
-
kmerutils
Kmer counting, hashing, sequence sketching
-
gsearch
genome classification, probminhash hnsw, genome search
-
ehxacto
Find exact tandem repeat coordinates from approximate regions identified by ExpansionHunter denovo
-
archaea
genome classification, probminhash hnsw archaea
-
genomicsqlite
Genomics Extension for SQLite
-
ska
Split k-mer analysis
-
sourmash
tools for comparing biological sequences with k-mer sketches
-
piscem-infer
A flexible tool to perform target quantification from bulk-sequencing data
-
varlociraptor
calling of genomic variants using a latent variable model
-
ebiotic
interacting with common bioinformatics web services
-
mehari
Variant effect prediction all in Rust
-
slow5
interacting with slow5
-
bustools
Interacting with the kallisto/bus format of scRNAseq data
-
parasail-rs
Rust bindings and wrapper for parasail, a SIMD C library for pairwise sequence alignment
-
fakit
program for fasta file manipulation
-
jam-rs
Just another (genomic) minhash (Jam) implementation in Rust
-
fxtools
A collection of commandline Fasta/Fastq utility tools
-
righor
creates model of Ig/TCR sequences from sequencing data
-
lightdock
Macromolecular docking software based on the GSO algorithm
-
nucleob
bioinformatics: nucleobases and amino acids statistics
-
maf2bed
Converts multiple alignment format (MAF) files to a BED format for tabixing. Used with jbrowse-plugin-mafviewer
-
fastax
Make phylogenetic trees and lineages from the NCBI Taxonomy database
-
taxonomy
Routines for loading, saving, and manipulating taxonomic trees
-
uniprot
Rust data structures and parser for the Uniprot database(s)
-
motif_finder
Find motifs using Gibbs Sampler, Median String, and Randomized Motif Search algorithms in a fasta formatted file of reads Refer to the README to understand the input data
-
nwr
nwr
is a command line tool for newick and taxonomy -
htsget-test
Common test functions and utilities used by htsget-rs
-
sprocket
A package manager for the Workflow Description Language files
-
cgt_bacpop
Label core and rare genes in pangenome dataa
-
fmlrc
FM-index Long Read Corrector - Rust implementation
-
prot_translate
Translate nucleotide sequence to protein
-
granges
command line tool for genomic range operations
-
atglib
handle transcripts for genomics and transcriptomics
-
rust-sasa
RustSASA is a Rust library for computing the absolute solvent accessible surface area (ASA/SASA) of each atom in a given protein structure using the Shrake-Rupley algorithm
-
ffforf
fasta/q/x file format parser. Well fuzzed.
-
ontime
Extract subsets of ONT (Nanopore) reads based on time
-
mni2mz3
Brain imaging surface mesh file format converter
-
librna-sys
Low-level bindings for the ViennaRNA library
-
cytos
A collection of frequently used Bioinformatics scripts/tasks written in Rust
-
perbase
Fast and correct perbase BAM/CRAM analysis
-
stats_on_gff3
Calculate statistics such as CDS GC3 ratio, intron GC ratio, flanking gene region GC ratio, first intron length, number of introns, CpG ratio, etc. Examples: stats_on_gff3 Homo_sapiens…
-
vcf
VCF Parser
-
genome-graph
Representation of genome graphs
-
chainfile
working with genomics chain files
-
phylotree
deal with phylogenetic trees and distance matrices
-
gdrs
GenomicDistributions package from bioconductor
-
seal
Needleman-Wunsch & Smith-Waterman sequence alignment
-
lorikeet-genome
Strain resolver and variant caller via local reassembly for metagenomics
-
consalign
RNA Structural Aligner Based on Transfer-learning and Thermodynamic Ensemble Model
-
fastx
reads Fasta and FastQ files with little overhead
-
msbwt2
multi-string BWT query library
-
to-trans
A high-performance transcriptome builder from fasta + GTF/GFF
-
mpileup
Pile up multiple bam files site by site
-
seqrepo
Port of (read-only) functionality of biocommons/seqrepo to Rust
-
noodles-gff
Generic Feature Format (GFF) reader and writer
-
bedblocks
Split a BED file into blocks
-
gchemol-lattice
Periodic 3D crystal lattice
-
fpa_lr
fpa filter long read mapping information to save disk space
-
bird_tool_utils
Microbial genomics utility functions
-
wdl
Document lexing, parsing, validation, and linting for Workflow Description Language (WDL) documents
-
kinesin
re-exports crate (WIP)
-
oxbow
Read specialized bioinformatic file formats as data frames in R, Python, and more
-
grass-runtime
Runtime library for GRASS
-
nwbview
Neurodata Without Borders viewer
-
nale
that performs profile Hidden Markov Model (PHMM) biological sequence alignment
-
fastlin
an ultra-fast program for MTBC lineage typing
-
yacrd
Using all-against-all read mapping, yacrd performs: computation of pile-up coverage for each read and detection of chimeras
-
fastq
A parser for fastq
-
ncbitaxonomy
Read NCBI Taxonomy Database from files and work with NCBI Taxonomy DB
-
fqtk
A toolkit for working with FASTQ files
-
podders
Write uncompressed Pod5 files in native rust. No FFI! PODDDDERS
-
dihardts_omicstools
Collection of different omic tools, structs and enums
-
diploid-contam-estimator
Estimating contamination level in diploid DNA seuqencing libraries
-
skani
fast tool for calculating ANI between metagenomic sequences, such as metagenome-assembled genomes (MAGs). It is extremely fast and is robust against incompleteness and fragmentation, giving accurate ANI estimates.
-
sgcount
A fast and flexible sgRNA counter
-
strif
identify interruptions in short tandem repeats across the genome
-
primaldimer_rs
calculating PrimerPrimer interactions
-
guide-counter
Fast and accurate guide counting for CRISPR screens
-
prole
collection of re-usable methods in Rust. Feel free to use them for your own work.
-
rosella
Metagenome assembled genome recovery from metagenomes using UMAP and HDBSCAN
-
finch_cli
min-wise independent permutation locality sensitive hashing ('MinHashing') for genomic data and command-line utility for manipulation
-
hts
Rust binding for htslib
-
gia
set theoretic operations of genomic intervals
-
genome
package used to generate DNA used for GAN generator
-
lib3dmol
written in rust to read, manipulate, select atoms in protein structure files
-
blobtk
Core utilities for BlobToolKit
-
pedigree
Build a pedigree from WGS data
-
gfa
working with graphs in the GFA (Graphical Fragment Assembly) format
-
hts-sys
HTSlib bindings
-
entrez-rs
Rust wrapper for the Entrez API
-
frag_gene_scan_rs
gene prediction model for short and error-prone reads
-
ngs
Command line tool for processing next-generation sequencing data
-
exon-genbank
Exon GenBank
-
bamrescue
check Binary Sequence Alignment / Map (BAM) files for corruption and repair them
-
smafa
Read aligner for small pre-aligned sequences
-
syntesuite
TODO
-
stats_on_genomes
Calculate 2 simple ratio on the whole genome: GC ratio and repetition ratio
-
genomap
A small library for storing generic genomic data indexed by a chromosome
-
noodles-gtf
Gene Transfer Format (GTF) reader and writer
-
freesasa-sys
Rust raw FFI bindings for the freesasa C library
-
mmap-bitvec
working with bit-vectors backed by memory-mapped files
-
kractor
Extract reads from a FASTQ file based on taxonomic classification via Kraken2
-
fasta_windows
Make quick statistics in windows from a fasta file
-
sorensen
Sørensen–Dice coefficient implementation for Rust
-
ggca
Computes efficiently the correlation (Pearson, Spearman or Kendall) and the p-value (two-sided) between all the pairs from two datasets
-
bam-builder
easily building BAMs for testing
-
libradicl
support library for alevin-fry
-
elias_fano_rust
An optimized implementation of Sebastiano Vigna's Elis-Fano quasi succint datastructure
-
rna-ss-params
RNA secondary structure parameters
-
pattern_partition_prediction
Reading and querying k-mer pattern partition information
-
nohuman
Remove human reads from a sequencing run
-
chainsaw
manipulate newick trees
-
ppgg
associated executable, the library provides tools for building tools that can parse and work for VCF and FASTA files while the associated executable is a command line tool for generating…
-
chiral-common
Common Stuff of Chiral: An All-in-One Data Processing Tool
-
flowtigs
An algorithm for calculating flowtigs in a De Bruijn graph of DNA reads in metagenomes
-
fastleng
read length statistics tool
-
lightmotif-py
PyO3 bindings and Python interface to the lightmotif crate
-
sigalign-utils
utils for core
-
haptk
Haplotype analysis toolkit
-
drug-extraction-core
A core library for extracting drugs from text records
-
spdi
format to describe a genomic variant. This crate provides a library to get an SPDI format representation of a variant and a command-line utility which adds SPDI format output to an input VCF file.
-
compact-genome
Representation of genomes
-
biofile
reading bioinformatics related files
-
gfautil
Command line tools for working with GFA files and related formats
-
simplr
An augmented execution context to simplify long-read transcriptome analysis
-
slow5lib-sys
Low-level bindings to the slow5lib C library
-
bio-jtools
A suite of bioinformatics tools for interacting with high throughput sequencing (HTS) data
-
methylome
Structs and methods for working with methylation data
-
kgst
containing the implementation of a K-Truncated Generalized Suffix Tree using Ukkonen's Algorithm
-
kmers
k-mer manipulation
-
heme
PDB reader and other protein modeling tools
-
protein-translate
Translate nucleotide sequence to protein
-
pilercr-parser
A parser for the output of the PILER-CR CRISPR array annotation tool
-
fusta
leverages the FUSE interface to transparently manipulate multiFASTA files as independent files
-
nthash
rolling hash function for hashing all possible k-mers in a DNA sequence
-
xdrfile
Wrapper around the gromacs libxdrfile library. Can be used to read and write gromacs trajectories in xtc and trr format.
-
gfastats
GFA statistics
-
exon-cram
Exon CRAM
-
bustools_cli
Rust reimplementation of bustools for scRNAseq processing
-
genotype
An abstraction layer between genotype and phenotype, with in-place mutation
-
census-proteomics
working with proteomics data quantified by the Census algorithm
-
nucleobases
low-level brick crate for managing nucleobases as data in code
-
bed-utils
manipulating genomic range objects
-
multi-seq-align
Manipulate multiple sequence alignments (DNA/protein)
-
biocommons-bioutils
(Partial) port of biocommons/bioutils to Rust
-
alpine-core
ALPINE (Anachronistic Lineage and Persistent INfection Explorer) Core Utilities
-
sfs-cli
Tools for working with site frequency spectra
-
fxread
A barebones fastx reader for rust
-
fastxgz
A fasta/fastq parser for both compressed and not compressed files
-
consprob
Quick Probability Inference Engine on RNA Structural Alignment
-
bedrs
Genomic interval library in rust
-
rust-spoa
wraps the C++ SPOA library for generating DNA and protein consensus sequences
-
fastix
prefix renaming FASTA records
-
distance_aa
A backend for the distAAnce web application
-
fastx-statistics
Compute simple statistics for fasta-like files
-
grass-driver
Driver for GRASS
-
libprosic
calling of genomic variants using a latent variable model
-
biostats
A bioinformatics library
-
GetPDB
Download Protein files from rcsb.org
-
jean_blosum
BLOSUM feature for jean
-
rust-parallelfastx
Parallel iteration of FASTA/FASTQ files, for when sequence order doesn't matter but speed does
-
noodles-refget
A refget client
-
libparasail-sys
Unsafe Rust bindings for the parasail C library
-
psdm
Compute a pairwise SNP distance matrix from one or two alignment(s)
-
rumi
PCR Deduplication via directional adjacency
-
protein_translation
translate &str or String of RNA sequence with nucleotide into a Vec<&str> of their appropriate protein names
-
aa-regex
macros to build regular expression matching protein sequences
-
recmap
reading and working with recombination maps in Rust
-
minimap2-sys
Bindings to libminimap2
-
chem-parse
A parser for simple chemical forumulas
-
ross
A set of scripts to run basic analysis on fastq files
-
exon-io
IO utilities for Exon
-
seqsplitter
Parses fasta records based on list of either header names or regexes
-
ggetrs
Efficient querying of biological databases from the command line
-
lightmotif-tfmpvalue
Rust reimplementation of TFMPvalue for the lightmotif crate
-
gfacut
Cutting out parts of the genome
-
gkl
Genomics Kernel Library
-
fqgrep
Search a pair of fastq files for reads that match a given ref or alt sequence
-
bio
A bioinformatics library for Rust. This library provides implementations of many algorithms and data structures that are useful for bioinformatics, but also in other fields.
-
vcfverifier
Verifies that a given VCF matches a given FASTA by comparing the REF column in the VCF to the underlying FASTA sequences
-
phcue-ck
command line tool to obtain FTP links to FASTQ files from ENA using run accession
-
biosignal
processing biological signals
-
libnail
that performs profile Hidden Markov Model (PHMM) biological sequence alignment
-
pdbtbx
open/edit/save (crystallographic) Protein Data Bank (PDB) and mmCIF files
-
mudskipper
Convert genomic alignments to transcriptomic BAM/RAD files
-
oarfish
A fast, accurate and versatile tool for long-read transcript quantification
-
rosalind-cli
CLI for
rosalind
crate -
gtftools
A barebones GTF toolkit with fast nom-based IO
-
census2csv
Convert TMT multiplexed proteomics data in the Census format to CSV files
-
nucleic-acids
creating DNA and RNA sequences
-
fq-filter-reads
program to filter a fastq file with a provided list of IDs
-
lorikeet-rs
Strain resolver for metagenomics
-
dna
Some tools that are 'internal' for now because they are insufficiently refined and unstable, but which are used by other 'public' crates
-
mmft
A minimal fasta toolkit
-
gffkit
program for gff3 file manipulation
-
protein
Working with protein structures
-
minced-parser
A parser for the output of the MinCED CRISPR array annotation tool
-
exon-gtf
GTF datasource for Exon
-
metaprofile
Segregate WGS data into windows along the genome. Optionally, run tools such as alphabeta for each window.
-
handlegraph
in variation graphs
-
select-random-fastx
Select random entries from fastx files
-
consprob-trained
Trainable Probability Inference Engine on RNA Structural Alignment
-
libwfa
Bindings to the C implementation of the WFA alignment algorithm
-
krakenxtract
Extract reads from a FASTQ file based on taxonomic classification via Kraken2
-
saboten
biedged graphs, cactus graphs and trees, and an algorithm for finding ultrabubbles in variation graphs
-
rget_pdb
Download Protein files from rcsb.org
-
exon-fcs
Exon FCS
-
drprg
Drug resistance prediction with reference graphs
-
ukhs
Universal K-mer Hitting Sets
-
read-structure
parsing and working with read structure descriptions
-
noodles-htsget
An htsget client
-
ncbi
Rust data structures for NCBI APIs
-
aa-name
Basic enum for amino acid names
-
rust-seq2kminmers
Construction and iteration of k-min-mers from a DNA sequence
-
bam2seq
Extract reads and reconstructed references from a .bam file using CIGAR and MD tags
-
vdj_ann
Some tools that are 'internal' for now because they are insufficiently refined and unstable, but which are used by other 'public' crates
-
crispr
designing sgRNAs for genome engineering using CRISPR/Cas9
-
readfish-tools
Tools for analysing adaptive sampling data
-
sce
importing and managing various single-cell matrices
-
chiral-apps
Applications available on chiral
-
fffx
fasta/q/x file format parser. Well fuzzed.
-
grafen
Create graphene and other substrates for use in molecular dynamics simulations
-
sigalign-core
A core crate for sigalign
-
readmerger
rapidly merging FASTQ files. Never use cat again!
-
bvreader
Reader for the BrainVision data format
-
aa_similarity
Substitution matrices for use in protein sequence analysis and alignment
-
lightmotif-transfac
TRANSFAC parser implementation for the lightmotif crate
-
bio-rust
解析生物信息领域的基本数据结构,提供操纵这些数据的接口和构建一些统计模型。
-
hgvs
Port of biocommons/hgvs to Rust
-
seqsample
Randomly sample sequences from a FASTA file
-
fasta_tools
Some tools that are 'internal' for now because they are insufficiently refined and unstable, but which are used by other 'public' crates
-
scispeak
converting Sci-RNA-Seq3 files to 10X Genomics compatible FASTQ files
-
jean_macros
Macro library feature for jean
-
rna-algos
RNA Bioinformatics Algorithms
-
seqcomplexity
Calculates per-read and total sequence complexity from FastQ file
-
jean_cut
Codon usage tables feature for jean
-
fasta
Tools for FASTA reading, writing and indexing
-
gtfjson
convert GTF files to newline-delim JSON
-
mfqe
FASTA/Q read extractor
-
exon-sam
Exon SAM
-
skc
Shared k-mer content between two genomes
-
thirdkind
Read phylogenetic tree(s) in newick, phyloXML or recPhyloXML file and build a svg representation of the tree(s) allowing 1, 2 or 3 reconciliation levels
-
sdust
symmetric DUST algorithm, with optional CLI
-
prepare_fasta
Compute hash-based signatures of sequence, and perform pre-processing
-
cramino
quickly extract quality metrics from bam or cram files
-
ring_api
Strongly-typed client for the RING HTTP API
-
crast
Context RNA Alignment Search Tool
-
blastdb-sequence-util
encoding nucleotide/protein sequences in BLAST database format
-
candentia
A cli tool for managing CT-Scan data to study evolutionary biology
-
alpaca
caller for genomic variants (single nucleotide and small indels) from next-generation sequencing data that uses a novel algebraic approach to incorporate sample based filtering into the calling…
-
aa-colour
Add some colours to your amino acids
-
chiral-operator
Operators f Chiral: An All-in-One Data Processing Tool
-
align_tools
Some tools that are 'internal' for now because they are insufficiently refined and unstable, but which are used by other 'public' crates
-
seq_geom_xform
Transform/normalize complex single-cell fragment geometries into simple geometries
-
jean_io
I/O library feature for jean
-
syncmers
finding syncmers