#rna-seq #quantification #science #read #rad-file

app piscem-infer

A flexible tool to perform target quantification from bulk-sequencing data

6 releases (breaking)

0.6.0 Feb 26, 2024
0.5.0 Oct 10, 2023
0.4.0 Oct 5, 2023
0.3.0 Oct 2, 2023
0.1.1 Sep 30, 2023

#50 in Biology

Download history 7/week @ 2024-02-16 148/week @ 2024-02-23 43/week @ 2024-03-01 12/week @ 2024-03-08 4/week @ 2024-03-15 6/week @ 2024-03-29

95 downloads per month

Custom license

51KB
1K SLoC

piscem-infer

What is piscem-infer?

piscem-infer is a tool to consume bulk-RAD files (produced by piscem or piscem-cpp) and to produce from them abundance estimates of the targets against which the reads were mapped. For example, a basic RNA-seq pipeline could consist of mapping the reads against the transcriptome using piscem, and then quantifying transcript abundance using piscem-infer. Likewise, one could use the pair of tools on metagenomic index and metagenomic sequencing reads to perform taxonomic abundance estimation. The main goal of piscem-infer is to separate the statistical inference algorithms and code from the code that performs indexing and mapping. This allows faster and more agile development of new improvements to the inference method, as well as eases the maintenance burden.

The piscem-infer program is written in rust, which makes it easy for end-users to compile, and which also makes it easy for us to deploy without the need for end-users to compile it (it's easy to create statically-linked, pre-compiled executables, and to put the package on bioconda). At the same time, this gives us access to the safety guarantees of rust, making the code easier to develop and maintain with confidence while retaining the efficiency of a high-performance, statically-typed, compiled language.

While piscem-infer is in active development, it is already very usable! We encourage folks who are curious to try it out, to open Github issues for any questions or feature requests, and to provide feedback on how you'd like to see this next "evolution" of (bulk sequencing) abundance estimation tools evolve!

Dependencies

~25–41MB
~633K SLoC