#fasta #fastq #randomly #sample #sequences

app seqsample

Randomly sample sequences from a FASTA file

2 unstable releases

0.2.0 Mar 15, 2023
0.1.0 Nov 14, 2022

#247 in Biology

Download history 2/week @ 2024-02-20 14/week @ 2024-02-27 1/week @ 2024-03-05 83/week @ 2024-03-12

100 downloads per month

MIT license

21KB
109 lines

Randomly samples from FASTA file

Installation

Source

Download the source code and run

cargo install

Usage

seqsample -n .5 --fasta original.fas --store-background ./background.fas

Arguments:

    -f, --fasta <fasta>
            The input FASTA file (gzip acceptable).

    -h, --help
            Print help information

    -i, --store-background <store-background>
            Write the samples not selected randomly in a separate file.

    -n, --number <number>
            Number of sequences to randomly sample. If the number is less than 1, then it will be
            treated as a percentage of the dataset.

    -V, --version
            Print version information

Dependencies

~38MB
~767K SLoC