app fasta-filter

Filter a FASTA file and output a subset of the records on STDOUT

1 unstable release

0.1.0 Sep 2, 2024

#57 in Biology

136 downloads per month

MIT license

10KB
50 lines

fasta-filter

Usage

Filter a FASTA file and output a subset of the records on STDOUT

Usage: fasta-filter [OPTIONS] [FILE]

Arguments:
  [FILE]

Options:
  -p, --pattern <PATTERN>
  -x, --exclude
  -m, --min <MIN>
  -n, --max <MAX>
  -h, --help               Print help
  -V, --version            Print version

This simple program reads FASTA data from STDIN or from a named file and writes a subset of the records to STDOUT. Records can be filtered by sequence length, using a minimum (--min) and a maximum (--max), and by a pattern that must be present in the record header or, with -x, must be absent from it.

Example: fasta-filter --min 5 --max 2000 --pattern "mol:protein" test.fasta
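
The filtering rules described above can be sketched in Rust roughly as follows. This is a minimal illustration, not the crate's actual source: the `Record` and `Filter` types and the `parse_fasta` and `filter_fasta` functions are hypothetical names, the pattern is assumed to be a plain substring match (the real tool may use a regular expression), and the length bounds are assumed to be inclusive.

```rust
/// One FASTA record: the header line (without the leading '>') and its sequence.
#[derive(Debug, Clone, PartialEq)]
struct Record {
    header: String,
    sequence: String,
}

/// Hypothetical filter settings mirroring the CLI options.
#[derive(Default)]
struct Filter {
    pattern: Option<String>, // --pattern (assumed: plain substring match)
    exclude: bool,           // --exclude: keep records whose header lacks the pattern
    min: Option<usize>,      // --min (assumed: inclusive lower bound on length)
    max: Option<usize>,      // --max (assumed: inclusive upper bound on length)
}

impl Filter {
    /// Returns true if the record passes the length and header checks.
    fn keeps(&self, rec: &Record) -> bool {
        let len = rec.sequence.len();
        if self.min.map_or(false, |m| len < m) {
            return false;
        }
        if self.max.map_or(false, |m| len > m) {
            return false;
        }
        match &self.pattern {
            // With exclude set, a matching header rejects the record.
            Some(p) => rec.header.contains(p.as_str()) != self.exclude,
            None => true,
        }
    }
}

/// Parse FASTA text into records, joining multi-line sequences.
fn parse_fasta(input: &str) -> Vec<Record> {
    let mut records = Vec::new();
    let mut current: Option<Record> = None;
    for line in input.lines() {
        if let Some(header) = line.trim_end().strip_prefix('>') {
            // A new header closes the previous record, if any.
            if let Some(rec) = current.take() {
                records.push(rec);
            }
            current = Some(Record {
                header: header.to_string(),
                sequence: String::new(),
            });
        } else if let Some(rec) = current.as_mut() {
            rec.sequence.push_str(line.trim());
        }
    }
    if let Some(rec) = current {
        records.push(rec);
    }
    records
}

/// Parse the input and keep only the records that pass the filter.
fn filter_fasta(input: &str, filter: &Filter) -> Vec<Record> {
    parse_fasta(input)
        .into_iter()
        .filter(|r| filter.keeps(r))
        .collect()
}

fn main() {
    let data = ">1ABC_1 mol:protein\nMKTAYIA\n>1ABC_2 mol:na\nACGT\n>2XYZ_1 mol:protein\nMKT\n";
    // Same settings as the command-line example above.
    let filter = Filter {
        pattern: Some("mol:protein".to_string()),
        exclude: false,
        min: Some(5),
        max: Some(2000),
    };
    for rec in filter_fasta(data, &filter) {
        println!(">{}\n{}", rec.header, rec.sequence);
    }
}
```

With the sample data in `main`, only the first record passes: the second is too short and has a different molecule type in its header, and the third matches the pattern but falls below the minimum length.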

Dependencies

~19MB
~321K SLoC