#bioinformatics #genome #ncbi #genome-assembly

app genomers

Package to download NCBI genome data and metadata

2 releases

new 0.1.2 Oct 16, 2024
0.1.1 Sep 15, 2024

#461 in Command line utilities

Download history 37/week @ 2024-09-09 165/week @ 2024-09-16 8/week @ 2024-09-23 8/week @ 2024-09-30 149/week @ 2024-10-14

172 downloads per month

MIT license

14KB
180 lines

genomers

genomers is a rust package for downloading genome assembly data and metadata from NCBI

Changelog

  • v0.1.5 Added GFF genome annotation to download options

Features

  • Download the assembly report and assembly fasta sequence

How to use

The binaries can be downloaded from the 'release' section of the repository (https://github.com/matteobolner/genomers/releases/download/{release}) or with

wget https://github.com/matteobolner/genomers/releases/download/first-release/genomers

Why genomers?

I wanted to have an easy way to download genome assembly data and metadata from NCBI, and since I'm learning rust I decided to try doing it in rust.

How to use

Two parameters are required:
the assembly accession and assembly name.

running "genomers -h" will print all the necessary information.

Example:

# download the assembly report for the E. coli genome

./genomers -a GCA_000005845.2 -n ASM584v2 -r > report.txt

# now the full genome seqeuence

./genomers -a GCA_000005845.2 -n ASM584v2 -g > genome.fasta.gz

The output is directed to stdout and can be piped:

./genomers -a GCA_000005845.2 -n ASM584v2 -r | grep "Chromosome"

# unzip the file
./genomers -a GCA_000005845.2 -n ASM584v2 -g | gunzip > genome.fasta

Dependencies

~9–21MB
~298K SLoC