#text #whisper #llm #audio

bin+lib aud2txt

Audio to text tool using ggerganov's whisper.cpp

1 unstable release

new 0.5.0 May 10, 2025

#2715 in Command line utilities

Custom license

22KB
284 lines

aud2txt

Audio to text tool, using ggerganov's whisper.cpp via whisper-rs and FFmpeg.

install

  1. install FFmpeg (via your package manager or directly)
  2. ensure ffmpeg command is available
  3. cargo install aud2txt

usage

TL;DR

aud2txt <INPUT_FILE>

where <INPUT_FILE> is any media file readable by ffmpeg.

Also see the demo script.

options

Usage: aud2txt [OPTIONS] <INPUT_FILE>

Arguments:
  <INPUT_FILE>  Input audio file

Options:
  -l, --log <LOG_LEVEL>            [default: error]
  -m, --model-file <MODEL_FILE>
  -N, --no-normalize               Disable audio normalization before conversion to text
  -o, --output-file <OUTPUT_FILE>  Output text file
  -h, --help                       Print help

If --model-file argument is omitted, aud2txt will try to download and use the default model from: https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-base.en.bin

If --no-normalize flag is passed, the normalization step will be skiped, removing the runtime dependency on ffmpeg.

Dependencies

~20–36MB
~604K SLoC