Latest release: 0.5.0 (unstable), May 10, 2025
aud2txt
Audio to text tool, using ggerganov's whisper.cpp via whisper-rs and FFmpeg.
install
- install FFmpeg (via your package manager or directly)
- ensure the `ffmpeg` command is available
- `cargo install aud2txt`
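The second step can be checked from a shell before installing. This is a minimal sketch, not part of aud2txt itself; the helper name `have_cmd` is hypothetical.

```shell
# have_cmd NAME: succeed if NAME resolves to a command on PATH.
# (hypothetical helper, used here only for illustration)
have_cmd() {
  command -v "$1" >/dev/null 2>&1
}

if have_cmd ffmpeg; then
  echo "ffmpeg found"
else
  echo "ffmpeg not found; install it before running aud2txt" >&2
fi
```

If the check passes, `cargo install aud2txt` is the remaining step.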
usage
TL;DR
aud2txt <INPUT_FILE>
where `<INPUT_FILE>` is any media file readable by `ffmpeg`.
Also see the demo script.
options
Usage: aud2txt [OPTIONS] <INPUT_FILE>
Arguments:
<INPUT_FILE> Input audio file
Options:
-l, --log <LOG_LEVEL> [default: error]
-m, --model-file <MODEL_FILE>
-N, --no-normalize Disable audio normalization before conversion to text
-o, --output-file <OUTPUT_FILE> Output text file
-h, --help Print help
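As an illustration of the `-o` option, the sketch below transcribes every `.mp3` file in the current directory into a matching `.txt` file. The helper `transcript_name` and the choice of `.mp3` are assumptions made for the example; only `aud2txt`, `-o`, and the positional input file come from the help text above.

```shell
# transcript_name FILE: replace the last extension of FILE with .txt
# (hypothetical helper; assumes FILE has an extension)
transcript_name() {
  printf '%s.txt\n' "${1%.*}"
}

for f in *.mp3; do
  # Skip the literal pattern when no .mp3 files are present.
  [ -e "$f" ] || continue
  aud2txt -o "$(transcript_name "$f")" "$f"
done
```

Any other format readable by `ffmpeg` could be substituted for `*.mp3` in the loop.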
If the `--model-file` argument is omitted, `aud2txt` will try to download and use the default model from: https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-base.en.bin
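To avoid relying on the automatic download, the model can be fetched once and passed explicitly. The following is a sketch under assumptions: it uses `curl` (any downloader would do), the function name `fetch_model` is hypothetical, and `recording.wav` is a placeholder input.

```shell
# Default model URL quoted from the text above.
MODEL_URL="https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-base.en.bin"
# Local file name: everything after the last "/" in the URL.
MODEL_FILE="${MODEL_URL##*/}"

# fetch_model: download the model unless a local copy already exists.
# (hypothetical helper; assumes curl is installed)
fetch_model() {
  [ -f "$MODEL_FILE" ] || curl -L --fail -o "$MODEL_FILE" "$MODEL_URL"
}

# Example use (commented out so that sourcing this file has no side effects):
# fetch_model && aud2txt --model-file "$MODEL_FILE" recording.wav
```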
If the `--no-normalize` flag is passed, the normalization step will be skipped, removing the runtime dependency on `ffmpeg`.