#whisper #cpp-bindings #opencl #model #open-blas #gpu #whisper-rs

sys whisper-rs-sys

Rust bindings for whisper.cpp (FFI bindings)

16 releases (8 breaking)

0.9.0 Apr 6, 2024
0.8.0 Nov 27, 2023
0.7.2 Oct 8, 2023
0.6.1 Jun 18, 2023
0.2.0 Oct 29, 2022

#261 in Audio

Download history 211/week @ 2024-01-05 144/week @ 2024-01-12 139/week @ 2024-01-19 65/week @ 2024-01-26 50/week @ 2024-02-02 150/week @ 2024-02-09 116/week @ 2024-02-16 220/week @ 2024-02-23 289/week @ 2024-03-01 172/week @ 2024-03-08 232/week @ 2024-03-15 371/week @ 2024-03-22 361/week @ 2024-03-29 420/week @ 2024-04-05 294/week @ 2024-04-12 242/week @ 2024-04-19

1,356 downloads per month
Used in 8 crates (via whisper-rs)

Unlicense

2MB
49K SLoC

C 22K SLoC // 0.1% comments CUDA 8K SLoC // 0.0% comments C++ 6K SLoC // 0.1% comments Rust 6K SLoC // 0.0% comments Metal Shading Language 4.5K SLoC // 0.0% comments Objective-C 2.5K SLoC // 0.1% comments Objective-C++ 52 SLoC // 0.1% comments

whisper-rs

Rust bindings to whisper.cpp

Usage

git clone --recursive https://github.com/tazz4843/whisper-rs.git

cd whisper-rs

cargo run --example basic_use

cargo run --example audio_transcription
use whisper_rs::{WhisperContext, WhisperContextParameters, FullParams, SamplingStrategy};

fn main() {
	let path_to_model = std::env::args().nth(1).unwrap();

	// load a context and model
	let ctx = WhisperContext::new_with_params(
		path_to_model,
		WhisperContextParameters::default()
	).expect("failed to load model");

	// create a params object
	let params = FullParams::new(SamplingStrategy::Greedy { best_of: 1 });

	// assume we have a buffer of audio data
	// here we'll make a fake one, floating point samples, 32 bit, 16KHz, mono
	let audio_data = vec![0_f32; 16000 * 2];

	// now we can run the model
	let mut state = ctx.create_state().expect("failed to create state");
	state
		.full(params, &audio_data[..])
		.expect("failed to run model");

	// fetch the results
	let num_segments = state
		.full_n_segments()
		.expect("failed to get number of segments");
	for i in 0..num_segments {
		let segment = state
			.full_get_segment_text(i)
			.expect("failed to get segment");
		let start_timestamp = state
			.full_get_segment_t0(i)
			.expect("failed to get segment start timestamp");
		let end_timestamp = state
			.full_get_segment_t1(i)
			.expect("failed to get segment end timestamp");
		println!("[{} - {}]: {}", start_timestamp, end_timestamp, segment);
	}
}

See examples/basic_use.rs for more details.

Lower level bindings are exposed if needed, but the above should be enough for most use cases. See the docs: https://docs.rs/whisper-rs/ for more details.

Feature flags

All disabled by default unless otherwise specified.

  • raw-api: expose whisper-rs-sys without having to pull it in as a dependency. NOTE: enabling this no longer guarantees semver compliance, as whisper-rs-sys may be upgraded to a breaking version in a patch release of whisper-rs.
  • cuda: enable CUDA support. Implicitly enables hidden GPU flag at runtime.
  • opencl: enable OpenCL support. Upstream whisper.cpp does not treat OpenCL as a GPU, so it is always enabled at runtime.
  • openblas: enable OpenBLAS support.
  • metal: enable Metal support. Implicitly enables hidden GPU flag at runtime.
  • whisper-cpp-log: allows hooking into whisper.cpp's log output and sending it to the log backend. Requires calling
  • whisper-cpp-tracing: allows hooking into whisper.cpp's log output and sending it to the tracing backend.

Building

See BUILDING.md for instructions for building whisper-rs on Windows and OSX M1. Linux builds should just work out of the box.

Troubleshooting

  • Something other than Windows/macOS/Linux isn't working!
    • I don't have a way to test these platforms, so I can't really help you.
      • If you can get it working, please open a PR with any changes to make it work and build instructions in BUILDING.md!
  • I get a panic during binding generation build!
    • You can attempt to fix it yourself, or you can set the WHISPER_DONT_GENERATE_BINDINGS environment variable. This skips attempting to build the bindings whatsoever and copies the existing ones. They may be out of date, but it's better than nothing.
      • WHISPER_DONT_GENERATE_BINDINGS=1 cargo build
    • If you can fix the issue, please open a PR!

License

Unlicense

tl;dr: public domain

Dependencies

~0–2MB
~41K SLoC