1 unstable release: 0.0.1 (Oct 10, 2024)
infa
Rust + CUDA: a fast and simple inference library, written from scratch.
requirements
A Linux machine with CUDA 12.x, cuBLAS, and Rust installed.
Your GPU must be at least the sm_80 microarchitecture, i.e. Ampere or newer. (This is hardcoded for now.)
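To check whether your GPU meets the sm_80 minimum, recent NVIDIA drivers can print the compute capability with `nvidia-smi --query-gpu=compute_cap --format=csv,noheader`. A minimal sketch of the version check on that output (`meets_sm80` is a hypothetical helper, not part of this crate):

```rust
/// Parse a compute capability string such as "8.6" (as printed by
/// `nvidia-smi --query-gpu=compute_cap --format=csv,noheader`) and
/// check it meets the sm_80 minimum required here.
fn meets_sm80(compute_cap: &str) -> bool {
    // Take the major version before the dot; treat parse failures as "too old".
    let major: u32 = compute_cap
        .trim()
        .split('.')
        .next()
        .and_then(|m| m.parse().ok())
        .unwrap_or(0);
    major >= 8
}

fn main() {
    assert!(meets_sm80("8.0"));  // A100 (Ampere)
    assert!(meets_sm80("8.6"));  // RTX 30-series
    assert!(!meets_sm80("7.5")); // Turing: below sm_80
    println!("ok");
}
```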
compared to PyTorch and llama.cpp
WIP
roadmap
Our first goal is to support bfloat16 Llama 3.2 1B inference.
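For context on that goal: bfloat16 keeps float32's 8-bit exponent (so the same dynamic range) but only 8 bits of mantissa, which is why it is popular for inference. A minimal sketch of the standard f32↔bf16 bit conversion, not this crate's implementation:

```rust
/// Convert an f32 to bfloat16 bits: keep the top 16 bits
/// (sign, 8-bit exponent, 7-bit mantissa), rounding to nearest even.
fn f32_to_bf16_bits(x: f32) -> u16 {
    let bits = x.to_bits();
    // Round-to-nearest-even on the 16 bits being discarded.
    let rounding = 0x7FFF + ((bits >> 16) & 1);
    ((bits.wrapping_add(rounding)) >> 16) as u16
}

/// Widen bfloat16 bits back to f32 by zero-filling the low mantissa bits.
fn bf16_bits_to_f32(b: u16) -> f32 {
    f32::from_bits((b as u32) << 16)
}

fn main() {
    // 1.0f32 has bit pattern 0x3F80_0000, so its bf16 bits are 0x3F80.
    assert_eq!(f32_to_bf16_bits(1.0), 0x3F80);
    assert_eq!(bf16_bits_to_f32(0x3F80), 1.0);
    // Powers of two round-trip exactly; other values lose low mantissa bits.
    assert_eq!(bf16_bits_to_f32(f32_to_bf16_bits(-2.0)), -2.0);
    println!("ok");
}
```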