5 unstable releases
Uses the Rust 2024 edition
| Version | Released |
|---|---|
| 0.2.0-pre.1 (new) | Feb 9, 2026 |
| 0.1.1 | Jan 23, 2026 |
| 0.1.0 | Jan 15, 2026 |
| 0.1.0-pre.1 | Dec 18, 2025 |
| 0.0.1 | Dec 5, 2025 |
#1569 in Algorithms
15,627 downloads per month
Used in 34 crates (2 directly)
1.5MB, 36K SLoC
Algorithms
| Algorithms | Variants |
|---|---|
| Random | bernoulli normal uniform |
| Quantization | symmetric per-block per-tensor q2 q4 q8 fp4 |
| Reduction | mean sum prod max min arg[max\|min] per-cube per-plane |
| Matmul | mma unit tma multi-stage specialization ordered multi-rows |
| Convolution | mma unit tma multi-stage im2col |
| Attention | mma unit multi-rows |
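
To make the "symmetric", "per-tensor", and "q8" entries in the Quantization row concrete, here is a minimal CPU sketch in plain Rust. It is not this crate's API: the function names `quantize_symmetric_q8` and `dequantize_q8` are hypothetical, and the choice of the range [-127, 127] is an assumption made for illustration. A per-block variant would presumably compute one scale per fixed-size chunk instead of one for the whole tensor.

```rust
/// Symmetric per-tensor 8-bit quantization on the CPU (illustrative only,
/// hypothetical helper, not part of the crate).
/// Returns the quantized values and the scale needed to dequantize them.
fn quantize_symmetric_q8(values: &[f32]) -> (Vec<i8>, f32) {
    // Largest magnitude in the tensor; a symmetric scheme has no zero-point.
    let max_abs = values.iter().fold(0.0f32, |acc, v| acc.max(v.abs()));
    // Map [-max_abs, max_abs] onto [-127, 127]; use 1.0 for an all-zero tensor.
    let scale = if max_abs == 0.0 { 1.0 } else { max_abs / 127.0 };
    let quantized = values
        .iter()
        .map(|v| (v / scale).round().clamp(-127.0, 127.0) as i8)
        .collect();
    (quantized, scale)
}

/// Inverse mapping: multiply each stored integer by the shared scale.
fn dequantize_q8(quantized: &[i8], scale: f32) -> Vec<f32> {
    quantized.iter().map(|&q| q as f32 * scale).collect()
}

fn main() {
    let values = [-1.0f32, -0.5, 0.0, 0.25, 1.0];
    let (q, scale) = quantize_symmetric_q8(&values);
    let restored = dequantize_q8(&q, scale);
    println!("scale = {scale}, quantized = {q:?}, restored = {restored:?}");
}
```

The round trip is lossy: each restored value differs from its input by at most about half the scale, which is the usual trade-off for storing one byte per element.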
Contributing
If you want to contribute new kernels, please read GUIDE.md first.
Dependencies
~60–100MB, ~2M SLoC