19 releases (breaking)
Uses new Rust 2024
| new 0.24.0 | Dec 23, 2025 |
|---|---|
| 0.23.0 | Oct 16, 2025 |
| 0.22.1 | Sep 18, 2025 |
| 0.20.0 | Jul 6, 2025 |
| 0.12.0 | Jul 30, 2024 |
#1521 in Machine learning
2.5MB
58K
SLoC
Utilities to simplify running auto-regressive RTen models such as transformer decoders.
For working examples, see the examples in the rten-examples
crate which import rten_generate.
rten-generate is a layer on top of RTen which handles the generation loop for auto-regressive transformer models (aka. "transformer decoders" or "generative AI"). This includes managing the KV cache, sampling and post-processing logits etc.
Dependencies
~1.8–3MB
~63K SLoC