19 releases (breaking)

Uses new Rust 2024

new 0.24.0 Dec 23, 2025
0.23.0 Oct 16, 2025
0.22.1 Sep 18, 2025
0.20.0 Jul 6, 2025
0.12.0 Jul 30, 2024

MIT/Apache

2.5MB
58K SLoC

Utilities to simplify running auto-regressive RTen models such as transformer decoders.

For working examples, see the programs in the rten-examples crate that import rten_generate.


rten-generate is a layer on top of RTen that handles the generation loop for auto-regressive transformer models (also known as "transformer decoders" or "generative AI"). This includes managing the KV cache, sampling, and post-processing logits.
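
To make the idea of a generation loop concrete, here is a minimal, self-contained sketch in plain Rust. It does not use the rten or rten_generate APIs. `ToyDecoder`, `CountingDecoder`, `argmax` and `generate` are hypothetical names invented for this illustration, and the "model" is a stand-in that always predicts the previous token plus one. The sketch shows the three responsibilities named above in miniature: the cache (the model keeps state, so only new tokens are fed at each step), sampling (greedy argmax here) and stopping on an end-of-sequence token.

```rust
/// Hypothetical stand-in for a decoder model that keeps an internal cache.
/// Not part of rten or rten_generate.
trait ToyDecoder {
    /// Consume only the tokens not yet seen and return next-token logits.
    fn step(&mut self, new_tokens: &[u32]) -> Vec<f32>;
}

/// Toy model: always predicts `(last token + 1) % vocab_size`.
struct CountingDecoder {
    vocab_size: usize,
    last_token: u32,
    /// Number of tokens processed so far, mimicking the length of a KV cache.
    cached_len: usize,
}

impl ToyDecoder for CountingDecoder {
    fn step(&mut self, new_tokens: &[u32]) -> Vec<f32> {
        self.cached_len += new_tokens.len();
        if let Some(&t) = new_tokens.last() {
            self.last_token = t;
        }
        let next = (self.last_token as usize + 1) % self.vocab_size;
        let mut logits = vec![0.0; self.vocab_size];
        logits[next] = 1.0;
        logits
    }
}

/// Greedy sampling: pick the index of the largest logit.
fn argmax(logits: &[f32]) -> u32 {
    logits
        .iter()
        .enumerate()
        .max_by(|a, b| a.1.total_cmp(b.1))
        .map(|(i, _)| i as u32)
        .unwrap_or(0)
}

/// Generate up to `max_new` tokens, stopping early when `eos` is sampled.
fn generate(model: &mut impl ToyDecoder, prompt: &[u32], max_new: usize, eos: u32) -> Vec<u32> {
    let mut output = Vec::new();
    // First step feeds the whole prompt; later steps feed one token each,
    // because earlier tokens are already held in the model's cache.
    let mut input: Vec<u32> = prompt.to_vec();
    for _ in 0..max_new {
        let logits = model.step(&input);
        let token = argmax(&logits);
        if token == eos {
            break;
        }
        output.push(token);
        input = vec![token];
    }
    output
}
```

Calling `generate` on a `CountingDecoder` with `vocab_size: 5`, prompt `[1]` and `eos` set to `0` yields `[2, 3, 4]`: the loop stops as soon as the toy model wraps around to token 0. A library such as rten-generate takes over this bookkeeping so that application code only has to supply a model, a prompt and a sampling strategy.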

Dependencies

~1.8–3MB
~63K SLoC