10 releases (5 breaking)

new 0.16.0 Feb 8, 2025
0.15.1 Jan 6, 2025
0.15.0 Dec 28, 2024
0.14.1 Nov 16, 2024
0.11.1 Jul 17, 2024

#670 in Machine learning

Download history 57/week @ 2024-10-21 58/week @ 2024-10-28 86/week @ 2024-11-11 30/week @ 2024-11-18 4/week @ 2024-12-02 1/week @ 2024-12-09 108/week @ 2024-12-23 4/week @ 2024-12-30 124/week @ 2025-01-06 4/week @ 2025-01-13 104/week @ 2025-02-03

112 downloads per month

MIT/Apache

2MB
52K SLoC

rten-generate is a layer on top of RTen which handles the generation loop for auto-regressive transformer models (aka. "transformer decoders" or "generative AI"). This includes managing the KV cache, sampling and post-processing logits etc.


lib.rs:

Utilities to simplify running auto-regressive RTen models such as transformer decoders.

For working examples, see the examples in the rten-examples crate which import rten_generate.

Dependencies

~1.6–3MB
~61K SLoC