9 releases (4 breaking)
new 0.15.1 | Jan 6, 2025 |
---|---|
0.15.0 | Dec 28, 2024 |
0.14.1 | Nov 16, 2024 |
0.14.0 | Oct 27, 2024 |
0.11.1 | Jul 17, 2024 |
#678 in Machine learning
112 downloads per month
2MB
49K
SLoC
rten-generate is a layer on top of RTen which handles the generation loop for auto-regressive transformer models (aka. "transformer decoders" or "generative AI"). This includes managing the KV cache, sampling and post-processing logits etc.
lib.rs
:
Utilities to simplify running auto-regressive RTen models such as transformer decoders.
For working examples, see the examples in the rten-examples
crate which import rten_generate
.
Dependencies
~1.5–3MB
~60K SLoC