11 releases (6 breaking)
new 0.17.0 | Apr 9, 2025 |
---|---|
0.16.0 | Feb 8, 2025 |
0.15.1 | Jan 6, 2025 |
0.15.0 | Dec 28, 2024 |
0.12.0 | Jul 30, 2024 |
#814 in Machine learning
83 downloads per month
2.5MB
54K
SLoC
Utilities to simplify running auto-regressive RTen models such as transformer decoders.
For working examples, see the examples in the rten-examples
crate which import rten_generate
.
rten-generate is a layer on top of RTen which handles the generation loop for auto-regressive transformer models (aka. "transformer decoders" or "generative AI"). This includes managing the KV cache, sampling and post-processing logits etc.
Dependencies
~1.7–3MB
~63K SLoC