#triton #nvidia #inference #deep-learning #grpc-client

triton-client

A client for interfacing with NVIDIA Triton inference server

3 releases (breaking)

0.2.0 Aug 15, 2022
0.1.0 Jun 16, 2022
0.0.1 Jun 16, 2022

#1064 in Machine learning

Download history 2/week @ 2024-11-13 7/week @ 2024-11-20 10/week @ 2024-11-27 27/week @ 2024-12-04 23/week @ 2024-12-11 19/week @ 2024-12-18 2/week @ 2024-12-25 9/week @ 2025-01-01 33/week @ 2025-01-08 26/week @ 2025-01-15 25/week @ 2025-01-22 7/week @ 2025-01-29 13/week @ 2025-02-05 4/week @ 2025-02-12 13/week @ 2025-02-19 58/week @ 2025-02-26

88 downloads per month

Apache-2.0

64KB
1K SLoC

C++ 802 SLoC // 0.2% comments Rust 235 SLoC Python 73 SLoC // 0.3% comments

triton-client-rs

A Rust gRPC client library for NVIDIA Triton.

This library provides the necessary setup to generate a Triton client from NVIDIA's Protocol Buffers definitions.

// un-auth'd use of Triton
let client = Client::new("http://localhost:8001/", None).await?;
let models = client
    .repository_index(triton_client::inference::RepositoryIndexRequest {
        repository_name: "".into(), // This should show us models not referenced by repo name.
        ready: false,               // show all models, not just ready ones.
    })
    .await?;

Dependencies

~14–28MB
~441K SLoC