19 releases (7 breaking)

new 0.8.0 Nov 12, 2024
0.6.0 Jun 29, 2024
0.4.1 Feb 28, 2024
0.3.2 Dec 20, 2023
0.3.1 Nov 12, 2023

#260 in Machine learning

Download history (weekly downloads, 2024-07-28 to 2024-11-10): between 6,473 and 10,057 per week

37,104 downloads per month
Used in 107 crates (69 directly)

MIT/Apache

1MB
30K SLoC

Rust 27K SLoC // 0.0% comments
Metal Shading Language 2K SLoC // 0.0% comments
CUDA 1.5K SLoC // 0.0% comments

Contains (Zip file, 2KB) tests/fortran_tensor_3d.pth, (Zip file, 2KB) tests/test.pt, (Zip file, 2KB) tests/test_with_key.pt

candle

Minimalist ML framework for Rust


lib.rs:

ML framework for Rust

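use candle_core::{Device, Tensor};

fn main() -> candle_core::Result<()> {
    // a is a 2x3 matrix holding 0..6; b is a 3x4 matrix holding 0..12.
    let a = Tensor::arange(0f32, 6f32, &Device::Cpu)?.reshape((2, 3))?;
    let b = Tensor::arange(0f32, 12f32, &Device::Cpu)?.reshape((3, 4))?;

    // Matrix product of shape (2, 4).
    let c = a.matmul(&b)?;
    println!("{c}");
    Ok(())
}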
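
To make the result of the example above concrete, here is a minimal sketch that computes the same 2x3 by 3x4 product with plain Rust vectors. It does not use candle at all: `naive_matmul` and `example_product` are hypothetical helpers written for illustration, and the row-major layout is an assumption of this sketch rather than a statement about candle's internals.

```rust
/// Row-major matrix multiply on plain slices: (m x k) * (k x n) -> (m x n).
/// Hypothetical helper for illustration only; not part of candle.
fn naive_matmul(a: &[f32], b: &[f32], m: usize, k: usize, n: usize) -> Vec<f32> {
    assert_eq!(a.len(), m * k, "lhs must hold m * k elements");
    assert_eq!(b.len(), k * n, "rhs must hold k * n elements");
    let mut c = vec![0f32; m * n];
    for i in 0..m {
        for j in 0..n {
            // Dot product of row i of `a` with column j of `b`.
            for p in 0..k {
                c[i * n + j] += a[i * k + p] * b[p * n + j];
            }
        }
    }
    c
}

/// Same inputs as the candle example: 0..6 viewed as 2x3 and 0..12 viewed as 3x4.
fn example_product() -> Vec<f32> {
    let a: Vec<f32> = (0..6).map(|x| x as f32).collect();
    let b: Vec<f32> = (0..12).map(|x| x as f32).collect();
    naive_matmul(&a, &b, 2, 3, 4)
}
```

Calling `example_product()` yields the flattened values 20, 23, 26, 29, 56, 68, 80, 92, that is, the rows [20, 23, 26, 29] and [56, 68, 80, 92], which is what the tensor `c` should contain.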

Features

  • Simple syntax (looks and feels like PyTorch)
  • CPU and CUDA backends (and M1 support)
  • Enables small, fast serverless (CPU) deployments
  • Model training
  • Distributed computing (NCCL)
  • Models out of the box (Llama, Whisper, Falcon, ...)

FAQ

  • Why Candle?

Candle stems from the need to reduce binary size in order to make serverless deployment possible, by making the whole engine smaller than PyTorch's very large library.

It also removes Python from production workloads. Python can add real overhead in more complex workflows, and the GIL is a notorious source of headaches.

Rust is cool, and a lot of the HF ecosystem already has Rust crates, such as safetensors and tokenizers.

Other Crates

Candle consists of a number of crates. This crate holds the core common data structures, but you may wish to look at the docs for the other crates, which can be found here:

Dependencies

~8–20MB
~338K SLoC