#huggingface #hub #python-packages #machine-learning #hf #tokio

candle-hf-hub

This crates aims ease the interaction with huggingface It aims to be compatible with huggingface_hub python package, but only implements a smaller subset of functions

1 unstable release

0.3.3 Oct 26, 2024

#489 in Machine learning

Download history 434/week @ 2024-10-22 416/week @ 2024-10-29 243/week @ 2024-11-05 720/week @ 2024-11-12 307/week @ 2024-11-19 2008/week @ 2024-11-26 1449/week @ 2024-12-03 1518/week @ 2024-12-10

5,405 downloads per month
Used in 4 crates (3 directly)

Apache-2.0

76KB
1.5K SLoC

This crates aims to emulate and be compatible with the huggingface_hub python package.

compatible means the Api should reuse the same files skipping downloads if they are already present and whenever this crate downloads or modifies this cache it should be consistent with huggingface_hub

At this time only a limited subset of the functionality is present, the goal is to add new features over time. We are currently treating this as an internel/external tool, meaning we will are currently modifying everything at will for out internal needs. This will eventually stabilize as it matures to accomodate most of our needs.

If you're interested in using this, you're welcome to do it but be warned about potential changing grounds.

If you want to contribute, you are more than welcome.

However allowing new features or creating new features might be denied by lack of maintainability time. We're focusing on what we currently internally need. Hopefully that subset is already interesting to more users.

How to use

Add the dependency

cargo add hf-hub  # --features tokio

tokio feature will enable an async (and potentially faster) API.

Use the crate:

use hf_hub::api::sync::Api;

let api = Api::new().unwrap();

let repo = api.model("bert-base-uncased".to_string());
let _filename = repo.get("config.json").unwrap();

// filename  is now the local location within hf cache of the config.json file

SSL/TLS

When using the ureq feature, you will always use its default TLS backend which is rustls.

When using tokio, by default default-tls will be enabled, which means OpenSSL. If you want/need to use rustls, disable the default features and use rustls-tls in conjunction with tokio.

Dependencies

~5–18MB
~259K SLoC