#machine-learning #huggingface #hub #python-packages #hf

hf-hub

This crates aims ease the interaction with huggingface It aims to be compatible with huggingface_hub python package, but only implements a smaller subset of functions

10 releases

0.4.1 Dec 31, 2024
0.4.0 Dec 30, 2024
0.3.2 Oct 4, 2023
0.3.1 Sep 13, 2023
0.1.3 Jul 20, 2023

#22 in Machine learning

Download history 14544/week @ 2024-09-25 15479/week @ 2024-10-02 13980/week @ 2024-10-09 13667/week @ 2024-10-16 14354/week @ 2024-10-23 12963/week @ 2024-10-30 12670/week @ 2024-11-06 13083/week @ 2024-11-13 12134/week @ 2024-11-20 14375/week @ 2024-11-27 16283/week @ 2024-12-04 16199/week @ 2024-12-11 12859/week @ 2024-12-18 8019/week @ 2024-12-25 13028/week @ 2025-01-01 16423/week @ 2025-01-08

53,705 downloads per month
Used in 89 crates (45 directly)

Apache-2.0

110KB
2.5K SLoC

This crates aims to emulate and be compatible with the huggingface_hub python package.

compatible means the Api should reuse the same files skipping downloads if they are already present and whenever this crate downloads or modifies this cache it should be consistent with huggingface_hub

At this time only a limited subset of the functionality is present, the goal is to add new features over time. We are currently treating this as an internel/external tool, meaning we will are currently modifying everything at will for out internal needs. This will eventually stabilize as it matures to accomodate most of our needs.

If you're interested in using this, you're welcome to do it but be warned about potential changing grounds.

If you want to contribute, you are more than welcome.

However allowing new features or creating new features might be denied by lack of maintainability time. We're focusing on what we currently internally need. Hopefully that subset is already interesting to more users.

How to use

Add the dependency

cargo add hf-hub  # --features tokio

tokio feature will enable an async (and potentially faster) API.

Use the crate:

use hf_hub::api::sync::Api;

let api = Api::new().unwrap();

let repo = api.model("bert-base-uncased".to_string());
let _filename = repo.get("config.json").unwrap();

// filename  is now the local location within hf cache of the config.json file

SSL/TLS

This library uses tokio default TLS implementations which is native-tls (openssl) for tokio.

If you want control over the TLS backend you can remove the default features and only add the backend you are intending to use.

cargo add hf-hub -no-default-features --features ureq,rustls-tls
cargo add hf-hub -no-default-features --features ureq,native-tls
cargo add hf-hub -no-default-features --features tokio,rustls-tls
cargo add hf-hub -no-default-features --features tokio,native-tls

When using the ureq feature, you will always use its default TLS backend which is rustls.

When using tokio, by default default-tls will be enabled, which means OpenSSL. If you want/need to use rustls, disable the default features and use rustls-tls in conjunction with tokio.

Dependencies

~5–21MB
~311K SLoC