6 releases (3 breaking)

0.7.0 Oct 16, 2023
0.6.1 Dec 3, 2022
0.6.0 Jun 15, 2022
0.5.1 Mar 1, 2022
0.4.0 Apr 28, 2021

#898 in Machine learning

Download history 2/week @ 2024-04-07 6/week @ 2024-04-14 7/week @ 2024-04-21 1/week @ 2024-04-28 2/week @ 2024-05-12 48/week @ 2024-05-19 20/week @ 2024-05-26 49/week @ 2024-06-02 54/week @ 2024-06-09 63/week @ 2024-06-16 16/week @ 2024-06-23 54/week @ 2024-07-07 13/week @ 2024-07-14 15/week @ 2024-07-21

83 downloads per month
Used in rosella

MIT/Apache

305KB
6K SLoC

Preprocessing

The Big Picture

linfa-preprocessing is a crate in the linfa ecosystem, an effort to create a toolkit for classical Machine Learning implemented in pure Rust, akin to Python's scikit-learn.

Current state

linfa-preprocessing provides a pure Rust implementation of:

  • Standard scaling
  • Min-max scaling
  • Max Abs Scaling
  • Normalization
  • Count vectorization
  • TfIdf vectorization
  • Whitening

Examples

There are various usage examples in the examples/ directory. To run, use:

$ cargo run --release --example count_vectorization
$ cargo run --release --example tfidf_vectorization
$ cargo run --release --example scaling
$ cargo run --release --example whitening

BLAS/Lapack backend

See this section to enable an external BLAS/LAPACK backend.

License

Dual-licensed to be compatible with the Rust project.

Licensed under the Apache License, Version 2.0 http://www.apache.org/licenses/LICENSE-2.0 or the MIT license http://opensource.org/licenses/MIT, at your option. This file may not be copied, modified, or distributed except according to those terms.

Dependencies

~10–21MB
~351K SLoC