1 unstable release

0.0.0 Feb 12, 2021

#44 in #example

MIT/Apache

20KB
231 lines

Rust

Example-data

This crate provides toy-datasets to be used in rust. Because we are using the Apache Arrow memory format it can easily be used for multiple dataframe implementations.

Currently, all datasets are embedded in the library.

Usage

use example_data::{Repo};
use arrow::record_batch::RecordBatch;

let iris = Repo::default().load_table("iris").unwrap();
let batches : Vec<RecordBatch> = iris.data();
let doc : &str = iris.doc().unwrap();

Suported data

  • iris: Classification if iris-flowers by petal- and sepal size
  • boston: Prediction of housing price based on accessibility, pollution, ...

Contribute

We use github. If you have any questions feel free to open an issue or send a pull-request. We'll be happy to assist.


lib.rs:

example-data is created to easily load common datasets.

We use the Apache Arrow memory format which allows for simple conversion to multiple dataframe implementations.

use example_data::{Repo};
use arrow::record_batch::RecordBatch;

let iris = Repo::default().load_table("iris").unwrap();
let batches : Vec<RecordBatch> = iris.data();
let doc : &str = iris.doc().unwrap();

Supported datatables

  • iris
  • boston

Dependencies

~7.5–10MB
~164K SLoC