Cargo Features

DataProf has no features set by default.

[dependencies]
dataprof = { version = "0.4.80", features = ["python", "python-async", "database", "arrow", "parquet", "postgres", "mysql", "sqlite", "minimal", "production", "all-db"] }

Features

default

Minimal by default - MUCH faster compilation

python python-async?

Individual features

Enables pyo3

Python bindings

Affects dataprof::python

python-async = python

Enables pyo3-async-runtimes and tokio

tokio:

Database connectors and async runtime
Latest: 1.48.0 (patch available via cargo update)

Affects python::database_async

database mysql? postgres? sqlite?

Enables async-trait, tokio, and url

Affects commands::database, config::DataprofConfig.database, config::DatabaseSettings, config::DatabaseSamplingConfig, dataprof::database, python::database_async

arrow parquet?

Enables arrow

Apache Arrow for columnar processing

Affects columnar::arrow_profiler, columnar::record_batch_analyzer

parquet = arrow

Enables parquet

Affects commands::is_parquet_file, parsers::parquet, analysis::analyze_parquet_file, analysis::analyze_parquet_with_quality_py

postgres all-db? production? = database

Enables postgres of sqlx

mysql all-db? production? = database

Enables mysql of sqlx

sqlite all-db? = database

Enables sqlite of sqlx

minimal

Sensible feature combinations
Just CSV processing - fastest builds

production = mysql, postgres

Most common production databases

all-db = mysql, postgres, sqlite

All databases