Cargo Features

[dependencies]
transmutation = { version = "0.3.1", default-features = false, features = ["tesseract", "pdf-to-image", "office", "image-ocr", "audio", "video", "archives-extended", "docling-ffi", "cli", "full", "tracing-opentelemetry", "roxmltree"] }
default = office

Core formats (PDF, HTML, XML, ZIP) are always enabled

tesseract image-ocr?

Core engines (pure Rust implementations)

Enables leptess and tesseract

tesseract:

NOTE: We are NOT integrating with Docling - we are building a pure Rust competitor The docling repository is for reference only, not as a dependency
OCR (Tesseract)

pdf-to-image full?

Format support (pure Rust implementations)
Note: PDF, HTML, XML, and basic ZIP support are ALWAYS enabled (no feature flags)
PDF rendering to images per page (optional)

Enables pdfium-render

Only for PDF rendering to images

office default full? = docx-rs, umya-spreadsheet

Office formats (DOCX, XLSX, PPTX)

Affects converters::docx, converters::xlsx, converters::pptx

image-ocr full? = tesseract

Affects converters::image

audio full?

Audio transcription (requires external whisper CLI)

Affects converters::audio

video full?

Video transcription (requires external ffmpeg + whisper CLI)

Affects converters::video

archives-extended full? = flate2, sevenz-rust, tar

Extended archive support (TAR, GZ, 7Z)

docling-ffi

Advanced layout analysis (C++ FFI to docling-parse + ML models)
Enable C++ docling-parse + ONNX ML models

Enables rstar

Spatial indexing

and ndarray ^0.15, ort, and pdfium-render

ort:

ML & Computer Vision (for docling-ffi feature)

Affects engines::docling_parse_ffi, engines::docling_json_parser, engines::rule_based_layout, engines::layout_postprocessor, transmutation::document, transmutation::ml, ml::preprocessing, ml::layout_model, ml::table_structure_model, ml::model_manager, ml::model_cache, ml::cell_matching, ml::DocumentModel

cli full? = clap, colored, console, indicatif

CLI

Required by the binary

full = archives-extended, audio, cli, image-ocr, office, pdf-to-image, video

All features (no external integrations - Transmutation is a library/CLI)

Features from optional dependencies

In crates that don't use the dep: syntax, optional dependencies automatically become Cargo features. These features may have been created by mistake, and this functionality may be removed in the future.

tracing-opentelemetry implicit feature

Enables tracing-opentelemetry ^0.30

tracing-opentelemetry:

OpenTelemetry integration for tracing

roxmltree implicit feature

Enables roxmltree

roxmltree:

Represent an XML as a read-only tree

tar archives-extended?
flate2 archives-extended?
sevenz-rust archives-extended?
docx-rs office

Enables docx-rs

Office formats (optional)

umya-spreadsheet office
clap cli?

Enables clap

CLI dependencies

indicatif cli?

Enables indicatif ^0.17

console cli?

Enables console ^0.15

colored cli?

Enables colored ^2.2