23 releases (stable)
| 1.5.0 | May 28, 2024 |
|---|---|
| 1.4.1 | Dec 19, 2023 |
| 1.4.0 | Nov 17, 2023 |
| 1.2.5 | Apr 25, 2023 |
| 0.4.1 | Nov 10, 2021 |
#8 in #unicode-data
1,418 downloads per month
15MB
118K
SLoC
icu_datagen 
icu_datagen is a library to generate data files that can be used in ICU4X data providers.
Data files can be generated either programmatically (i.e. in build.rs), or through a
command-line utility.
Also see our datagen tutorial.
Examples
Rust API
use icu_datagen::blob_exporter::*;
use icu_datagen::prelude::*;
use std::fs::File;
DatagenDriver::new()
.with_keys([icu::list::provider::AndListV1Marker::KEY])
.with_locales_and_fallback([LocaleFamily::FULL], Default::default())
.export(
&DatagenProvider::new_latest_tested(),
BlobExporter::new_v2_with_sink(Box::new(
File::create("data.postcard").unwrap(),
)),
)
.unwrap();
Command line
The command line interface can be installed through Cargo.
$ cargo install icu_datagen
Once the tool is installed, you can invoke it like this:
$ icu4x-datagen --keys all --locales de en-AU --format blob --out data.postcard
More details can be found by running --help.
Cargo features
This crate has a lot of dependencies, some of which are not required for all operating modes. These default Cargo features can be disabled to reduce dependencies:
baked_exporter- enables the
baked_exportermodule - enables the
--format modCLI argument
- enables the
blob_exporter- enables the
blob_exportermodule, a reexport oficu_provider_blob::export - enables the
--format blobCLI argument
- enables the
fs_exporter- enables the
fs_exportermodule, a reexport oficu_provider_fs::export - enables the
--format dirCLI argument
- enables the
networking- enables methods on
DatagenProviderthat fetch source data from the network - enables the
--cldr-tag,--icu-export-tag, and--segmenter-lstm-tagCLI arguments that download data
- enables methods on
rayon- enables parallelism during export
use_wasm/use_icu4c- see the documentation on
icu_codepointtrie_builder
- see the documentation on
bin- required by the CLI and enabled by default to make
cargo installwork
- required by the CLI and enabled by default to make
legacy_api- enables the deprecated pre-1.3 API
- enabled by default for semver stability
- will be removed in 2.0.
icu_experimental- enables data generation for keys defined in the unstable
icu_experimentalcrate - note that this features affects the behaviour of
all_keys
- enables data generation for keys defined in the unstable
The meta-feature experimental_components is available to activate all experimental components.
More Information
For more information on development, authorship, contributing etc. please visit ICU4X home page.
Dependencies
~5–18MB
~188K SLoC