9 stable releases

1.5.0 May 28, 2024
1.4.0 Nov 16, 2023
1.3.2 Oct 3, 2023
1.3.0 Sep 23, 2023
1.0.0-beta1 Aug 5, 2022

#8 in Internationalization (i18n)

Download history 1433242/week @ 2025-01-18 1463871/week @ 2025-01-25 1629983/week @ 2025-02-01 1687658/week @ 2025-02-08 1812032/week @ 2025-02-15 2326169/week @ 2025-02-22 2466582/week @ 2025-03-01 2499525/week @ 2025-03-08 2794899/week @ 2025-03-15 3054066/week @ 2025-03-22 2449245/week @ 2025-03-29 2654397/week @ 2025-04-05 2375118/week @ 2025-04-12 2352554/week @ 2025-04-19 2279673/week @ 2025-04-26 1988602/week @ 2025-05-03

9,411,693 downloads per month
Used in 136 crates (13 directly)

Unicode-3.0

1MB
20K SLoC

icu_locid_transform crates.io

Canonicalization of locale identifiers based on CLDR data.

This module is published as its own crate (icu_locid_transform) and as part of the icu crate. See the latter for more details on the ICU4X project.

It currently supports locale canonicalization based upon the canonicalization algorithm from UTS #35: Unicode LDML 3. LocaleId Canonicalization, as well as the minimize and maximize likely subtags algorithms as described in UTS #35: Unicode LDML 3. Likely Subtags.

The maximize method potentially updates a passed in locale in place depending up the results of running the 'Add Likely Subtags' algorithm from UTS #35: Unicode LDML 3. Likely Subtags.

This minimize method returns a new Locale that is the result of running the 'Remove Likely Subtags' algorithm from UTS #35: Unicode LDML 3. Likely Subtags.

Examples

use icu::locid::Locale;
use icu::locid_transform::{LocaleCanonicalizer, TransformResult};

let lc = LocaleCanonicalizer::new();

let mut locale: Locale = "ja-Latn-fonipa-hepburn-heploc"
    .parse()
    .expect("parse failed");
assert_eq!(lc.canonicalize(&mut locale), TransformResult::Modified);
assert_eq!(locale, "ja-Latn-alalc97-fonipa".parse::<Locale>().unwrap());
use icu::locid::locale;
use icu::locid_transform::{LocaleExpander, TransformResult};

let lc = LocaleExpander::new();

let mut locale = locale!("zh-CN");
assert_eq!(lc.maximize(&mut locale), TransformResult::Modified);
assert_eq!(locale, locale!("zh-Hans-CN"));

let mut locale = locale!("zh-Hant-TW");
assert_eq!(lc.maximize(&mut locale), TransformResult::Unmodified);
assert_eq!(locale, locale!("zh-Hant-TW"));
use icu::locid::locale;
use icu::locid_transform::{LocaleExpander, TransformResult};
use writeable::assert_writeable_eq;

let lc = LocaleExpander::new();

let mut locale = locale!("zh-Hans-CN");
assert_eq!(lc.minimize(&mut locale), TransformResult::Modified);
assert_eq!(locale, locale!("zh"));

let mut locale = locale!("zh");
assert_eq!(lc.minimize(&mut locale), TransformResult::Unmodified);
assert_eq!(locale, locale!("zh"));

More Information

For more information on development, authorship, contributing etc. please visit ICU4X home page.

Dependencies

~0.6–1.2MB
~25K SLoC