9 stable releases

1.5.0 May 28, 2024
1.4.0 Nov 16, 2023
1.3.2 Oct 3, 2023
1.3.0 Sep 23, 2023
1.0.0-beta1 Aug 5, 2022

#98 in Internationalization (i18n)


5,997,597 downloads per month
Used in 136 crates (25 directly)

Unicode-3.0

1MB
16K SLoC

icu_locid_transform

Canonicalization of locale identifiers based on CLDR data.

This module is published as its own crate (icu_locid_transform) and as part of the icu crate. See the latter for more details on the ICU4X project.

It currently supports locale canonicalization based upon the canonicalization algorithm from UTS #35: Unicode LDML 3. LocaleId Canonicalization, as well as the minimize and maximize likely subtags algorithms as described in UTS #35: Unicode LDML 3. Likely Subtags.
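
To give a feel for what canonicalization involves, here is a minimal, std-only sketch of two of its ingredients: casing normalization and replacement of deprecated language subtags. It is not the ICU4X implementation. The function names (canonicalize_toy, title_case) and the three-entry alias table are hypothetical stand-ins for the CLDR alias data, and the sketch only handles simple language-script-region tags.

```rust
/// Toy alias table: (deprecated language subtag, preferred replacement).
/// A hypothetical three-entry stand-in for the CLDR alias data.
const LANGUAGE_ALIASES: &[(&str, &str)] = &[("iw", "he"), ("in", "id"), ("ji", "yi")];

/// Upper-case the first character and lower-case the rest (script casing).
fn title_case(subtag: &str) -> String {
    let lower = subtag.to_ascii_lowercase();
    let mut chars = lower.chars();
    match chars.next() {
        Some(first) => format!("{}{}", first.to_ascii_uppercase(), chars.as_str()),
        None => String::new(),
    }
}

/// Normalize casing subtag by subtag and replace a deprecated language subtag.
/// Assumption: the input is a simple hyphen-separated tag; extensions are not handled.
fn canonicalize_toy(tag: &str) -> String {
    let mut out: Vec<String> = Vec::new();
    for (i, subtag) in tag.split('-').enumerate() {
        let alphabetic = subtag.chars().all(|c| c.is_ascii_alphabetic());
        let normalized = if i == 0 {
            // Language: lower case, then look for an alias replacement.
            let lower = subtag.to_ascii_lowercase();
            let replacement = LANGUAGE_ALIASES
                .iter()
                .find(|(old, _)| *old == lower.as_str())
                .map(|(_, new)| new.to_string());
            replacement.unwrap_or(lower)
        } else if subtag.len() == 4 && alphabetic {
            // Four letters: treat as a script subtag.
            title_case(subtag)
        } else if subtag.len() == 2 && alphabetic {
            // Two letters: treat as a region subtag.
            subtag.to_ascii_uppercase()
        } else {
            // Anything else (for example variants) is lower-cased.
            subtag.to_ascii_lowercase()
        };
        out.push(normalized);
    }
    out.join("-")
}
```

With this table, canonicalize_toy("IW-il") yields "he-IL" and canonicalize_toy("zh-hans-cn") yields "zh-Hans-CN". The real algorithm also rewrites regions, scripts, variants and extension keywords, which this sketch leaves out.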

The maximize method potentially updates a passed-in locale in place, depending on the results of running the 'Add Likely Subtags' algorithm from UTS #35: Unicode LDML 3. Likely Subtags.

The minimize method likewise potentially updates a passed-in locale in place, depending on the results of running the 'Remove Likely Subtags' algorithm from UTS #35: Unicode LDML 3. Likely Subtags.
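
The two likely-subtags operations can be sketched with the standard library alone. This is a simplified illustration under stated assumptions, not the ICU4X code: ToyLocale, toy, lookup, and the four-row LIKELY table are hypothetical, the table is a tiny stand-in for the CLDR likely-subtags data, and both functions return a bool (true when the locale changed) rather than a TransformResult.

```rust
/// Toy likely-subtags rows: (language, script key, region key, likely script, likely region).
/// A hypothetical four-row stand-in for the CLDR data.
const LIKELY: &[(&str, Option<&str>, Option<&str>, &str, &str)] = &[
    ("zh", None, None, "Hans", "CN"),
    ("zh", Some("Hant"), None, "Hant", "TW"),
    ("zh", None, Some("TW"), "Hant", "TW"),
    ("en", None, None, "Latn", "US"),
];

#[derive(Clone, Debug, PartialEq)]
struct ToyLocale {
    language: String,
    script: Option<String>,
    region: Option<String>,
}

/// Convenience constructor for the examples.
fn toy(language: &str, script: Option<&str>, region: Option<&str>) -> ToyLocale {
    ToyLocale {
        language: language.to_string(),
        script: script.map(str::to_string),
        region: region.map(str::to_string),
    }
}

/// Find the row whose key matches exactly.
fn lookup(lang: &str, script: Option<&str>, region: Option<&str>) -> Option<(&'static str, &'static str)> {
    LIKELY
        .iter()
        .find(|(l, s, r, _, _)| *l == lang && *s == script && *r == region)
        .map(|&(_, _, _, s, r)| (s, r))
}

/// Add likely subtags in place: fill in a missing script and/or region.
fn maximize(loc: &mut ToyLocale) -> bool {
    if loc.script.is_some() && loc.region.is_some() {
        return false; // nothing left to add
    }
    // Try the most specific key first, then fall back to the language alone.
    let hit = lookup(&loc.language, loc.script.as_deref(), loc.region.as_deref())
        .or_else(|| lookup(&loc.language, None, None));
    match hit {
        Some((script, region)) => {
            if loc.script.is_none() {
                loc.script = Some(script.to_string());
            }
            if loc.region.is_none() {
                loc.region = Some(region.to_string());
            }
            true
        }
        None => false,
    }
}

/// Remove likely subtags in place: pick the shortest candidate that still
/// maximizes to the same result as the input.
fn minimize(loc: &mut ToyLocale) -> bool {
    let mut max = loc.clone();
    maximize(&mut max);
    let candidates = [
        ToyLocale { language: max.language.clone(), script: None, region: None },
        ToyLocale { language: max.language.clone(), script: None, region: max.region.clone() },
        ToyLocale { language: max.language.clone(), script: max.script.clone(), region: None },
    ];
    for cand in candidates.iter() {
        let mut trial = cand.clone();
        maximize(&mut trial);
        if trial == max {
            let changed = *loc != *cand;
            *loc = cand.clone();
            return changed;
        }
    }
    let changed = *loc != max;
    *loc = max;
    changed
}
```

In this sketch, maximizing zh-CN gives zh-Hans-CN, minimizing zh-Hans-CN gives zh, and minimizing zh-Hant-TW gives zh-TW, because zh alone would maximize to a different script and region.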

Examples

use icu::locid::Locale;
use icu::locid_transform::{LocaleCanonicalizer, TransformResult};

let lc = LocaleCanonicalizer::new();

let mut locale: Locale = "ja-Latn-fonipa-hepburn-heploc"
    .parse()
    .expect("parse failed");
assert_eq!(lc.canonicalize(&mut locale), TransformResult::Modified);
assert_eq!(locale, "ja-Latn-alalc97-fonipa".parse::<Locale>().unwrap());

// Add likely subtags in place with LocaleExpander::maximize.
use icu::locid::locale;
use icu::locid_transform::{LocaleExpander, TransformResult};

let lc = LocaleExpander::new();

let mut locale = locale!("zh-CN");
assert_eq!(lc.maximize(&mut locale), TransformResult::Modified);
assert_eq!(locale, locale!("zh-Hans-CN"));

let mut locale = locale!("zh-Hant-TW");
assert_eq!(lc.maximize(&mut locale), TransformResult::Unmodified);
assert_eq!(locale, locale!("zh-Hant-TW"));

// Remove likely subtags in place with LocaleExpander::minimize.
use icu::locid::locale;
use icu::locid_transform::{LocaleExpander, TransformResult};

let lc = LocaleExpander::new();

let mut locale = locale!("zh-Hans-CN");
assert_eq!(lc.minimize(&mut locale), TransformResult::Modified);
assert_eq!(locale, locale!("zh"));

let mut locale = locale!("zh");
assert_eq!(lc.minimize(&mut locale), TransformResult::Unmodified);
assert_eq!(locale, locale!("zh"));

More Information

For more information on development, authorship, contributing, etc., please visit the ICU4X home page.

Dependencies

~0.6–1.2MB
~24K SLoC