#unicode-normalization #recomposition #unicode-text #text #decomposition #normalization #unicode

no-std unicode-normalization

This crate provides functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15

28 releases

0.1.24 Sep 17, 2024
0.1.23 Feb 20, 2024
0.1.22 Sep 16, 2022
0.1.21 Jul 1, 2022
0.1.1 Jul 9, 2015

#12 in Text processing

Download history 2222615/week @ 2025-01-28 2207979/week @ 2025-02-04 1987808/week @ 2025-02-11 2273392/week @ 2025-02-18 2066550/week @ 2025-02-25 2072128/week @ 2025-03-04 2109985/week @ 2025-03-11 2595644/week @ 2025-03-18 2601731/week @ 2025-03-25 2274982/week @ 2025-04-01 2362841/week @ 2025-04-08 1996604/week @ 2025-04-15 2033936/week @ 2025-04-22 1906757/week @ 2025-04-29 2035680/week @ 2025-05-06 1716080/week @ 2025-05-13

7,979,886 downloads per month
Used in 10,732 crates (235 directly)

MIT/Apache

710KB
24K SLoC

unicode-normalization

Build Status Docs

Unicode character composition and decomposition utilities as described in Unicode Standard Annex #15.

This crate requires Rust 1.36+.

extern crate unicode_normalization;

use unicode_normalization::char::compose;
use unicode_normalization::UnicodeNormalization;

fn main() {
    assert_eq!(compose('A','\u{30a}'), Some('Å'));

    let s = "ÅΩ";
    let c = s.nfc().collect::<String>();
    assert_eq!(c, "ÅΩ");
}

crates.io

You can use this package in your project by adding the following to your Cargo.toml:

[dependencies]
unicode-normalization = "0.1.23"

no_std + alloc support

This crate is completely no_std + alloc compatible. This can be enabled by disabling the std feature, i.e. specifying default-features = false for this crate on your Cargo.toml.

Dependencies