#unicode-characters #unicode #width #text #no-alloc

no-std unicode-width

Determine displayed width of char and str types according to Unicode Standard Annex #11 rules

14 releases

new 0.1.12 Apr 26, 2024
0.1.11 Sep 19, 2023
0.1.10 Sep 13, 2022
0.1.9 Sep 16, 2021
0.1.2 Jul 9, 2015

#7 in Text processing

Download history 1135051/week @ 2024-01-07 1176045/week @ 2024-01-14 1208786/week @ 2024-01-21 1255647/week @ 2024-01-28 1302912/week @ 2024-02-04 1289232/week @ 2024-02-11 1281968/week @ 2024-02-18 1352834/week @ 2024-02-25 1373868/week @ 2024-03-03 1362060/week @ 2024-03-10 1383969/week @ 2024-03-17 1380856/week @ 2024-03-24 1399251/week @ 2024-03-31 1382846/week @ 2024-04-07 1450796/week @ 2024-04-14 1398957/week @ 2024-04-21

5,729,495 downloads per month
Used in 21,990 crates (582 directly)

MIT/Apache

97KB
958 lines

unicode-width

Build status crates.io version Docs status

Determine displayed width of char and str types according to Unicode Standard Annex #11, other portions of the Unicode standard, and common implementations of POSIX wcwidth().

This crate is #![no_std].

use unicode_width::UnicodeWidthStr;

fn main() {
    let teststr = "Hello, world!";
    let width = UnicodeWidthStr::width(teststr);
    println!("{}", teststr);
    println!("The above string is {} columns wide.", width);
    let width = teststr.width_cjk();
    println!("The above string is {} columns wide (CJK).", width);
}

NOTE: The computed width values may not match the actual rendered column width. For example, the woman scientist emoji comprises of a woman emoji, a zero-width joiner and a microscope emoji. Such emoji ZWJ sequences are considered to have the sum of the widths of their constituent parts:

extern crate unicode_width;
use unicode_width::UnicodeWidthStr;

fn main() {
    assert_eq!(UnicodeWidthStr::width("👩"), 2); // Woman
    assert_eq!(UnicodeWidthStr::width("🔬"), 2); // Microscope
    assert_eq!(UnicodeWidthStr::width("👩‍🔬"), 4); // Woman scientist
}

Additionally, defective combining character sequences and nonstandard Korean jamo sequences may be rendered with a different width than what this crate says. (This is not an exhaustive list.)

crates.io

You can use this package in your project by adding the following to your Cargo.toml:

[dependencies]
unicode-width = "0.1.11"

Dependencies

~200KB