#unicode-characters #unicode #width #text #no-alloc

no-std unicode-width

Determine displayed width of char and str types according to Unicode Standard Annex #11 rules

14 releases

new 0.1.12 Apr 26, 2024
0.1.11 Sep 19, 2023
0.1.10 Sep 13, 2022
0.1.9 Sep 16, 2021
0.1.2 Jul 9, 2015

#6 in Text processing

Download history 1118140/week @ 2024-01-10 1240913/week @ 2024-01-17 1203160/week @ 2024-01-24 1286738/week @ 2024-01-31 1290977/week @ 2024-02-07 1276693/week @ 2024-02-14 1326292/week @ 2024-02-21 1359221/week @ 2024-02-28 1377864/week @ 2024-03-06 1358922/week @ 2024-03-13 1412527/week @ 2024-03-20 1340030/week @ 2024-03-27 1412359/week @ 2024-04-03 1425160/week @ 2024-04-10 1460644/week @ 2024-04-17 1243300/week @ 2024-04-24

5,797,542 downloads per month
Used in 22,044 crates (582 directly)

MIT/Apache

97KB
958 lines

unicode-width

Build status crates.io version Docs status

Determine displayed width of char and str types according to Unicode Standard Annex #11, other portions of the Unicode standard, and common implementations of POSIX wcwidth().

This crate is #![no_std].

use unicode_width::UnicodeWidthStr;

fn main() {
    let teststr = "Hello, world!";
    let width = UnicodeWidthStr::width(teststr);
    println!("{}", teststr);
    println!("The above string is {} columns wide.", width);
    let width = teststr.width_cjk();
    println!("The above string is {} columns wide (CJK).", width);
}

NOTE: The computed width values may not match the actual rendered column width. For example, the woman scientist emoji comprises of a woman emoji, a zero-width joiner and a microscope emoji. Such emoji ZWJ sequences are considered to have the sum of the widths of their constituent parts:

extern crate unicode_width;
use unicode_width::UnicodeWidthStr;

fn main() {
    assert_eq!(UnicodeWidthStr::width("👩"), 2); // Woman
    assert_eq!(UnicodeWidthStr::width("🔬"), 2); // Microscope
    assert_eq!(UnicodeWidthStr::width("👩‍🔬"), 4); // Woman scientist
}

Additionally, defective combining character sequences and nonstandard Korean jamo sequences may be rendered with a different width than what this crate says. (This is not an exhaustive list.)

crates.io

You can use this package in your project by adding the following to your Cargo.toml:

[dependencies]
unicode-width = "0.1.11"

Dependencies

~200KB