#unicode-characters #unicode #width #text #no-alloc

no-std unicode-width

Determine displayed width of char and str types according to Unicode Standard Annex #11 rules

14 releases

new 0.1.12 Apr 26, 2024
0.1.11 Sep 19, 2023
0.1.10 Sep 13, 2022
0.1.9 Sep 16, 2021
0.1.2 Jul 9, 2015

#6 in Text processing

5,776,914 downloads per month
Used in 22,060 crates (583 directly)

MIT/Apache

97KB
958 lines

unicode-width

Determine displayed width of char and str types according to Unicode Standard Annex #11, other portions of the Unicode standard, and common implementations of POSIX wcwidth().

This crate is #![no_std].

use unicode_width::UnicodeWidthStr;

fn main() {
    let teststr = "Hello, world!";
    let width = UnicodeWidthStr::width(teststr);
    println!("{}", teststr);
    println!("The above string is {} columns wide.", width);
    let width = teststr.width_cjk();
    println!("The above string is {} columns wide (CJK).", width);
}
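
A common reason to compute a column width is to pad text so that terminal columns line up. The following is a minimal sketch of that idea using only the standard library. The names `pad_right` and `measure` are hypothetical and not part of this crate. The measuring function is passed in by the caller, and the `main` below uses a char count purely as a stand-in, which is not a real display width.

```rust
/// Pads `s` on the right with spaces until it occupies `columns` columns,
/// as reported by the caller-supplied `measure` function.
/// If `s` is already at least that wide, it is returned unchanged.
/// (Hypothetical helper for illustration; not part of unicode-width.)
fn pad_right<F>(s: &str, columns: usize, measure: F) -> String
where
    F: Fn(&str) -> usize,
{
    let used = measure(s);
    // saturating_sub avoids underflow when the text is wider than the target.
    let padding = columns.saturating_sub(used);
    let mut out = String::with_capacity(s.len() + padding);
    out.push_str(s);
    out.extend(std::iter::repeat(' ').take(padding));
    out
}

fn main() {
    // Stand-in measure: counts chars. This is NOT a display width and is
    // only correct for text where every char occupies one column.
    let by_chars = |s: &str| s.chars().count();
    println!("[{}]", pad_right("abc", 6, by_chars));
}
```

In a project that depends on this crate, a closure such as `|s: &str| UnicodeWidthStr::width(s)` could presumably be passed as `measure` instead of the stand-in, so that wide characters are counted as two columns.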

NOTE: The computed width values may not match the actual rendered column width. For example, the woman scientist emoji consists of a woman emoji, a zero-width joiner, and a microscope emoji. This crate treats such emoji ZWJ sequences as having the sum of the widths of their constituent parts:

use unicode_width::UnicodeWidthStr;

fn main() {
    assert_eq!(UnicodeWidthStr::width("👩"), 2); // Woman
    assert_eq!(UnicodeWidthStr::width("🔬"), 2); // Microscope
    assert_eq!(UnicodeWidthStr::width("👩‍🔬"), 4); // Woman scientist
}

Additionally, defective combining character sequences and nonstandard Korean jamo sequences may be rendered with a width different from the one this crate reports. (This is not an exhaustive list.)
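
To see where the value 4 for the woman scientist emoji comes from, it can help to list the Unicode scalar values that make up the sequence. The sketch below uses only the standard library. `scalar_values` is a hypothetical helper name and not part of this crate.

```rust
/// Returns the Unicode scalar values of `s`, formatted as "U+XXXX" strings.
/// (Hypothetical helper for illustration; not part of unicode-width.)
fn scalar_values(s: &str) -> Vec<String> {
    s.chars().map(|c| format!("U+{:04X}", c as u32)).collect()
}

fn main() {
    // Woman scientist, written with escapes so the parts are visible:
    // WOMAN (U+1F469), ZERO WIDTH JOINER (U+200D), MICROSCOPE (U+1F52C).
    let woman_scientist = "\u{1F469}\u{200D}\u{1F52C}";
    for value in scalar_values(woman_scientist) {
        println!("{}", value);
    }
}
```

The string holds three scalar values. Given the widths asserted in the example above (2 for the woman, 2 for the microscope, 4 for the whole sequence), the joiner contributes 0, even though a terminal or font that supports the sequence will typically draw it as a single glyph.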

crates.io

You can use this package in your project by adding the following to your Cargo.toml:

[dependencies]
unicode-width = "0.1.11"

Dependencies

~200KB