11 releases (breaking)

0.9.0 Mar 3, 2019
0.7.0 Feb 7, 2018
0.6.0 Sep 22, 2017
0.4.0 Jun 23, 2017

#387 in Internationalization (i18n)

Download history 187/week @ 2024-03-13 240/week @ 2024-03-20 216/week @ 2024-03-27 211/week @ 2024-04-03 253/week @ 2024-04-10 183/week @ 2024-04-17 216/week @ 2024-04-24 205/week @ 2024-05-01 230/week @ 2024-05-08 191/week @ 2024-05-15 214/week @ 2024-05-22 259/week @ 2024-05-29 169/week @ 2024-06-05 204/week @ 2024-06-12 187/week @ 2024-06-19 168/week @ 2024-06-26

759 downloads per month
Used in 21 crates (9 directly)

MIT/Apache

660KB
5.5K SLoC

UNIC — Unicode Character Database

Crates.io Documentation

This UNIC component provides access to character properties as defined in the Unicode® Standard Annex #44 - Unicode Character Database.

UCD is a UNIC super-crate, composed of smaller crates that provide data in specific areas, therefore, allowing access only to the data needed instead of forcing dependent crates to import all UCD data.

Crates

Here's a list of components (available or planned) for this super-crate:

  • version: The Unicode Version of UCD data.

  • common: Common properties, such as Alphabetic, White-Space, Control and Numeric.

  • age: Age property.

  • bidi: Bidirectional properties. (Hebrew, Arabic, ...)

  • block: Block properties.

  • case: Letter Case properties.

  • category: General_Category property.

  • hangul: Hangul Syllable Composition & Decomposition.

  • ident: Identifier properties.

  • name: Name property.

  • normal: Normalization properties.

  • segment: Segmentation properties.

  • ea-width: East Asian Width properties.

  • joining: Cursive joining properties. (Arabic, Syriac, ...)

  • numeric: Other character numeric properties.

  • script: Script properties.

See http://unicode.org/reports/tr44/#Property_List_Table for the complete list of properties defined in UCD. Eventually, all these properties will be available by under unic-ucd.

Dependencies