#text #unicode #character-property #character-database

unic-ucd

UNIC — Unicode Character Database

11 releases (breaking)

0.9.0 Mar 3, 2019
0.7.0 Feb 7, 2018
0.6.0 Sep 22, 2017
0.4.0 Jun 23, 2017

#12 in #character-property

Download history 340/week @ 2021-02-20 220/week @ 2021-02-27 177/week @ 2021-03-06 147/week @ 2021-03-13 229/week @ 2021-03-20 194/week @ 2021-03-27 106/week @ 2021-04-03 174/week @ 2021-04-10 406/week @ 2021-04-17 123/week @ 2021-04-24 279/week @ 2021-05-01 391/week @ 2021-05-08 245/week @ 2021-05-15 234/week @ 2021-05-22 160/week @ 2021-05-29 185/week @ 2021-06-05

967 downloads per month
Used in 12 crates (6 directly)

MIT/Apache

645KB
5.5K SLoC

UNIC — Unicode Character Database

Crates.io Documentation

This UNIC component provides access to character properties as defined in the Unicode® Standard Annex #44 - Unicode Character Database.

UCD is a UNIC super-crate, composed of smaller crates that provide data in specific areas, therefore, allowing access only to the data needed instead of forcing dependent crates to import all UCD data.

Crates

Here's a list of components (available or planned) for this super-crate:

  • version: The Unicode Version of UCD data.

  • common: Common properties, such as Alphabetic, White-Space, Control and Numeric.

  • age: Age property.

  • bidi: Bidirectional properties. (Hebrew, Arabic, ...)

  • block: Block properties.

  • case: Letter Case properties.

  • category: General_Category property.

  • hangul: Hangul Syllable Composition & Decomposition.

  • ident: Identifier properties.

  • name: Name property.

  • normal: Normalization properties.

  • segment: Segmentation properties.

  • ea-width: East Asian Width properties.

  • joining: Cursive joining properties. (Arabic, Syriac, ...)

  • numeric: Other character numeric properties.

  • script: Script properties.

See http://unicode.org/reports/tr44/#Property_List_Table for the complete list of properties defined in UCD. Eventually, all these properties will be available by under unic-ucd.

Dependencies