11 releases (breaking)
| Version | Released |
|---|---|
| 0.9.0 | Mar 3, 2019 |
| 0.7.0 | Feb 7, 2018 |
| 0.6.0 | Sep 22, 2017 |
| 0.4.0 | Jun 23, 2017 |
#2510 in Text processing
4,003 downloads per month
Used in 27 crates (9 directly)
660KB, 5.5K SLoC
UNIC — Unicode Character Database
This UNIC component provides access to character properties as defined in the Unicode® Standard Annex #44 - Unicode Character Database.
UCD is a UNIC super-crate composed of smaller crates, each providing data in a specific area. Dependent crates can therefore pull in only the data they need instead of importing all UCD data.
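To illustrate the super-crate idea, here is a minimal single-file sketch in which modules stand in for component crates and a facade module re-exports them. All names here (`version_component`, `hangul_component`, `ucd_facade`, `DATA_VERSION`, `is_syllable`) are hypothetical and are not the real unic-ucd API; the version tuple is a placeholder, not real UCD data.

```rust
// Hypothetical stand-in for a small component crate holding version data.
mod version_component {
    // Placeholder value for illustration only; not a real Unicode version.
    pub const DATA_VERSION: (u16, u16, u16) = (0, 0, 0);
}

// Hypothetical stand-in for a small component crate with Hangul helpers.
mod hangul_component {
    /// True if `c` lies in the precomposed Hangul Syllables range
    /// (U+AC00 through U+D7A3).
    pub fn is_syllable(c: char) -> bool {
        ('\u{AC00}'..='\u{D7A3}').contains(&c)
    }
}

// The "super-crate": it owns no data, it only re-exports the components.
mod ucd_facade {
    pub use super::hangul_component::is_syllable;
    pub use super::version_component::DATA_VERSION;
}

fn main() {
    // A consumer that needs only Hangul data uses that component directly.
    println!("direct: {}", hangul_component::is_syllable('\u{D55C}'));

    // A consumer that wants everything goes through the facade instead.
    println!("facade: {}", ucd_facade::is_syllable('a'));
    println!("version placeholder: {:?}", ucd_facade::DATA_VERSION);
}
```

In a real workspace each module would be its own crate, so depending on a single component avoids compiling the data tables of the others.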
Crates
Here's a list of components (available or planned) for this super-crate:
- version: The Unicode Version of UCD data.
- common: Common properties, such as Alphabetic, White-Space, Control and Numeric.
- age: Age property.
- bidi: Bidirectional properties. (Hebrew, Arabic, ...)
- block: Block properties.
- case: Letter Case properties.
- category: General_Category property.
- hangul: Hangul Syllable Composition & Decomposition.
- ident: Identifier properties.
- name: Name property.
- normal: Normalization properties.
- segment: Segmentation properties.
- ea-width: East Asian Width properties.
- joining: Cursive joining properties. (Arabic, Syriac, ...)
- numeric: Other character numeric properties.
- script: Script properties.
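As a rough illustration of what the "common" properties in the list above describe, the sketch below queries similar properties using only the Rust standard library. It does not use unic-ucd, and the `CommonProps` struct and `common_props` function are hypothetical names introduced for this example. Note that `char::is_control` and `char::is_numeric` are defined by the standard library in terms of General_Category values (Cc, and Nd/Nl/No respectively), which may differ in detail from the properties a UCD component exposes.

```rust
/// A few "common" character properties for one character,
/// computed with the standard library only (not with unic-ucd).
#[derive(Debug, PartialEq)]
struct CommonProps {
    alphabetic: bool,
    white_space: bool,
    control: bool,
    numeric: bool,
}

/// Collects the standard library's answers for `c`.
fn common_props(c: char) -> CommonProps {
    CommonProps {
        alphabetic: c.is_alphabetic(), // Unicode Alphabetic property
        white_space: c.is_whitespace(), // Unicode White_Space property
        control: c.is_control(),       // General_Category Cc
        numeric: c.is_numeric(),       // General_Category Nd, Nl or No
    }
}

fn main() {
    // Latin letter, Hebrew letter Alef, Arabic-Indic digit three,
    // space, and line feed.
    let samples = ['a', '\u{05D0}', '\u{0663}', ' ', '\n'];
    for &c in samples.iter() {
        println!("U+{:04X} -> {:?}", c as u32, common_props(c));
    }
}
```

A single character can carry several properties at once: the line feed, for example, is both a control character and white space.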
See http://unicode.org/reports/tr44/#Property_List_Table for the complete
list of properties defined in UCD. Eventually, all these properties will be
available under unic-ucd.