11 releases (breaking)

0.9.0 Mar 3, 2019
0.7.0 Feb 7, 2018
0.6.0 Sep 22, 2017
0.4.0 Jun 23, 2017

#369 in Internationalization (i18n)

Download history 213/week @ 2023-11-29 270/week @ 2023-12-06 340/week @ 2023-12-13 251/week @ 2023-12-20 195/week @ 2023-12-27 374/week @ 2024-01-03 319/week @ 2024-01-10 348/week @ 2024-01-17 228/week @ 2024-01-24 311/week @ 2024-01-31 282/week @ 2024-02-07 316/week @ 2024-02-14 254/week @ 2024-02-21 262/week @ 2024-02-28 338/week @ 2024-03-06 168/week @ 2024-03-13

1,108 downloads per month
Used in 21 crates (9 directly)

MIT/Apache

660KB
5.5K SLoC

UNIC — Unicode Character Database

Crates.io Documentation

This UNIC component provides access to character properties as defined in the Unicode® Standard Annex #44 - Unicode Character Database.

UCD is a UNIC super-crate, composed of smaller crates that provide data in specific areas, therefore, allowing access only to the data needed instead of forcing dependent crates to import all UCD data.

Crates

Here's a list of components (available or planned) for this super-crate:

  • version: The Unicode Version of UCD data.

  • common: Common properties, such as Alphabetic, White-Space, Control and Numeric.

  • age: Age property.

  • bidi: Bidirectional properties. (Hebrew, Arabic, ...)

  • block: Block properties.

  • case: Letter Case properties.

  • category: General_Category property.

  • hangul: Hangul Syllable Composition & Decomposition.

  • ident: Identifier properties.

  • name: Name property.

  • normal: Normalization properties.

  • segment: Segmentation properties.

  • ea-width: East Asian Width properties.

  • joining: Cursive joining properties. (Arabic, Syriac, ...)

  • numeric: Other character numeric properties.

  • script: Script properties.

See http://unicode.org/reports/tr44/#Property_List_Table for the complete list of properties defined in UCD. Eventually, all these properties will be available by under unic-ucd.

Dependencies