#unicode #collate #text

yanked unicode-collation

Unicode collation algorithm according to Technical Standard #10

Uses old Rust 2015

0.0.1 Mar 19, 2018

#449 in #unicode

MIT/Apache

15KB
298 lines

crates.io crates.io CircleCI

unicode-collation

A Unicode Collation Algorithm implemented according to Unicode Technical Standard #10.

Usage

Add this to your Cargo.toml:

[dependencies]
unicode-collation = "0.1"

Examples

Generate sort key for the given string:

extern crate unicode_collation;
use unicode_collation::{collate, CollationTable};

let table = CollationTable::from_text_file("data/allkeys.txt");
let key = collate("Hello!!!", &table);
assert_eq!(format!("{:?}", key), "[\
    1D7E 1D10 1DDD 1DDD 1E43 0261 0261 0261 | \
    0020 0020 0020 0020 0020 0020 0020 0020 | \
    0008 0002 0002 0002 0002 0002 0002 0002 |]");

License

Licensed under either of

at your option.

Contribution

Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.

Dependencies

~4MB
~102K SLoC