1 stable release
1.0.0 | May 21, 2019 |
---|
#1052 in Text processing
21,618 downloads per month
Used in 2 crates
14KB
270 lines
detone
An iterator adapter that takes an iterator over char
yielding a sequence of
char
s in Normalization Form C (this precondition is not checked!) and
yields char
s either such that tone marks that wouldn't otherwise fit into
windows-1258 are decomposed or such that text is decomposed into orthographic
units.
Use cases include preprocessing before encoding Vietnamese text into windows-1258 or converting precomposed Vietnamese text into a form that looks like it was written with the (non-IME) Vietnamese keyboard layout (e.g. for machine learning training or benchmarking purposes).
Licensing
Please see the file named COPYRIGHT.
Documentation
Generated API documentation is available online.
Release Notes
1.0.0
- Initial release.