2 stable releases

| Version | Released |
|---|---|
| 1.0.1 | Jul 4, 2024 |
| 1.0.0 | May 21, 2019 |
#290 in Text processing
20,754 downloads per month
Used in 2 crates
18KB
269 lines
detone
An iterator adapter that takes an iterator over `char` yielding a sequence of `char`s in Normalization Form C (this precondition is not checked!) and yields `char`s either such that tone marks that wouldn't otherwise fit into windows-1258 are decomposed or such that text is decomposed into orthographic units.
Use cases include preprocessing before encoding Vietnamese text into windows-1258 or converting precomposed Vietnamese text into a form that looks like it was written with the (non-IME) Vietnamese keyboard layout (e.g. for machine learning training or benchmarking purposes).
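To make the idea of a tone-decomposing iterator adapter concrete, here is a minimal, self-contained sketch. It does not use this crate and is not its implementation or API: the names `ToneSplit` and `split_tone` are hypothetical, and the five-entry mapping table is an illustrative assumption that covers only a handful of letters. It shows the general shape only: wrap an iterator over `char`, emit a base letter, and buffer the combining tone mark so it is yielded on the next call.

```rust
/// Hypothetical mini-table (illustration only, far from complete): maps a few
/// precomposed Vietnamese letters to (base letter, combining tone mark).
fn split_tone(c: char) -> Option<(char, char)> {
    match c {
        '\u{00E3}' => Some(('a', '\u{0303}')),        // a with tilde -> a + combining tilde
        '\u{1EA1}' => Some(('a', '\u{0323}')),        // a with dot below -> a + combining dot below
        '\u{1EA3}' => Some(('a', '\u{0309}')),        // a with hook above -> a + combining hook above
        '\u{1EBF}' => Some(('\u{00EA}', '\u{0301}')), // e circumflex acute -> e circumflex + combining acute
        '\u{1EC7}' => Some(('\u{00EA}', '\u{0323}')), // e circumflex dot below -> e circumflex + combining dot below
        _ => None,
    }
}

/// Hypothetical adapter: wraps any iterator over `char` and yields the
/// decomposed sequence lazily, one `char` at a time.
pub struct ToneSplit<I: Iterator<Item = char>> {
    inner: I,
    /// Tone mark waiting to be emitted after its base letter.
    pending: Option<char>,
}

impl<I: Iterator<Item = char>> ToneSplit<I> {
    pub fn new(inner: I) -> Self {
        ToneSplit { inner, pending: None }
    }
}

impl<I: Iterator<Item = char>> Iterator for ToneSplit<I> {
    type Item = char;

    fn next(&mut self) -> Option<char> {
        // Emit a buffered tone mark before pulling more input.
        if let Some(mark) = self.pending.take() {
            return Some(mark);
        }
        let c = self.inner.next()?;
        match split_tone(c) {
            Some((base, mark)) => {
                self.pending = Some(mark);
                Some(base)
            }
            // Characters outside the table pass through unchanged.
            None => Some(c),
        }
    }
}
```

With this sketch, `ToneSplit::new("Vi\u{1EC7}t".chars()).collect::<String>()` yields `V`, `i`, `ê` (U+00EA), the combining dot below (U+0323), and `t`. For real use, consult the crate's generated API documentation rather than this sketch.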
Licensing
Please see the file named COPYRIGHT.
Documentation
Generated API documentation is available online.
MSRV
Rust 1.60 to use, 1.67 to run tests. Pin version 1.0.0 of this crate if you need an even lower MSRV; version 1.0.1 contains no non-test code changes.
Release Notes
1.0.1
- Updated metadata, internal documentation, and the dev dependency.
- No non-test code changes.
1.0.0
- Initial release.