3 releases (stable)
|1.1.0||Dec 30, 2021|
|1.0.0||Apr 28, 2021|
|0.0.0||Apr 28, 2021|
#236 in #unicode
Implementation of the WTF-8 encoding.
WTF-8 is a hack intended to be used internally in self-contained systems with components that need to support potentially ill-formed UTF-16 for legacy reasons.
Any WTF-8 data must be converted to a Unicode encoding at the system’s boundary before being emitted. UTF-8 is recommended. WTF-8 must not be used to represent text in a file format or for transmission over the Internet.
In particular, the Encoding Standard [ENCODING] defines UTF-8 and other encodings for the Web. There is no and will not be any encoding label [ENCODING] or IANA charset alias [CHARSETS] for WTF-8.
Depends on the standard library’s
alloc crate but not
Wtf8Buf- Similar to
String, provides type-safe WTF-8 strings.
CodePoint- Similar to
char, provides type-safe Unicode code points.
- Lossless conversion from potentially ill-formed UTF-16 to
CodePointiterator and from
Wtf8Buf, and from
- Conversion from
String, potentially lossy.
Licensed under either of
- Apache License, Version 2.0 (LICENSE-APACHE or http://www.apache.org/licenses/LICENSE-2.0)
- MIT license (LICENSE-MIT or http://opensource.org/licenses/MIT)
at your option.
Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in the work by you, as defined in the Apache-2.0 license, shall be dual licensed as above, without any additional terms or conditions.