#utf-8 #decoding #byte #individual #characters #applications #decode

rut

A small UTF-8 parsing library for applications that need to parse individual chars

10 unstable releases (3 breaking)

0.4.2 Dec 22, 2019
0.4.1 Dec 16, 2019
0.3.5 Dec 9, 2019
0.2.1 Dec 8, 2019
0.1.0 Dec 6, 2019

#1790 in Text processing

31 downloads per month

MPL-2.0 license

18KB
237 lines

Rut

Rut is a small UTF-8 decoding library for applications that need to decode individual characters.
It provides a bytewise decoder, and functions for decoding byte slices.

It is completely no_std and should provide good performance.[citation needed]

Conformance

Rut is fully conformant to the specifications and restrictions of the Unicode standard.
Additionally, it follows W3C's standard for UTF-8 decoding with regards to error signalling.

Testing

Tests are largely not implemented yet (aside from documentation examples).
However, Rut has been extensively fuzzed and passes this stress test for UTF-8 decoders.

As Seen on TV!

Rut began life, and is still used in, Termiku, a terminal emulator written in Rust.

No runtime deps