6 releases (breaking)

0.5.0 Nov 18, 2020
0.4.1 Jul 2, 2020
0.3.0 Jul 2, 2020
0.2.0 Jan 1, 2019
0.1.0 Dec 31, 2018

#835 in Text processing

197 downloads per month
Used in 7 crates (4 directly)

MIT license

23KB
532 lines

MUTF-8

Usage

use std::borrow::Cow;

fn main() {
    let output: Cow<[u8]> = mutf8::utf8_to_mutf8("Hello, \0World");

    // `output` contains no NUL bytes.
}

There are also MString and mstr structs.
These are the counterparts to String and str in the standard library.

fn main() {
    let data = mstr::from_utf8(b"\0");
    assert_eq!(data.len(), 2);
}

About

This crate allows converting UTF-8 to and from MUTF-8.

Some data formats, such as the JVM class file, use an altered UTF-8 encoding.
The variant in question is MUTF-8 (Modified UTF-8).

It encodes the NUL character as the two bytes 0xC0 0x80, so the encoded data never contains a NUL byte.
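
To make the encoding rule concrete, here is a minimal standalone sketch of an encoder. It does not use this crate, and `encode_mutf8` is a hypothetical name chosen for illustration, not part of the crate's API. It assumes the usual Modified UTF-8 rules: NUL becomes 0xC0 0x80, and characters above U+FFFF are written as a UTF-16 surrogate pair with each half stored as a three-byte sequence.

```rust
/// Hypothetical helper (not this crate's API): encode a string as MUTF-8.
fn encode_mutf8(input: &str) -> Vec<u8> {
    let mut out = Vec::with_capacity(input.len());
    for ch in input.chars() {
        match ch as u32 {
            // NUL is written as the two-byte form 0xC0 0x80.
            0 => out.extend_from_slice(&[0xC0, 0x80]),
            // Everything else up to U+FFFF is identical to standard UTF-8.
            0x0001..=0xFFFF => {
                let mut buf = [0u8; 4];
                out.extend_from_slice(ch.encode_utf8(&mut buf).as_bytes());
            }
            // Supplementary characters: split into a UTF-16 surrogate pair,
            // then store each surrogate as its own three-byte sequence.
            _ => {
                let mut units = [0u16; 2];
                for &unit in ch.encode_utf16(&mut units).iter() {
                    let u = u32::from(unit);
                    out.push(0xE0 | (u >> 12) as u8);
                    out.push(0x80 | ((u >> 6) & 0x3F) as u8);
                    out.push(0x80 | (u & 0x3F) as u8);
                }
            }
        }
    }
    out
}
```

With this sketch, `encode_mutf8("\0")` yields the two bytes 0xC0 0x80, which matches the length of 2 in the mstr example under Usage.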

WIP

The conversion algorithm itself is done and usable.
It works as well as any other implementation.

I still call this crate WIP because of the two string structs, MString and mstr.
I'm not happy with them.

I do use this crate for a couple of projects, but none of them make use of the structs themselves.

I typically use this crate as just a jump from a [u8] to a Cow<str>.
So, until I work out where I want to go with this crate, it's probably going to stay like this.
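
For readers curious what a [u8] to Cow<str> conversion can look like, here is a rough standalone sketch. It is not this crate's implementation; `decode_mutf8` is a hypothetical name, and returning `None` on malformed input is an assumption made for brevity. The idea is to borrow the input when it is already valid UTF-8 and to allocate only when an encoded NUL or a surrogate pair has to be rewritten.

```rust
use std::borrow::Cow;

/// Hypothetical helper (not this crate's API): decode MUTF-8 bytes.
/// Returns `None` if the input is malformed.
fn decode_mutf8(input: &[u8]) -> Option<Cow<'_, str>> {
    // Fast path: input that is already valid UTF-8 is borrowed unchanged.
    if let Ok(s) = std::str::from_utf8(input) {
        return Some(Cow::Borrowed(s));
    }

    let mut out = String::with_capacity(input.len());
    let mut i = 0;
    while i < input.len() {
        let rest = &input[i..];
        match rest {
            // 0xC0 0x80 is the two-byte encoding of NUL.
            &[0xC0, 0x80, ..] => {
                out.push('\0');
                i += 2;
            }
            // A surrogate pair, each half stored as three bytes.
            &[0xED, h1 @ 0xA0..=0xAF, h2 @ 0x80..=0xBF,
              0xED, l1 @ 0xB0..=0xBF, l2 @ 0x80..=0xBF, ..] => {
                let hi = (u32::from(h1 & 0x0F) << 6) | u32::from(h2 & 0x3F);
                let lo = (u32::from(l1 & 0x0F) << 6) | u32::from(l2 & 0x3F);
                out.push(char::from_u32(0x10000 + (hi << 10) + lo)?);
                i += 6;
            }
            // Anything else must be an ordinary UTF-8 sequence.
            _ => {
                let len = match rest[0] {
                    0x00..=0x7F => 1,
                    0xC0..=0xDF => 2,
                    0xE0..=0xEF => 3,
                    0xF0..=0xF7 => 4, // accepted leniently by this sketch
                    _ => return None,
                };
                let chunk = rest.get(..len)?;
                out.push_str(std::str::from_utf8(chunk).ok()?);
                i += len;
            }
        }
    }
    Some(Cow::Owned(out))
}
```

Plain ASCII input takes the borrowed path and costs no allocation; only input containing 0xC0 0x80 or encoded surrogates produces an owned String.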

Dependencies

~180KB