#string #unicode #unicode-characters #validation #simd #utf-8

no-std simdutf

Unicode validation and transcoding at billions of characters per second

13 releases

0.4.16 Jan 1, 2024
0.4.15 Nov 26, 2023
0.4.13 Oct 29, 2023
0.4.8 Jun 5, 2023
0.3.0 Jul 27, 2022

#931 in Text processing

Download history 25/week @ 2024-01-01 22/week @ 2024-01-08 46/week @ 2024-02-19 35/week @ 2024-02-26 603/week @ 2024-03-11 3/week @ 2024-03-18 199/week @ 2024-03-25 33/week @ 2024-04-01 6/week @ 2024-04-08 102/week @ 2024-04-15

340 downloads per month

MIT license

1.5MB
25K SLoC

C++ 25K SLoC // 0.2% comments Rust 544 SLoC // 0.0% comments

simdutf

Latest Version Documentation License

Unicode validation and transcoding at billions of characters per second.

This crate is the Rust binding of simdutf.

Documentation: https://docs.rs/simdutf


lib.rs:

Unicode validation and transcoding at billions of characters per second.

This crate is the Rust binding of simdutf.

Compilation

This crate works out of the box as long as you have a C++11-compatible toolchain installed correctly.

simdutf links C++ standard library, which adds a dynamic linking dependency.

For more details, see simdutf documentation and cc documentation.

Here is an example for local benchmark:

export RUSTFLAGS='-C target-cpu=native'
export CXXFLAGS='-march=native'
cargo build --release

Dependencies