#string #unicode #unicode-characters #validation #simd #utf-8

no-std simdutf

Unicode validation and transcoding at billions of characters per second

14 releases

0.4.17 May 2, 2024
0.4.16 Jan 1, 2024
0.4.15 Nov 26, 2023
0.4.8 Jun 5, 2023
0.3.0 Jul 27, 2022

#974 in Text processing

Download history 46/week @ 2024-02-19 35/week @ 2024-02-26 603/week @ 2024-03-11 3/week @ 2024-03-18 199/week @ 2024-03-25 33/week @ 2024-04-01 6/week @ 2024-04-08 102/week @ 2024-04-15 5/week @ 2024-04-22 157/week @ 2024-04-29 12/week @ 2024-05-06

280 downloads per month

MIT license

1.5MB
30K SLoC

C++ 29K SLoC // 0.2% comments Rust 544 SLoC // 0.0% comments

simdutf

Latest Version Documentation License

Unicode validation and transcoding at billions of characters per second.

This crate is the Rust binding of simdutf.

Documentation: https://docs.rs/simdutf


lib.rs:

Unicode validation and transcoding at billions of characters per second.

This crate is the Rust binding of simdutf.

Compilation

This crate works out of the box as long as you have a C++11-compatible toolchain installed correctly.

simdutf links C++ standard library, which adds a dynamic linking dependency.

For more details, see simdutf documentation and cc documentation.

Here is an example for local benchmark:

export RUSTFLAGS='-C target-cpu=native'
export CXXFLAGS='-march=native'
cargo build --release

Dependencies