#arabic #language #diacritics #dari #pashto

tashkil

A lightweight library for removing Arabic diacritics

1 unstable release

0.1.0 Oct 16, 2022


MIT license

4KB

Tashkil

A lightweight Rust library for removing Arabic diacritics (تَشْكِيل)

This library exposes a single function, tashkil::remove(), which takes a &str and strips all diacritics that the Unicode Standard defines for the Arabic alphabet and its variants.
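To make the idea concrete, here is a minimal, standard-library-only sketch of this kind of diacritic stripping. It is not the crate's actual implementation: the names is_arabic_diacritic and remove_diacritics are hypothetical, and the code point ranges cover only the most common marks (U+064B to U+0652, plus U+0670), which is an assumption made for illustration. The crate itself may cover more of the Arabic blocks.

```rust
/// Hypothetical helper (not part of the tashkil crate): returns true for a
/// small, assumed set of Arabic combining marks.
fn is_arabic_diacritic(c: char) -> bool {
    matches!(
        c,
        '\u{064B}'..='\u{0652}' // fathatan through sukun
        | '\u{0670}'            // superscript alef
    )
}

/// Hypothetical stand-in for a diacritic-removal function: copies every
/// character of `input` except those matched by `is_arabic_diacritic`.
fn remove_diacritics(input: &str) -> String {
    input.chars().filter(|c| !is_arabic_diacritic(*c)).collect()
}

fn main() {
    // "tashkil" written with fatha, sukun and kasra, spelled out as escapes
    // so the combining marks are visible in the source.
    let vocalized = "\u{062A}\u{064E}\u{0634}\u{0652}\u{0643}\u{0650}\u{064A}\u{0644}";
    let plain = remove_diacritics(vocalized);
    println!("{} -> {}", vocalized, plain);
}
```

Filtering on chars rather than bytes keeps the result valid UTF-8, and base letters pass through untouched because only the listed combining marks are dropped.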

I hope this library can be used to improve search results in Meilisearch for languages written in the Arabic alphabet and its variants, much as niqqud has been used to improve search results for Hebrew.

No runtime deps