5 unstable releases

new 0.5.1 Apr 26, 2025
0.5.0 Apr 22, 2025
0.4.0 Apr 21, 2025
0.3.1 Apr 17, 2025
0.3.0 Apr 8, 2025

#2614 in Procedural macros

Download history 107/week @ 2025-04-07 129/week @ 2025-04-14 259/week @ 2025-04-21

495 downloads per month
Used in 3 crates (via alphabet_detector)

MIT/Apache

28KB
627 lines

Alphabet Detector

Crate API

Detects 387 alphabets in 170 scripts

one spoken language can be written in multiple scripts, so it will be detected as a different alphabet/language

look at the alphabet.rs to understand what languages have already defined alphabets. Some of them need validation

Separates words in text (from iterator CharIndices), and detects language of words by used alphabets (chars).

Warning: can return words with chars from the Unicode private area (for example Lingala, Nuer or Yoruba languages)

Dependencies

~1–1.6MB
~27K SLoC