
is-vowel

Heuristically test whether a character is a vowel letter

1 unstable release

0.1.0 Apr 11, 2021


MIT license


is-vowel: heuristically test whether a character is a vowel letter

Bart Massey 2021 (version 0.1.0)

Test for vowels in Romance languages.

Deciding whether some grapheme is a "vowel" (represents a "vowel sound") is language dependent, and the author is not aware of any standardization efforts for vowel identification.

Even for the Romance languages, the situation is a bit complicated. The basic Romance vowels are "a", "e", "i", "o", "u", but that list is incomplete. For example:

  • Uppercase versions are also vowels.
  • Accented versions are also vowels.
  • "Sometimes 'y', sometimes 'w'." These letters are consonants in some situations, vowels in others.

This code attempts to provide reasonable heuristic answers as to the Romance-language-"vowelness" of a Unicode codepoint.
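The heuristics listed above (case, accents, and the "sometimes" letters) can be illustrated with a minimal standard-library sketch. This is not the crate's implementation or API: the names `base_letter` and `looks_like_romance_vowel`, the `sometimes` flag, and the small hand-written accent table are all assumptions made for illustration. The table covers only a few common Latin-1 accented letters.

```rust
/// Hypothetical helper (not part of is-vowel): map a few common accented
/// lowercase letters to their unaccented base letter. Anything not in
/// this small table is returned unchanged.
fn base_letter(c: char) -> char {
    match c {
        'à' | 'á' | 'â' | 'ã' | 'ä' | 'å' => 'a',
        'è' | 'é' | 'ê' | 'ë' => 'e',
        'ì' | 'í' | 'î' | 'ï' => 'i',
        'ò' | 'ó' | 'ô' | 'õ' | 'ö' => 'o',
        'ù' | 'ú' | 'û' | 'ü' => 'u',
        'ý' | 'ÿ' => 'y',
        other => other,
    }
}

/// Hypothetical heuristic (not the crate's API): decide whether `c` looks
/// like a Romance-language vowel letter. The `sometimes` flag says how to
/// treat the "sometimes" letters 'y' and 'w'.
fn looks_like_romance_vowel(c: char, sometimes: bool) -> bool {
    // Fold case first. A lowercase mapping may expand to several chars;
    // this sketch only looks at the first one.
    let lower = c.to_lowercase().next().unwrap_or(c);
    match base_letter(lower) {
        'a' | 'e' | 'i' | 'o' | 'u' => true,
        'y' | 'w' => sometimes,
        _ => false,
    }
}
```

Under these assumptions, `looks_like_romance_vowel('É', false)` is true, while `'y'` is a vowel only when `sometimes` is true. Letters outside the table, such as "æ", are reported as non-vowels. A real implementation would more likely rely on Unicode decomposition than on a hand-written table.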

No attempt is made here to deal with non-Romance languages, even though some non-Romance vowel letters are borrowed in Romance languages: for example, "æ" and "Æ" are used in some British English. It would be great to use vowel letter tables for a variety of languages, but, surprisingly, the author has so far been unable to locate such tables.

License

This crate is made available under the "MIT license". Please see the file LICENSE in this distribution for license terms.

Acknowledgments

Thanks to the cargo-readme crate for generation of this README.
