1 unstable release

0.1.0 Sep 14, 2022

#1707 in Text processing

41 downloads per month
Used in homoglyphs

MIT/Apache

42KB
745 lines

Homoglyphs (CLI)

Generate all homoglyphs for a given input sentence.

An homoglyph is one of two or more graphemes, characters, or glyphs with shapes that appear identical or very similar. The designation is also applied to sequences of characters sharing these properties.

The Unicode character set contains many strongly homoglyphic characters known as "confusables". These present security risks in a variety of situations. One might deliberately spoof a domain name by replacing one character with its homoglyph, thus creating a second domain name, not readily distinguishable from the first, that can be exploited in phishing. - wikipedia

Install

cargo install homoglyphs

Examples

Generate all possible homoglyphs with all the confusable characters

homoglyphs -a "rust"

Generate homoglyphs with default settings (100 homoglyphs, 8 confusables for each characters of input sentence)

homoglyphs "rust is nice"

Generate 'n' homoglyphs

homoglyphs -n 3000 "rust is awesome"

Generate homoglyphs possible for the given number of confusable

homoglyphs -n 3000 "rust is incredible"

Try to generate a maximum of 'n' homoglyphs with 'c' confusable for each characters of input sentence

homoglyphs -n 500 -c 6 "rust is the best"

Print output in different format

In array with ascii, empty, markdown, modern and psql styles

homoglyphs -n 25 "rust is the best" -f markdown

In json or just raw

homoglyphs -n 25 "rust is the best" -f json

Dependencies

~17–50MB
~691K SLoC