Text processing

regex-syntax

A regular expression parser

v0.8.3 14.8M no-std #regex #regex-parser #intermediate-representation #ast #hir #expression-parser #regular
regex-automata

Automata construction and matching using regular expressions

v0.4.6 13.2M no-std #regex #dfa #nfa #automata #automaton
aho-corasick

Fast multiple substring searching

v1.1.3 11.8M no-std #state-machine #pattern #string-pattern #pattern-matching #string-search #case-insensitive #search-pattern
regex

regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.

v1.10.4 10.6M no-std #regular-expression #string #string-search #match #automata #engine #finite
idna

IDNA (Internationalizing Domain Names in Applications) and Punycode

v0.5.0 9.3M no-std #domain-name #unicode #processing #internationalized #system #url #client
unicode-normalization

functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15

v0.1.23 7.8M no-std #unicode #decomposition #normalization #unicode-characters #recomposition #text
percent-encoding

Percent encoding and decoding

v2.3.1 7.6M no-std #codec #urlencode #url #encoded-string #percent #decoding #set
unicode-bidi

Unicode Bidirectional Algorithm

v0.3.15 7.2M no-std #unicode #unicode-characters #rtl #layout #unicode-text #bidi #text-layout
unicode-width

Determine displayed width of char and str types according to Unicode Standard Annex #11 rules

v0.1.12 5.9M no-std #unicode-characters #unicode #width #text #no-alloc
textwrap

word wrapping, indenting, and dedenting strings. Has optional support for Unicode and emojis as well as machine hyphenation.

v0.16.1 4.6M #wrap #formatting #typesetting #text-formatting #hyphenation #unicode-text #text
unicode-segmentation

Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules

v1.11.0 4.4M no-std #unicode #grapheme #word #boundary #unicode-text #text
convert_case

Convert strings into any case

v0.6.0 4.2M #convert-string #case #string #casing #snake-case
unicode-xid

Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31

v0.2.4 4.1M no-std #unicode-characters #unicode #xid #text
matchers

Regex matching on character and byte streams

v0.1.0 3.8M #regex #pattern #match #streaming #byte-array #byte-stream #debugging
ident_case

applying case rules to Rust identifiers

v1.0.1 3.8M #identifier #snake-case #rename #variant #rules #kebab-case #pascal-case
bstr

A string type that is not required to be valid UTF-8

v1.9.1 3.7M no-std #byte-string #byte #string
unicase

A case-insensitive wrapper around strings

v2.7.0 3.3M no-std #case-insensitive #case #lower-case #case-folding #lowercase #no-std
encoding_rs

A Gecko-oriented implementation of the Encoding Standard

v0.8.34 5.1M no-std #character-encoding #encoding #codec #unicode #charset #web
unindent

Remove a column of leading whitespace from a string

v0.2.3 2.2M #string #string-literal #literals #multiline #heredoc #nowdoc #compile-time
indoc

Indented document literals

v2.0.5 3.3M macro no-std #string-literal #string #literals #multiline #heredoc #nowdoc #no-alloc
diff

An LCS based slice and string diffing implementation

v0.1.13 2.0M #lcs #diffing #slice #string #line
ucd-trie

A trie for storing Unicode codepoint sets and maps

v0.1.6 2.7M no-std #unicode-characters #unicode #trie #character #character-set #code-point #codepoint
fancy-regex

regexes, supporting a relatively rich set of features, including backreferences and look-around

v0.13.0 1.8M no-std #regex #nfa #expression #fancy #backreferences #match #pattern-matching
difflib

Port of Python's difflib library to Rust

v0.4.0 1.6M #diff #text #python #port #comparing #unified-diff #sequences
unicode_categories

Query Unicode category membership for chars

v0.1.1 1.2M #unicode-characters #unicode #character #category #extension #member #querying
similar

A diff library for Rust

v2.5.0 1.2M #diff #difference #compare #change #patience #high-level
finl_unicode

handling Unicode functionality for finl (categories and grapheme segmentation)

v1.2.0 1.2M #unicode-characters #unicode #grapheme #segmentation
ascii

ASCII-only equivalents to char, str and String

v1.1.0 1.2M no-std #string #ascii-text #character #equivalents #standard #type #conversion
indenter

A formatter wrapper that indents the text, designed for error display impls

v0.3.3 1.1M no-std #error #formatter #error-message #display #fmt
const_format

Compile-time string formatting

v0.2.32 1.0M no-std #string-formatting #format #concat #no-std
widestring

wide string Rust library for converting to and from wide strings, such as those often used in Windows API or other FFI libraries. Both u16 and u32 string types are provided, including support for UTF-16 and UTF-32…

v1.1.0 1.0M no-std #utf-16 #utf-32 #wide #win32 #string
pulldown-cmark

A pull parser for CommonMark

v0.11.0 979K bin+lib #markdown-parser #pull-parser #markdown #common-mark #markdown-text #html-rendering
cesu8

Convert to and from CESU-8 encoding (similar to UTF-8)

v1.1.0 872K #character-encoding #utf-8 #data-encoding #utf-16 #convert #plane #multilingual
Inflector

Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…

v0.11.4 864K #snake #inflection #pluralize #camel
regex-lite

A lightweight regex engine that optimizes for binary size and compilation time

v0.1.5 775K no-std #regex #size #capture #binary #group #string #engine
deunicode

Convert Unicode strings to pure ASCII by intelligently transliterating them. Supports Emoji and Chinese.

v1.6.0 674K no-std #unicode #unicode-characters #ascii #emoji #transliteration #string-conversion #unidecode
utf-8

Incremental, zero-copy UTF-8 decoding with error handling

v0.7.6 1.9M #decoding #zero-copy #incremental #error
const_format_proc_macros

detail of the const_format crate

v0.2.32 1.0M macro no-std #format #string-formatting #concat #no-std
arrow-row

Arrow row format

v51.0.0 602K #arrow #apache-arrow #row #format #array #sorting
uncased

Case-preserving, ASCII case-insensitive, no_std string types

v0.9.10 606K no-std #case-insensitive #ascii #ascii-string #case-preserving #no-std
ascii-canvas

canvas for drawing lines and styled text and emitting to the terminal

v3.0.0 470K #canvas #text #line #ascii #ansi #draw #terminal-text
unicode-id

Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31

v0.3.4 429K no-std #unicode-characters #unicode #tr31 #text
gix-utils

gitoxide utilities that don’t need feature toggles

v0.1.12 467K #git #utilities #version-control #toggles #gitoxide #feature #don-t
unic-char-property

UNIC — Unicode Character Tools — Character Property taxonomy, contracts and build macros

v0.9.0 428K #unicode #character-property #unicode-text
slug

Convert a unicode string to a slug

v0.1.5 419K #slugify #id #convert-string #generate #slugs
shell-escape

Escape characters that may have a special meaning in a shell

v0.1.5 416K #shell #escaping #characters #meaning #special
compact_str

A memory efficient string type that transparently stores strings on the stack, when possible

v0.8.0-beta 613K no-std #string #byte-string #memory #compact #mutable #stack-allocated #heap-allocated
tendril

Compact buffer/string type for zero-copy parsing

v0.4.3 509K #reference-counting #string #zero-copy #parser #byte #compact #thread-local
onig_sys

onig_sys crate contains raw rust bindings to the oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma…

v69.8.1 305K sys #safe-wrapper #regex #bindings #oniguruma #unsafe-bindings #onig #set
onig

Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.

v6.4.0 304K #regex #character-encoding #bindings #syntaxes #oniguruma #regular #expression
pulldown-cmark-to-cmark

Convert pulldown-cmark Events back to the string they were parsed from

v13.0.0 293K #markdown #common-mark #converter #render #test-suite
diffy

Tools for finding and manipulating differences between files

v0.3.0 237K #diff #patch #merge #version-control #system
strip-ansi-escapes

Strip ANSI escape sequences from byte streams

v0.2.0 384K #strip-ansi #escaping #ansi #byte-stream #terminal #byte-sequences #escapes
const-str

compile-time string operations

v0.5.7 225K no-std #string #const #proc-macro
unicode-script

exposes the Unicode Script and Script_Extension properties from UAX #24

v0.5.6 209K #script #unicode #language #scripting-language #unicode-text #text
levenshtein_automata

Creates Levenshtein Automata in an efficient manner

v0.2.1 211K #levenshtein #automata #edit-distance #fuzzy
kstring

Key String: optimized for map keys

v2.0.0 306K no-std #key-string #string #serde #serialization #name
regress

A regular expression engine targeting EcmaScript syntax

v0.9.1 197K no-std #regex #javascript #regular #expression #engine #syntax #assertions
text-size

Newtypes for text offsets

v1.1.1 189K #text #offset #newtype #range #u32 #wrapper #type
tabled

An easy-to-use library for pretty-printing tables of Rust structs and enums

v0.15.0 272K no-std #table #pretty-table #terminal #format #print #macro
lazy-regex

lazy static regular expressions checked at compile time

v3.1.0 180K no-std #regex #lazy-evaluation #static #compile-time #macro #error-message
encoding-index-tradchinese

Index tables for traditional Chinese character encodings

v1.20141219.5 233K #character-encoding #chinese #index #table #encode #standard
inflections

High performance inflection transformation library for changing properties of words like the case

v1.1.1 180K #case #inflect #camel
encoding-index-singlebyte

Index tables for various single-byte character encodings

v1.20141219.5 233K #character-encoding #index #table #iso #encode #single-byte #string
encoding-index-japanese

Index tables for Japanese character encodings

v1.20141219.5 233K #character-encoding #japanese #table #string #index #standard
encoding-index-korean

Index tables for Korean character encodings

v1.20141219.5 233K #character-encoding #korean #index #table #standard
encoding-index-simpchinese

Index tables for simplified Chinese character encodings

v1.20141219.5 233K #character-encoding #chinese #index #table #standard #interface
indent_write

Write adapters to add line indentation

v2.2.0 175K no-std #io-write #indentation #adapter #line #display #insert #wrappers
ascii_utils

handle ASCII characters

v0.9.3 348K #ascii #characters #file
unicode-normalization-alignments

functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15

v0.1.12 154K #unicode-normalization #unicode #normalization #unicode-characters #decomposition #unicode-text #recomposition
fuzzy-matcher

Fuzzy Matching Library

v0.3.7 153K #fuzzy #text-search #match #search #text
difference

text diffing and assertion library

v2.0.0 321K bin+lib #text #diff #compare #change #assert
newline-converter

Newline byte converter library

v0.3.0 206K #newline #line-break #conversion #convert #sequence #line-breaks
htmlescape

HTML entity encoding and decoding

v0.3.1 236K #codec #html #decoding #entity #string #error #character
roff

ROFF (man page format) generation library

v0.2.1 139K #manpage #man #bold #roman #italic #control #page
rustybuzz

A complete harfbuzz shaping algorithm port to Rust

v0.14.0 127K no-std #shaping #true-type #opentype #truetype #text
ucd-util

A small utility library for working with the Unicode character database

v0.2.1 121K #unicode-characters #unicode #character #property #properties #database #symbols
encoding

Character encoding support for Rust

v0.2.33 235K #character-encoding #unicode #charset #whatwg #encode #iso #standard
mdbook

Creates a book from markdown files

v0.4.40 114K bin+lib #book #markdown #rust-book #gitbook #serve
unescape

Unescapes strings with escape sequences written out as literal characters

v0.1.0 123K #escaping #string #unicode #escaped
unescaper

Unescape strings with escape sequences written out as literal characters

v0.1.4 111K #escaping #string #unescape
tokenizers

today's most used tokenizers, with a focus on performance and versatility

v0.19.1 101K #nlp #tokenizer #bpe #huggingface #word-piece #tokenize #text-input
unicode-security

Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules

v0.1.1 102K #unicode #security #unicode-text #text
unicode-ccc

Unicode Canonical Combining Class detection

v0.2.0 102K #unicode #class #canonical #combining #detection #ccc
utf16_lit

macro_rules to make utf-16 literals

v2.0.2 90K #utf-16 #macro-rules #literals #utf-8 #lit
pretty

Wadler-style pretty-printing combinators in Rust

v0.12.3 154K #pretty-printing #console #functional #functional-programming
unic-ucd-ident

UNIC — Unicode Character Database — Identifier Properties

v0.9.0 91K #unic #unicode #unicode-normalization #unicode-text #character-property
unicode-bidi-mirroring

Unicode Bidi Mirroring property detection

v0.2.0 90K #unicode #property #detection #mirroring #bidi
substring

method for string types

v1.4.5 89K no-std #string #slice #str #substr
prettydiff

Side-by-side diff for two files

v0.7.0 86K #diff #compare #change #text
pulldown-cmark-escape

An escape library for HTML created in the pulldown-cmark project

v0.11.0 85K #escaping #markdown-html #html #common-mark #markdown #html-string #escapes
yeslogic-fontconfig-sys

Raw bindings to Fontconfig without a vendored C library

v5.0.0 85K sys #font #bindings #fontconfig #sys
cruet

Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…

v0.14.0 83K #snake #inflection #camel #pluralize #inflector
ammonia

HTML Sanitization

v4.0.0 119K #html #security #sanitization #xss #html-parser #web-page
any_ascii

Unicode to ASCII transliteration

v0.3.2 78K no-std bin+lib #unicode-characters #ascii #unicode #transliteration #emoji #unicode-normalization #unidecode
unidecode

pure ASCII transliterations of Unicode strings

v0.3.0 170K #unicode #ascii #transliteration #unidecoder
grep-searcher

Fast line oriented regex searching as a library

v0.1.13 80K #regex #binary-data #grep #pattern #search-pattern #search #memory-map
pad

padding strings at runtime

v0.1.6 72K #string #run-time #padding #width #cases #stdlib #most
utf8_iter

Iterator by char over potentially-invalid UTF-8 in &[u8]

v1.0.4 64K #utf-8 #iterator #unicode #encoding
byteyarn

hyper-compact strings

v0.5.1 63K #byte-string #string #string-representation #binary #text
charset

Thunderbird-compatible character encoding decoding for email

v0.1.3 99K #character-encoding #codec #encoding #email #utf-7 #unicode
tabwriter

Elastic tabstops

v1.4.0 61K #elastic #white-space #table #tabs #aligned #alignment #whitespace
case

A set of letter case string helpers

v1.0.0 60K #snake-case #camel-case #string #snake #ascii #alphabet #ascii-text
glyph_brush_layout

Text layout for ab_glyph

v0.2.3 60K #layout #text-layout #glyph #font #font-rendering #line #positioning
font-types

Scalar types used in fonts

v0.5.3 59K #font #codec #type #decoding #byte #scalar #glyph
lexical-sort

Sort Unicode strings lexically

v0.3.1 58K no-std #sorting #unicode #transliteration #unicode-characters #sort #lexicographical #no-std
dwrote

Lightweight binding to DirectWrite

v0.11.0 64K #direct-write #wrapper #binding #webrender #servo #helper #windows
unicode-case-mapping

Fast lowercase, uppercase, and titlecase mapping for characters

v0.5.0 60K #unicode-characters #unicode #lower-case #upper-case #case #title-case #titlecase
utf16_iter

Iterator by char over potentially-invalid UTF-16 in &[u16]

v1.0.5 53K #utf-16 #iterator #unicode #encoding #character-encoding
punycode

Functions to decode and encode Punycode

v0.4.1 61K bin+lib #rfc-3492 #codec #rfc3492
linkify

Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.

v0.10.0 47K #url #links #web #link #text
str_indices

Count and convert between indexing schemes on string slices

v0.4.3 47K no-std #string #no-std #text
unicode-vo

Unicode vertical orientation detection

v0.1.0 61K #unicode #vertical #orientation #detection #property #annex #50
utf16string

String types to work directly with UTF-16 encoded strings

v0.2.0 69K #utf-16 #string #byte-string #endianness #encoded-string #wstring
write16

A UTF-16 analog of the Write trait

v1.0.0 46K no-std #utf-16 #unicode #traits #write #analog #small-vec #sink
entities

raw data needed to convert to and from HTML entities

v1.0.2-rc.1 40K #html #convert-html #character #escaping
comrak

A 100% CommonMark-compatible GitHub Flavored Markdown parser and formatter

v0.24.1 38K bin+lib #markdown-parser #markdown #common-mark #github #gfm #syntax-highlighter #port
text_lines

Information about lines of text in a string

v0.6.0 44K #string #line #information
codepage

Mapping between Windows code page numbers and encoding_rs character encodings

v0.1.1 73K #character-encoding #winapi #encoding #data-encoding #unicode #windows
termimad

Markdown Renderer for the Terminal

v0.29.2 37K #markdown #terminal #markdown-text #renderer #command-line-interface #tui #terminal-text
text_io

really simple to use panicking input functions

v0.1.12 38K #io-read #read #io #scan #read-line #scanf #iterator
stfu8

Sorta Text Format in UTF-8

v0.2.7 51K #binary-data #text-encoding #unicode #binary #repr #text-format #invalid
wezterm-bidi

The Unicode Bidi Algorithm (UBA)

v0.2.3 34K #unicode #algorithm #bidirectional #uba #bidi #wezterm
unicode-reverse

Unicode-aware in-place string reversal

v1.0.9 35K no-std #unicode #string #grapheme #reverse #unicode-characters #no-std
lopdf

PDF document manipulation

v0.32.0 34K #pdf #editing #pdf-file #merge #manipulation #hash-map #file-format
os_display

Display strings in a safe platform-appropriate way

v0.1.3 33K no-std #terminal #shell #terminal-text #cli #text #no-std
pcre2

High level wrapper library for PCRE2

v0.2.7 28K #regex #jit #high-level #perl #pcre
markdown-gen

generating Markdown files

v1.2.1 29K #markdown #generate-markdown #generator #markdown-generator
ngrams

Generate n-grams from sequences

v1.0.1 41K #generate #sequence #vec
unicode_names2

Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…

v1.2.2 26K no-std #unicode-characters #unicode #name #mapping #memory-mapping #run-time #lookup-tables
grep

Fast line oriented regex searching as a library

v0.3.1 48K #search-pattern #pattern #ripgrep #search #line #matcher #oriented
str_inflector

Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…

v0.12.0 27K #snake #inflector #inflection #camel #pluralize
ropey

A fast and robust text rope for Rust

v1.6.1 45K #rope #edit #buffer #text #text-editing #text-editors #text-file
line-index

Maps flat TextSize offsets to/from (line, column) representation

v0.1.1 27K #line-column #offset #convert-text #index #maps #flat #representation
unicode-general-category

Fast lookup of the Unicode General Category property for char

v0.6.0 26K no-std #unicode #general #category #properties #no-std
emojis

✨ Lookup emoji in *O(1)* time, access metadata and GitHub shortcodes, iterate over all emoji, and more!

v0.6.2 27K no-std #emoji #github #unicode #gemoji
byte_string

Wrapper types for outputting byte strings (b"Hello") using the Debug ({:?}) format

v1.0.0 40K #string #debug #ascii #byte-slice
shell2batch

Converts simple basic shell scripts to Windows batch scripts

v0.4.5 26K #shell #batch #conversion #scripting #convert
snailquote

Escape and unescape strings with shell-inspired quoting

v0.3.1 25K #escaping #escape-character #escape #quote #shell-escape #unescape #escaped
ansi-to-tui

convert ansi color coded text into ratatui::text::Text type from ratatui library

v4.0.1 35K #ansi #ansi-colors #convert-text #tui #ansi-term #colored-text #parser
swrite

Infallible alternatives to write! and writeln! for Strings

v0.1.0 40K #string #write #macro #formatting #writeln
titlecase

Capitalize text according to a style defined by John Gruber for Daring Fireball

v3.1.1 22K bin+lib #title #case #capitalization #text-style #wasm #capitalisation
chardetng

A character encoding detector for legacy Web content

v0.1.17 22K #character-encoding #encoding #unicode #web #charset #unicode-characters
filecheck

writing tests for utilities that read text files and produce text output

v0.5.0 23K #directive #testing #regex #output #text-file #variables #pattern
const-str-proc-macro

compile-time string operations

v0.5.7 59K macro no-std #string #const #proc-macro
jieba-rs

The Jieba Chinese Word Segmentation Implemented in Rust

v0.7.0 21K #nlp #chinese #segmenation
uwl

A management stream for bytes and characters

v0.6.0 22K #byte-stream #unicode #unicode-characters #lexer #code-point #unicode-aware #language
sublime_fuzzy

Fuzzy matching algorithm based on Sublime Text's string search

v0.7.0 19K bin+lib #string-search #fuzzy-search #fuzzy-matching #fuzzy #search #match #fuzzy-string
sliceslice

A fast implementation of single-pattern substring search using SIMD acceleration

v0.4.2 19K #text-search #simd #search-algorithms #search #string-search #text #string
lindera-decompress

A morphological analysis library

v0.30.0 19K #morphological #analysis #library
man

Generate structured man pages

v0.3.0 20K #manpage #pages #structured #generate #author #section #io
lindera-ipadic-builder

A Japanese morphological dictionary builder for IPADIC

v0.30.0 19K #japanese-morphological #dictionary #morphological #builder #ipadic
lindera-dictionary

A Japanese morphological dictionary

v0.30.0 19K #japanese-morphological #morphological #analysis #library
lindera-unidic-builder

A Japanese morphological dictionary builder for UniDic

v0.30.0 19K #japanese-morphological #dictionary #japanese #morphological #builder #unidic
select

extract useful data from HTML documents, suitable for web scraping

v0.6.0 20K #web-scraping #html #extract #document #data #suitable #node
lindera-ko-dic-builder

A Korean morphological dictionary builder for ko-dic

v0.30.0 19K #dictionary #korean #builder #morphological #ko-dic
lindera-cc-cedict-builder

A Chinese morphological dictionary builder for CC-CEDICT

v0.30.0 19K #chinese #dictionary #builder #morphological #cc-cedict #tokenize
html2text

Render HTML as plain text

v0.12.5 17K #html-text #html-parser #plain-text #convert-html #html #html-rendering #text-rendering
chardet

rust version of chardet

v0.2.4 17K #detect #encoding #version #charset #utf-8 #language
lindera-ipadic-neologd-builder

A Japanese morphological dictionary builder for IPADIC NEologd

v0.30.0 17K #japanese-morphological #dictionary #japanese #builder #ipadic #neologd
sanitizer

A collection of methods and macros to sanitize struct fields

v0.1.6 16K #trim #case #struct-fields #validate #e164 #macro-derive
lowcharts

draw low-resolution graphs in terminal

v0.5.8 17K bin+lib #graph #console #grep #troubleshooting #data-analysis #text
cow-utils

Copy-on-write string utilities for Rust

v0.1.3 16K no-std #string #cow #text #str
unicode-truncate

Unicode-aware algorithm to pad or truncate str in terms of displayed width

v1.0.0 17K no-std #unicode #unicode-characters #width #truncate #pad #unicode-text #text
unified-diff

GNU unified diff format

v0.2.1 22K bin+lib #diff #gnu #unified #patch #generator #format #package
ucd-parse

parsing data files in the Unicode character database

v0.1.13 16K #unicode-characters #unicode #character #properties #database
stringmatch

Allow the use of regular expressions or strings wherever you need string comparison

v0.4.0 17K #string #regex #comparison #compare #string-pattern #match
pcre2-sys

Low level bindings to PCRE2

v0.2.9 28K sys #regex #pcre2 #jit #pcre #low-level
ansi-width

Calculate the width of a string when printed to the terminal

v0.1.0 16K #ansi-codes #width #string #unicode-characters #escaping #terminal #character
charabia

detect the language, tokenize the text and normalize the tokens

v0.8.10 13K #tokenizer #normalize #segmenter #tokenize #language #document
harfbuzz-sys

Rust bindings to the HarfBuzz text shaping engine

v0.6.1 11K sys #unicode #font #shaping #opentype
xlsxwriter

Write xlsx file with number, formula, string, formatting, autofilter, merged cells, data validation and more

v0.6.0 24K #excel #xlsx #string-formatting #api-bindings
grok

popular java & ruby grok library which allows easy text and log file processing with composable patterns

v2.0.0 14K #log-file #pattern #processing #java #ruby #composable #unstructured
uuhelp_parser

A collection of functions to parse the markdown code of help files

v0.0.26 14K #parse-markdown #help #parser #text #cross-platform #collection #functions
flexstr

A flexible, simple to use, immutable, clone-efficient String replacement for Rust

v0.9.2 22K no-std #string-literal #string #inline #heap-allocated #reference-counting #refcount #replace
etch

Not just a text formatter, don't mark it down, etch it

v0.4.2 13K bin+lib #text #formatter #don-t #mark #down #document #plugin
jetscii

A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes

v0.5.3 13K #byte-slice #string-search #string #byte #ascii #search #simd
garde

Validation library

v0.18.0 13K #validation #validate #valid #email #domain-name
detone

Decompose Vietnamese tone marks

v1.0.0 19K #vietnamese #unicode #unicode-normalization #tone #marks #forms #iterator
ripgrep

line-oriented search tool that recursively searches the current directory for a regex pattern while respecting gitignore rules. ripgrep has first class support on Windows, macOS and Linux.

v14.1.0 19K app #grep #pattern #search-pattern #gitignore #search
lindera-core

A morphological analysis library

v0.30.0 19K #morphological #analysis #library
cedarwood

efficiently-updatable double-array trie in Rust (ported from cedar)

v0.4.6 21K #string-search #trie #search #cedar #string #text-search #text
escape-bytes

Escapes bytes that are not printable ASCII characters

v0.1.1 12K no-std #escaping #ascii #byte-sequences #characters #forms #printable #utf-8
doccy

brace based markup language

v0.3.2 13K bin+lib #markup-language #html #markup #language #text #command-line-interface #line-break
lindera-tokenizer

A morphological analysis library

v0.30.0 12K #tokenizer #analysis #morphological #tokenize #library
lindera-compress

A morphological analysis library

v0.30.0 12K #morphological #analysis #library
lindera-ko-dic

A Korean morphological dictionary for ko-dic

v0.30.0 12K #japanese #japanese-morphological #morphological #dictionary #ko-dic #korean
bk-tree

A Rust BK-tree implementation

v0.5.0 12K #levenshtein #fuzzy-search #fuzzy #search #tree #metrics #distance
glob-match

An extremely fast glob matcher

v0.2.1 18K #pattern-matching #glob-pattern #wildcard #path #character #class #braces
array_tool

Helper methods for processing collections

v1.0.3 11K #string #substitution #unique #grapheme #vector
harfbuzz-traits

Rust Traits for the HarfBuzz text shaping engine

v0.6.0 14K #shaping #unicode #opentype #font #text
lexicmp

comparing and sorting strings lexicographically and naturally

v0.1.0 16K #sorting #emoji #unicode #transliteration #unicode-characters #lexicographical #sorted
slugify

Macro for flexible slug generation

v0.1.0 10K #slug #macro #flexible #words #unicode #stop #generation
lipsum

lorem ipsum text generation library. It generates pseudo-random Latin text. Use this if you need filler or dummy text for your application. The text is generated using a simple Markov chain…

v0.9.1 11K #text #text-generation #random #markov #typography
cuid

An implementation of the CUID protocol in Rust

v1.3.2 10K bin+lib #unique #horizontal #lookup #collision-resistant #binary #unique-id #benchmark
pretty-xmlish

Pretty print XML-ish data with unicode art

v0.1.13 10K #pretty-printing #unicode-characters #art #output #author #limited #data
minify-html-common

Common code and data for minify-html*

v0.0.2 10K #minify-html #white-space #html-css #nodejs #js #bindings #speed
hyphenation

Knuth-Liang hyphenation for a variety of languages

v0.8.4 10K #typesetting #dictionary #language #pattern #utf-8 #standard #built
hyperscan

bindings for Rust with Multiple Pattern and Streaming Scan

v0.3.2 11K #regex #streaming #pattern-matching #expression #regular #run-time #scan
wchar

Procedural macros for compile time UTF-16 and UTF-32 wide strings

v0.11.0 10K #string #utf-16 #wide #compile-time #proc-macro
rutie

The tie between Ruby and Rust

v0.9.0 13K sys #ruby #object #exception #integration #cruby #applications #directory
unicode-blocks

contains a list of all unicode blocks and provides some functions to search across them

v0.1.9 10K no-std #unicode-characters #unicode #blocks #character #cjk #block
suffix

arrays

v1.3.0 10K bin+lib #suffix-array #search #search-index #index #text-search #linear-time #text
ferris-says

flavored replacement for the classic cowsay

v0.3.1 9.0K #cowsay #print #rustaceans #ferris #fsays #byte-string #rustacean
ucd

Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…

v0.1.1 9.2K #unicode-characters #unicode #character
printpdf

writing PDF files

v0.7.0 8.4K #pdf #graphics #wkhtmltopdf #gui
hyperscan-sys

Hyperscan bindings for Rust with Multiple Pattern and Streaming Scan

v0.3.2 11K sys #regex #hyperscan #streaming
sd

An intuitive find & replace CLI

v1.0.0 8.6K app #regex #replace #sed #find
caseless

Unicode caseless matching

v0.2.1 8.7K #unicode #matching
print-positions

providing string segmentation on grapheme clusters and ANSI escape sequences for accurate length arithmetic based on visible print positions

v0.6.1 8.6K #escaping #ansi #grapheme #unicode #escape-sequence #source-string #text
unicode-casing

Titlecase helper function on characters

v0.1.0 9.0K #character #title-case #helper
fm

Non-backtracking fuzzy text matcher

v0.3.0 8.1K #pattern-matching #line #fuzzy #multi-line #wildcard #non-backtracking #regex
console_static_text

Logging for text that should stay in the same place in a console

v0.8.2 8.8K #console #progress-bar #logging #place #static #bars #words
commonregex

Rust port for CommonRegex. Find all times, dates, links, phone numbers, emails, ip addresses, prices, hex colors, and credit card numbers in a string. We did the hard work so you don't have to.

v0.2.0 9.6K #regex #phone-number #string #numbers #date #find #email
nucleo-matcher

plug and play high performance fuzzy matcher

v0.3.1 7.9K #fuzzy-matching #pattern-matching #nucleo #fuzzy-search #matcher #performance #comparison
svgbobdoc

Renders ASCII diagrams in doc comments as SVG images

v0.3.0 9.6K macro #svg #diagram #documentation #figure #rustdoc #proc-macro
lindera-unidic

A Japanese morphological dictionary for UniDic

v0.30.0 7.4K #japanese-morphological #japanese #morphological #dictionary #unidic
harfbuzz

Rust bindings to the HarfBuzz text shaping engine

v0.6.0 5.2K no-std #font #shaping #unicode #opentype #unicode-text #text
lindera-ipadic

A Japanese morphological dictionary for IPADIC

v0.30.0 7.4K #japanese-morphological #japanese #morphological #dictionary #ipadic
gh-emoji

Convert :emoji: to Unicode using GitHub’s emoji names

v1.0.8 7.5K #emoji #github #unicode #convert #markdown
nu-utils

Nushell utility functions

v0.93.0 6.5K bin+lib #nushell #shell #utility #functions
text-splitter

Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.

v0.13.1 6.8K #chunk-size #split #nlp #tokenizer #ai #language-model #text
synoptic

low-level, syntax highlighting library with unicode support

v2.0.0 7.0K #syntax-highlighting #syntax-highlighter #unicode #configurable #low-level #editor #projects
pager

pipe your output through an external pager

v0.16.1 6.9K #env-var #less #output #command-output #variables #environment #external
text-diff

text diffing and assertion library

v0.4.0 9.9K bin+lib #diff #difference #change #assert
hyphenation_commons

Proemial code for the hyphenation library

v0.8.4 10K #hyphenation #proemial #commons #unicode #text #internal
bwrap

A fast, lightweight, embedded systems-friendly library for wrapping text

v1.3.0 6.8K no-std #heap-allocation #formatting #wrap #line-feed #80-column #text-formatting #no-std
target_info

Get text strings of attributes concerning the build target

v0.1.0 13K #target #build #attributes #info #string #information #text
precis-tools

Tools and parsers to generate PRECIS tables from the Unicode Character Database (UCD)

v0.1.7 6.8K #unicode-characters #internationalized #comparison #precis #enforcement #preparation
precis-profiles

PRECIS Framework: Preparation, Enforcement, and Comparison of Internationalized Strings Representing Usernames and Passwords as defined in rfc8265; and Nicknames as defined in rfc8266

v0.1.10 6.8K #profiles #precis #profile #rfc8265 #rfc8266 #user-name #rfc8264
ascii_tree

generates ascii trees

v0.1.1 8.2K #tree #ascii #generates
terminal-supports-emoji

Check whether the current terminal supports emoji

v0.1.3 8.4K #emoji #terminal #stream #supports-emoji
unic-ucd-age

UNIC — Unicode Character Database — Age

v0.9.0 5.4K #unicode #age #character-property #unicode-text
svgbob

Transform your ascii diagrams into happy little SVG

v0.7.2 5.2K #diagram #svg #ascii #bob #ascii-text #convert-text #text
imperative

Check for imperative mood in text

v1.0.5 5.6K #text #mood #word
lexis

Generates human-readable sequences from numeric values using a predefined word list

v0.2.2 7.0K #text-encoding #human-readable #text #word-list #encoding
simple-logging

logger for the log facade

v2.0.2 7.5K #logging #log #logger #log-messages #log-file #log-level #simple
genpdf

User-friendly PDF generator written in pure Rust

v0.2.0 5.1K #pdf-document #pdf #layout #pdf-file #text-layout #text-rendering #text
prop-check-rs

A Property-based testing Library in Rust

v0.0.583 5.3K bin+lib #testing #property-based #property-based-testing #value #choose
encoding_c_mem

C API for encoding_rs::mem

v0.2.6 6.7K sys #c-api #unicode #charset #ffi #encoding #capi
tracing-texray

Tracing layer to view a plaintext timeline of spans and events

v0.2.0 7.8K #tracing-layer #spans #plain-text #events #tracing-subscriber #timeline #examine
pdf-extract

extract content from pdfs

v0.7.7 4.7K #pdf #pdf2txt #pdf2text #text
ra_ap_test_utils

TBD

v0.0.216 4.6K #testing-utilities #comparison #string #marker #fixture #diff #module
mdbook-mermaid

mdbook preprocessor to add mermaid support

v0.13.0 4.8K bin+lib #mdbook #mermaid #graph #add #book #js
typos-dict

Source Code Spelling Correction

v0.11.18 4.4K #spelling #spelling-correction #spell-checking #false-positives #typos #development #monorepo
utf8-cstr

Type wrappers promising null termination and utf-8 validity. The intersection of std::ffi::CStr and str

v0.1.6 13K #utf-8 #null #str #intersection #wrappers #c-str #termination
pluralizer

Rust package to pluralize or singularize any word based on a count, inspired by the pluralize NPM package

v0.4.0 4.9K #plural #word-count #singular #pluralize #npm-package #plurals
mdbook-linkcheck

A backend for mdbook which will check your links for you

v0.7.7 4.6K bin+lib #mdbook #check #link #backend #book #linkcheck #directory
textdistance

Lots of algorithms to compare how similar two sequences are

v1.0.2 3.9K bin+lib #levenshtein #distance #hamming #similarity #jaro
textnonce

Text based random nonce generator

v1.0.0 7.1K #nonce #random #text #numbers #generator #length #cryptography
fax

Decoder and Encoder for CCITT Group 3 and 4 bi-level image encodings used by fax machines, TIFF, and PDF

v0.2.4 6.5K #image-encoding #codec #ccitt #decoding #tiff #ccitt-fax-decode #pdf
mdbook-svgbob

SvgBob mdbook preprocessor which swaps code-blocks with neat SVG

v0.2.1 4.5K app #mdbook #svg #markdown #ascii #bob
textwrap-macros

procedural macros to use textwrap utilities at compile time

v0.3.0 4.6K no-std #text-formatting #proc-macro #text #formatting #macro #compile-time #typesetting
mdbook-preprocessor-boilerplate

Boilerplate code for mdbook preprocessors

v0.1.2 5.3K #mdbook #boilerplate #proprocessor
qp-trie

An idiomatic and fast QP-trie implementation in pure Rust, written with an emphasis on safety

v0.8.2 7.2K no-std #trie #key-value #key #radix #value #map #data-structures
mdbook-pandoc

An mdbook backend that outsources most of the rendering process to pandoc

v0.6.4 4.3K bin+lib #mdbook #pandoc #pdf #latex #book
ra_ap_ide_ssr

Structural search and replace of Rust code

v0.0.216 5.6K #ide #replace #search #structural #compiler #front-end #create
unic-bidi

UNIC — Unicode Bidirectional Algorithm

v0.9.0 4.6K #unicode #rtl #unicode-text #layout #bidi #text
evcxr

An Evaluation Context for Rust

v0.17.0 3.9K bin+lib #evaluation #context #variables #local #eval #detail #eval-context
regex-cursor

regex fork that can search discontiguous haystacks

v0.1.4 4.2K #regex #dfa #automata #nfa #search-engine #byte-range
lingua-english-language-model

The English language model for Lingua, an accurate natural language detection library

v1.1.0 5.6K #english #nlp #language-model #language-detection #language-recognition
lindera-cc-cedict

A Chinese morphological dictionary for CC-CEDICT

v0.30.0 4.6K #japanese-morphological #dictionary #morphological #cc-cedict #chinese
scanlex

lexical scanner for parsing text into tokens

v0.1.4 3.8K #input #tokenize #scan #text #text-parser
typos-cli

Source Code Spelling Correction

v1.21.0 5.9K bin+lib #spelling-correction #spelling #typos #development #source #monorepo #spell-checker
adobe-cmap-parser

parse Adobe CMap files

v0.4.0 5.3K #pdf #postscript #font #cmap
lindera

A morphological analysis library

v0.30.0 3.7K #morphological #analysis #library
hypher

separates words into syllables

v0.1.5 3.8K no-std #syllable #hyphenation #language #words #pattern #byte #binary-data
regex_mutator

The Nautilus regex_mutator

v0.3.1 3.3K #regex #input #rule #grammar #generate #fuzzer #script
easy_reader

easily navigating forward, backward or randomly through the lines of huge files

v0.5.2 3.4K #file-line #line #backward #random #reader #reverse #lines
trigram

Trigram-based string similarity for fuzzy matching

v0.4.4 4.8K #string-similarity #string-matching #fuzzy-string #fuzzy-matching #string #fuzzy #matching
rustyline-async

A minimal readline with multiline and async support

v0.4.2 3.4K #read-line #multiline #input #history #unicode #async #crossterm
lingua-german-language-model

The German language model for Lingua, an accurate natural language detection library

v1.1.0 4.9K #nlp #language-detection #language-model #language-recognition
quoted-string-parser

Quoted string parser for grammar defined in RFC3261

v0.1.0 6.4K #parser #rfc3261 #quoted-string
compact_bytes

A memory efficient bytes container that transparently stores bytes on the stack, when possible

v0.1.1 5.5K #byte #memory #compact #mutable #small
varcon-core

Varcon-relevant data structures

v4.0.7 4.2K #spell-checking #data-structures #code-quality
reword

some utility functions for human-readable formatting of words

v7.0.0 2.9K #formatting #human-readable #words
grep-pcre2

Use PCRE2 with the 'grep' crate

v0.1.7 4.9K #grep #regex #look #backreference #pcre
mdxjs

Compile MDX to JavaScript in Rust

v0.2.2 3.0K #compile #markdown #mdx
mdbook-toc

mdbook preprocessor to add Table of Contents

v0.14.2 3.2K bin+lib #table #content #add #marker #inline #toc #level
uwuify

fastest text uwuifier in the west

v0.2.2 3.3K bin+lib #uwu #simd #owo #cli
hunspell-rs

Rust bindings to the Hunspell library

v0.4.0 3.6K #hunspell #spell-checking #spellcheck #dictionary #bindings #hunspell-sys
esl01-renderdag

Render a graph into ASCII or Unicode text

v0.3.0 5.4K #graph #render #ascii #render-graph #unicode #unicode-text
wana_kana

checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji

v3.0.0 3.0K bin+lib #japanese #katakana #hiragana #romaji #kana
sanitize-filename-reader-friendly

A filename sanitizer aiming to produce reader friendly filenames

v2.2.1 2.8K bin+lib #filename #filenames #sanitizer
vaporetto

pointwise prediction based tokenizer

v0.6.3 2.7K no-std #japanese #tokenizer #analyzer #morphological
file-size

a function formatting file sizes in 4 chars

v1.0.3 3.3K #size #file #utility #format
typos

Source Code Spelling Correction

v0.10.23 4.2K #spelling-correction #spelling #spell-checker #development #false-positives #source #monorepo
typos-vars

Source Code Spelling Correction

v0.8.17 4.2K #spelling-correction #spelling #spell-checking #false-positives #source #development #typos
dictgen

Compile-time case-insensitive map

v0.2.8 4.1K no-std #spelling #development #no-std
egui-dropdown

An actual dropdown list for egui

v0.9.0 2.7K #egui #list #ui #dropdown #items #text
rapidfuzz

rapid fuzzy string matching library

v0.5.0 2.6K #levenshtein #string-similarity #string-matching #fuzzy-string #string #similarity #hamming
hyper-old-types

HTTP types from hyper 0.11.x

v0.11.0 5.9K #hyper #deprecated #type #ease #11 #backwards-compatibility #http
sre-engine

A low-level implementation of Python's SRE regex engine

v0.4.3 2.6K #regex #python #low-level #engine #sre
frida-build

Rust bindings for Frida

v0.13.6 3.0K #frida #bindings #build
keyvalues-parser

A parser/renderer for vdf text

v0.2.0 2.9K #vdf #key-value #text-parser #parser #steam #keyvalues
utfx

v0.1.0 3.8K no-std #string #wide #utf-32 #utf-16 #converting #api #winapi
lingua-french-language-model

The French language model for Lingua, an accurate natural language detection library

v1.1.0 4.3K #nlp #language-model #language-detection #language-recognition
re_space_view_text_document

space view that shows a single text box

v0.17.0-alpha.2 2.9K #text-document #view #space #single #rerun #box #show
line-span

Find line ranges and jump between next and previous lines

v0.1.5 2.5K no-std #line #text #streaming #line-end #lines
lingua-spanish-language-model

The Spanish language model for Lingua, an accurate natural language detection library

v1.1.0 4.2K #nlp #language-detection #language-model #language-recognition
svgbob_cli

Transform your ascii diagrams into happy little SVG

v0.7.2 2.5K app #svg #ascii #convert #bob #convert-text
sedregex

Sed-like regex library

v0.2.5 2.4K #regex #sed #replace #command #processing #one-time #sed-like
harfbuzz_rs

A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust

v2.0.1 2.4K #harfbuzz #shaping #textlayout #text #ffi
unicode-canonical-combining-class

Fast lookup of the Canonical Combining Class property

v0.5.0 2.3K no-std #unicode #combining #class #canonical #no-std
mini_paste

Fast-to-compile equivalent to ::paste

v0.1.11 4.2K #paste #fast-to-compile #replacing #seamlessly #offers #nor #licensed
text_trees

textual output for tree-like structures

v0.1.2 2.3K #tree-node #tree-structure #output #text #textual #formatting #child
hunspell-sys

Bindings to the hunspell C API

v0.3.1 3.6K sys #hunspell #bindings #api #static-libclang
mdbook-graphviz

mdbook preprocessor to add graphviz support

v0.2.0 2.1K app #graphviz #dot #add #process #flags #file
stringcase

Converts string cases between camelCase, COBOL-CASE, kebab-case, and so on

v0.2.1 2.3K #convert-string #camel-case #pascal-case #case #ascii-text #pascal #snake
tiny-gradient

Make your string colored in gradient

v0.1.0 3.6K no-std #gradient #ansi-colors #ansi-term #color #ansi #terminal #cli
fuzzt

Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.

v0.3.1 1.8K #levenshtein #string-similarity #string #similarity #hamming #jaro
in_definite

Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.

v1.0.0 2.3K #nlp #grammar #english #text
glyph-names

Mapping of characters to glyph names according to the Adobe Glyph List Specification

v0.2.0 2.0K #glyph #name #font
line-numbers

Find line numbers in strings by byte offsets, quickly

v0.3.0 2.4K #byte-offset #line-string #numbers #find #within #text #line-positions
unicode-joining-type

Fast lookup of the Unicode Joining Type and Joining Group properties

v0.7.0 1.9K no-std #unicode #joining #shaping #arabic #no-std
lindera-analyzer

A morphological analysis library

v0.30.0 2.1K #unicode-normalization #analysis #morphological #japanese #library
secular

No Diacr!

v1.0.1 2.0K #unicode-normalization #unicode #normalization #diacritics
strings

String utilities, including an unbalanced Rope

v0.1.1 3.4K #string #rope #utf-8 #unbalanced #structures #src-rope #character
pest_ascii_tree

Helper crates converting the parsing result of any pest grammar into an ascii tree

v0.1.0 3.6K #pest-grammar #pest-parser #pest #ascii #tree
lingua-chinese-language-model

The Chinese language model for Lingua, an accurate natural language detection library

v1.1.0 3.2K #nlp #language-detection #language-model #language-recognition
readability

Port of arc90's readability project to Rust

v0.3.0 2.0K #port #content #arc90 #webpage #primary #readable #scrape
lingua-japanese-language-model

The Japanese language model for Lingua, an accurate natural language detection library

v1.1.0 3.1K #nlp #language-model #language-detection #language-recognition
regex-macro

A macro to generate a lazy regex expression

v0.2.0 3.0K #regex #lazy-evaluation #macro
encoding8

various 8-bit encodings

v0.3.2 2.8K #ascii #8-bit #encoding #ebcdic
pandoc

API that wraps calls to the pandoc 2.x executable

v0.8.11 1.9K #markdown #latex #executable #api #builder #calls #wraps
cargo-spellcheck

Checks all doc comments for spelling mistakes

v0.14.0 1.7K bin+lib #spelling #grammar #spell-checking #spellcheck
mdbook-admonish

A preprocessor for mdbook to add Material Design admonishments

v1.16.0 1.8K bin+lib #material-design #mdbook #design #material #markdown #ui
clippy_lints

A bunch of helpful lints to avoid common pitfalls in Rust

v0.0.212 2.4K nightly #lint #clippy #plugin
text_unit

Newtypes for text offsets

v0.1.10 2.4K #offset #newtype #text #u32 #wrappers
linkcheck

extracting and validating links

v0.4.1 2.7K #links #check #link #link-checker
lingua-portuguese-language-model

The Portuguese language model for Lingua, an accurate natural language detection library

v1.1.0 2.9K #nlp #language-detection #language-model #language-recognition
textcode

Text encoding/decoding library. Supports: UTF-8, ISO6937, ISO8859, GB2312

v0.2.2 1.7K #charset #unicode #encoding #text-encoding
lingua-italian-language-model

The Italian language model for Lingua, an accurate natural language detection library

v1.1.0 2.9K #nlp #language-model #language-detection #language-recognition
lingua-russian-language-model

The Russian language model for Lingua, an accurate natural language detection library

v1.1.0 2.9K #nlp #language-detection #language-model #language-recognition
lingua-ukrainian-language-model

The Ukrainian language model for Lingua, an accurate natural language detection library

v1.1.0 2.8K #nlp #language-model #language-detection #language-recognition
lingua-arabic-language-model

The Arabic language model for Lingua, an accurate natural language detection library

v1.1.0 2.8K #nlp #language-model #language-detection #language-recognition #compression
lingua-turkish-language-model

The Turkish language model for Lingua, an accurate natural language detection library

v1.1.0 2.8K #nlp #language-model #language-detection #language-recognition
lingua-hindi-language-model

The Hindi language model for Lingua, an accurate natural language detection library

v1.1.0 2.8K #nlp #language-model #language-detection #language-recognition
lingua-korean-language-model

The Korean language model for Lingua, an accurate natural language detection library

v1.1.0 2.8K #language-model #nlp #language-detection #language-recognition
lingua-thai-language-model

The Thai language model for Lingua, an accurate natural language detection library

v1.1.0 2.8K #nlp #language-detection #language-model #language-recognition
crop

A pretty fast text rope

v0.4.2 2.9K #rope #buffer #edit #tree #data-structures #line-break
lingua-vietnamese-language-model

The Vietnamese language model for Lingua, an accurate natural language detection library

v1.1.0 2.8K #nlp #language-model #language-detection #language-recognition
lingua-latvian-language-model

The Latvian language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #language-model #nlp #language-detection #language-recognition
soup

Inspired by the python library BeautifulSoup, this is a layer on top of html5ever that adds a different API for querying and manipulating HTML

v0.5.1 1.8K #html #querying #manipulating #html5ever #python #different #top
lindera-filter

Character and token filters for Lindera

v0.30.0 1.6K #morphological #analysis #library
mdbook-katex

mdBook preprocessor rendering LaTeX equations to HTML

v0.8.1 1.5K bin+lib #math-expressions #mdbook #katex #latex #html-rendering #build-time #equations
lingua-dutch-language-model

The Dutch language model for Lingua, an accurate natural language detection library

v1.1.0 2.7K #nlp #language-model #language-detection #language-recognition
lingua-polish-language-model

The Polish language model for Lingua, an accurate natural language detection library

v1.1.0 2.6K #nlp #language-model #language-detection #language-recognition
lingua-indonesian-language-model

The Indonesian language model for Lingua, an accurate natural language detection library

v1.1.0 2.4K #nlp #language-detection #language-model #language-recognition
lingua-persian-language-model

The Persian language model for Lingua, an accurate natural language detection library

v1.1.0 2.4K #language-model #nlp #language-detection #language-recognition
lingua-bokmal-language-model

The Bokmal language model for Lingua, an accurate natural language detection library

v1.1.0 2.4K #nlp #language-model #language-detection #language-recognition
lingua-mongolian-language-model

The Mongolian language model for Lingua, an accurate natural language detection library

v1.1.0 2.4K #nlp #language-model #language-detection #language-recognition
lingua-malay-language-model

The Malay language model for Lingua, an accurate natural language detection library

v1.1.0 2.4K #nlp #language-detection #language-model #language-recognition
lingua-nynorsk-language-model

The Nynorsk language model for Lingua, an accurate natural language detection library

v1.1.0 2.4K #nlp #language-model #language-detection #language-recognition
pulldown-cmark-mdcat

Render pulldown-cmark events to TTY

v2.1.2 1.3K #markdown #markdown-syntax #cmark #cat #less
fast2s

A fast Traditional Chinese to Simplified Chinese conversion library. Built with FST, faster than most other libraries.

v0.3.1 1.9K #chinese #convert #hanzi #traditional #simplified #localization
srx

A mostly compliant Rust implementation of the Segmentation Rules eXchange (SRX) 2.0 standard for text segmentation

v0.1.4 1.4K #segmentation #standard #plain-text #regex #compliant #rules #exchange
neo-mime

Strongly Typed Mimes

v0.1.1 2.1K #mime #media-type #media-extensions #media-types
simple_excel_writer

Excel Writer

v0.2.0 2.0K #excel #xlsx #xls #office
rasciigraph

function to plot ascii graphs

v0.2.0 1.2K #ascii #graph #plot #characters #terminal #line #chart
lingua-romanian-language-model

The Romanian language model for Lingua, an accurate natural language detection library

v1.1.0 2.3K #nlp #language-detection #language-model #language-recognition
lingua-greek-language-model

The Modern Greek language model for Lingua, an accurate natural language detection library

v1.1.0 2.3K #nlp #language-detection #language-model #language-recognition
lingua-hungarian-language-model

The Hungarian language model for Lingua, an accurate natural language detection library

v1.1.0 2.3K #language-detection #nlp #language-model #language-recognition
lingua-danish-language-model

The Danish language model for Lingua, an accurate natural language detection library

v1.1.0 2.3K #language-detection #nlp #language-model #language-recognition
lingua-finnish-language-model

The Finnish language model for Lingua, an accurate natural language detection library

v1.1.0 2.3K #nlp #language-detection #language-model #language-recognition
lingua-swedish-language-model

The Swedish language model for Lingua, an accurate natural language detection library

v1.1.0 2.3K #nlp #language-detection #language-model #language-recognition
lingua-slovak-language-model

The Slovak language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-armenian-language-model

The Armenian language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-model #language-detection #language-recognition
lingua-estonian-language-model

The Estonian language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-lithuanian-language-model

The Lithuanian language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-catalan-language-model

The Catalan language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-model #language-detection #language-recognition
lingua-slovene-language-model

The Slovene language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-czech-language-model

The Czech language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-model #language-detection #language-recognition
lingua-bulgarian-language-model

The Bulgarian language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-model #language-detection #language-recognition
lingua-tamil-language-model

The Tamil language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-model #language-detection #language-recognition
lingua-serbian-language-model

The Serbian language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-model #language-detection #language-recognition
lingua-icelandic-language-model

The Icelandic language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-azerbaijani-language-model

The Azerbaijani language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-esperanto-language-model

The Esperanto language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #language-model #nlp #language-detection #language-recognition
lingua-shona-language-model

The Shona language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-model #language-detection #language-recognition
lingua-hebrew-language-model

The Hebrew language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-irish-language-model

The Irish language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-georgian-language-model

The Georgian language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-xhosa-language-model

The Xhosa language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-macedonian-language-model

The Macedonian language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-kazakh-language-model

The Kazakh language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-model #language-detection #language-recognition
lingua-zulu-language-model

The Zulu language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-model #language-detection #language-recognition
lingua-urdu-language-model

The Urdu language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-model #language-detection #language-recognition
lingua-sotho-language-model

The Sotho language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #language-model #nlp #language-detection #language-recognition
lingua-welsh-language-model

The Welsh language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-belarusian-language-model

The Belarusian language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-tagalog-language-model

The Tagalog language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #language-model #nlp #language-detection #language-recognition
lingua-marathi-language-model

The Marathi language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #language-model #nlp #language-detection #language-recognition
lingua-afrikaans-language-model

The Afrikaans language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-maori-language-model

The Māori language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #language-detection #language-model #nlp #language-recognition
lingua-somali-language-model

The Somali language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-albanian-language-model

The Albanian language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-model #language-detection #language-recognition
lingua-yoruba-language-model

The Yoruba language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #language-model #nlp #language-detection #language-recognition
lingua-telugu-language-model

The Telugu language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #language-model #language-detection #nlp #language-recognition
lingua-tswana-language-model

The Tswana language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-croatian-language-model

The Croatian language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-latin-language-model

The Latin language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-basque-language-model

The Basque language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-model #language-detection #language-recognition
lingua-ganda-language-model

The Ganda language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-swahili-language-model

The Swahili language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-model #language-detection #language-recognition
lingua-bengali-language-model

The Bengali language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-model #language-detection #language-recognition
lingua-gujarati-language-model

The Gujarati language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-detection #language-model #language-recognition
lingua-punjabi-language-model

The Punjabi language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-model #language-detection #language-recognition
lingua-bosnian-language-model

The Bosnian language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-model #language-detection #language-recognition
lingua-tsonga-language-model

The Tsonga language model for Lingua, an accurate natural language detection library

v1.1.0 2.2K #nlp #language-model #language-detection #language-recognition
text-colorizer

Transitionary package

v1.0.0 1.8K #text-colorizer #package #text #transitionary
pinot

Fast, high-fidelity OpenType parser

v0.1.5 1.3K #opentype #font #parse #graphics #parser
no-comment

Remove rust-style line and block comments from a char iterator

v0.0.3 1.7K #comments #line-comment #iterator #block #string #specification #rust-style
jayce

tokenizer 🌌

v12.1.0 1.1K #tokenizer #token #found #name #sync #default #🌌
float-pretty-print

Format f64 for showing to user, not for serialisation

v0.1.1 1.2K #pretty-print #float #format #print #human #pretty #string-formatting
vi

An input method library for vietnamese IME

v0.6.0 1.1K #vietnamese #input #ime #user-input #tone #mark #output
xml2json-rs

converting to and from XML/JSON

v1.0.1 2.1K #json-xml #json #xml #conversion
diacritics

Remove diacritics from letters, for example when standardizing input for a search

v0.2.2 1.0K #search #normalize #text
like

SQL LIKE-style pattern matching

v0.3.1 1.9K #pattern-matching #pattern #matching #escaping #sql #style
lean-sys

Bindings to Lean 4's C API

v0.0.6 1.6K sys #lean #bindings #math #api-bindings
marker

finding issues in CommonMark documents

v0.9.0 1.0K app #common-mark #links #markdown #validate #broken-links #validation #file-path
xmldecl

Extracts an encoding from an ASCII-based bogo-XML declaration in text/html in a Web-compatible way

v0.2.0 1.5K #charset #unicode #web #encoding
unicode_reader

Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters

v1.0.2 1.2K #unicode #grapheme #reader #unicode-text #code-point #codepoint #adaptor
wkhtmltopdf

High-level bindings to wkhtmltopdf

v0.4.0 1.2K #pdf #html #wkhtmltox #wkhtmltoimage
moto

motivated automation

v0.2.29 1.0K bin+lib #automation #task #workflow #runtimes #language #within #text
xsv

A high performance CSV command line toolkit

v0.13.0 1.2K app #csv #csv-tsv #slice #command #tsv
terminal-clipboard

a minimal cross-platform clipboard

v0.4.1 1.1K #clipboard #terminal #string #terminal-text #cross-platform #copying #pasting
atelier_test

Test and example models used within the other Atelier crates

v0.1.4 2.4K #model #within #atelier #smithy
doc-chunks

Clusters of doc comments and dev comments as a coherent view

v0.1.0 1.2K #documentation #cluster #chunks #string-representation #file-path
regex_generate

Use regular expressions to generate text

v0.2.3 900 #regex #text-generation #generation #text
mdbook-cmdrun

mdbook preprocessor to run arbitrary commands

v0.6.0 1.0K bin+lib #mdbook #preprocessor #run-command #command-output #runcmd
string_wizard

manipulate strings like wizards

v0.0.19 2.2K #string #wizard #manipulate
symspell

Spelling correction & Fuzzy search

v0.4.3 900 #spelling-correction #fuzzy-search #spell-checking #spellcheck #algorithm #dictionary #wolfgarbe
norad

Read and write Unified Font Object files

v0.14.1 950 #font #ufo #read-write #graphics
lindera-ipadic-neologd

A Japanese morphological dictionary for IPADIC NEologd

v0.30.0 1.2K #japanese-morphological #japanese #morphological #dictionary #ipadic #neologd
newdoc

Generate pre-populated module files formatted with AsciiDoc that are used in Red Hat and Fedora documentation

v2.17.0 900 bin+lib #redhat #documentation #asciidoc #red-hat
mdbook-codeblocks

An mdbook preprocessor to prepend a customizable vignette to code blocks

v0.1.15 900 app #mdbook #preprocessor #pre-processor #code-block
mdbook-tailor

mdbook preprocessor for image-tailor

v0.6.4 900 bin+lib #mdbook #image #tailor #delay #page #width-height
censor

text profanity filter

v0.3.0 2.0K #filter #profanity #swear #political
focaccia

no_std implementation of Unicode case folding comparisons

v1.4.0 900 no-std #case-insensitive #unicode #case #case-folding #no-std #unicode-text #order
sourceannot

render snippets of source code with annotations

v0.2.0 800 #annotations #annotation #report #error #code #unicode-characters
opml

OPML library for Rust

v1.1.6 850 #xml #documentation #standalone
local-encoding

encoding/decoding strings with the local charset. Useful for working with ANSI strings on Windows.

v0.2.0 1.7K #string #local #ansi #codec #charset #codepage #utf-8
cffdrs

Canadian Forest Fire Danger Rating System

v0.6.2 950 #fbp #fwi #wildfire
words-count

Count the words and characters, with or without whitespace

v0.1.6 850 no-std #word-count #count #word #character #utf-8 #letter #white-space
rustpython-sre_engine

A low-level implementation of Python's SRE regex engine

v0.3.1 950 #regex #python #engine #low-level #sre #interpreter #language
boreal

evaluate YARA rules, used to scan bytes for textual and binary patterns

v0.7.0 850 #yara #string-matching #scan #execution-time #replace
basic-text

Basic Text strings and I/O streams

v0.19.2 1.2K #stream #io-stream #basic #plain-text #text-format #text #unicode
create_broken_files

Create broken files from other ones

v3.0.1 750 app #input-file #broken #testing #random #fuzzer #fuzzing #data
confusables

around Unicode confusables/homoglyphs

v0.1.0 1.5K #unicode #confusable #homoglyphs
chewing

(酷音) intelligent Zhuyin input method

v0.8.2 800 #input #intelligent #phonetic #chinese #user-input #bopomofo #keyboard-input
fuzzywuzzy

A pure-Rust clone of the incredibly useful fuzzy string matching Python package, FuzzyWuzzy

v0.0.2 1.0K #fuzzy-string #string-matching #string #fuzzy-matching #utility #text #python-packages
savvy

R extension interface

v0.6.3 850 #interface #extension #sexp #package #vector #name #convert
vaporetto_rules

Rule-based filters for Vaporetto

v0.6.3 800 no-std #japanese #tokenizer #morphological #analyzer
metatensor-sys

Bindings to the metatensor C library

v0.1.8 2.0K sys #metatensor #shared #api #matching #bindings #static #build
char_reader

Safely read wild streams as chars or lines

v0.1.1 1.2K #reader #unicode-characters #unicode #char #io-error
fontconfig

Safe, higher-level wrapper around the Fontconfig library

v0.8.0 800 #font #wrapper #search
heckcheck

A heckin small test case generator

v2.0.1 800 #rgb #test-cases #generator #testing #case #serialization #heckin
bashtestmd

Compiles shell commands in .md files into Bash scripts for testing

v0.4.1 850 app #bash #markdown #shell #script #command #compile #tags
stringzilla

Faster SIMD-accelerated string search, sorting, fingerprints, and edit distances

v3.8.4 600 sys no-std #simd #sorting #search #hash #character-set #set-operations #search-algorithms
str-utils

some traits to extend types which implement AsRef<[u8]> or AsRef<str>

v0.1.7 800 no-std #string #ascii #starts-with #ends-with #caseless
stream-rate-limiter

A rate limiter for Tokio streams

v0.4.0 750 #rate-limiting #stream #element #delay #tokio #customization #constant
yffi

Bindings for the Yrs native C foreign function interface

v0.18.8 750 #yrs #crdt #c-ffi
decancer

removes common Unicode confusables/homoglyphs from strings

v3.2.0 750 #unicode #confusable #unicode-characters #homoglyphs #security #moderation #binary-search
tremor-kv

A Logstash-inspired key-value extractor

v0.6.2 700 #key-value #logstash #parser #kv #string #map #extractor
m_lexer

extensible regular-expression-based lexer

v0.0.4 1.5K #lexer #regex #extensible #expressions #regular
epub-builder

generating EPUB files

v0.7.4 750 #epub #generate #content #builder #default #xhtml #version
vader_sentiment

Bindings for Rust from the original Python VaderSentiment analysis tool

v0.1.1 750 bin+lib #analysis #sentiment #language #original #vader #bindings #tool
ellipse

Truncate and ellipse strings in a human-friendly way

v0.2.0 1.3K #string #truncate #human
capitalize

Change first character to upper case and the rest to lower case, and other common alternatives

v0.3.4 500 #case #change #string #title
regex-split

split_inclusive for the regex crate

v0.1.0 1.2K #regex #split #split-inclusive #substring #string #place
qpdf

Rust bindings to QPDF C++ library

v0.3.1 700 #pdf #safe-bindings #targets #encryption #tested #popular #legacy
crlify

A std::io::Write wrapper that replaces \n with \r\n on Windows

v1.0.3 650 #line-ending #io #ending #crlf #io-write #line #windows
stop-words

Common stop words in many languages

v0.8.0 650 #nlp #language #text #text-processing #languages #localization
tauri-plugin-clipboard

A clipboard plugin for Tauri that supports text, files and image, as well as clipboard update listening

v0.6.10 700 bin+lib #tauri-plugin #clipboard #text-image #update #monitor #framework #svelte
basen

Convert binary data to ASCII with a variety of supported bases

v0.1.0 1.3K #binary-data #convert-binary #ascii #variety #bases #base58 #base-16
rslint_errors

Pretty error reporting library based on codespan-reporting built for the RSLint project

v0.2.0 650 #error-reporting #javascript-linter #typescript #label #file #default #rs-lint
slugify-rs

generate slugs from strings

v0.0.3 750 #slug #slugify #macro
sanitise-file-name

An unusually flexible and efficient file name sanitiser

v1.0.0 1.3K no-std #filename #sanitizer #filesystem #sanitiser #nodejs
fiberplane-markdown

convert Fiberplane Notebooks to and from Markdown

v1.0.0-beta.14 700 #markdown #notebook #fiberplane #convert #transforming #data
inline_colorization

format!("Lets the user {color_red}colorize{color_reset} and {style_underline}style the output{style_reset} text using inline variables");

v0.1.6 650 #text #inline #text-color #variables #text-style #colorization #output
unic-ucd-block

UNIC — Unicode Character Database — Unicode Blocks

v0.9.0 850 #unicode #block #unicode-text #text
html2runes

An HTML to Text converter

v1.0.1 1.0K bin+lib #plain-text #html #converter #html-text #markdown #plaintext
unicode-jp

convert Japanese Half-width-kana[半角ｶﾅ] and Wide-alphanumeric[全角英数] into normal ones

v0.4.0 700 bin+lib #japanese #unicode #kana #hankaku #zenkaku
intuicio-data

Data module for Intuicio scripting platform

v0.31.6 700 #modular-scripting #intuicio #platform #scripting-language #solution #book
wkhtmltox-sys

FFI bindings to wkhtmltox

v0.1.2 1.3K #pdf #html #wkhtmltox #wkhtmltopdf #wkhtmltoimage
mandown

Markdown to groff (man page) converter

v0.1.3 850 bin+lib #manpage #markdown #convert-markdown #troff #groff #roff #manpages
apidoc-attr

Apidoc attr

v0.2.6 900 #attr #apidoc-attr #apidoc
smartcat

Putting a brain behind cat. CLI interface to bring language models into the Unix ecosystem 🐈‍⬛

v1.3.0 800 app #language-model #cat #ai #chatgpt #pipe #user-input #unix-command
notan_glyph

glyph support for Notan

v0.12.0 750 #notan #glyph #text #glyph-brush #renderer
tectonic_status_base

Basic types for reporting status messages to a user

v0.2.1 800 #tectonic #reporting #status #typesetting #messages #tex-engine #user
codegenrs

Moving code-gen out of build.rs

v3.0.1 1.0K #build #codegen #development #ci #moving #times #reducing
kakasi

Romanize hiragana, katakana and kanji (Japanese text)

v0.1.0 800 bin+lib #japanese #katakana #hiragana #kanji #rōmaji #alphabet #characters
asciifolding

ascii folding library

v0.1.0 1.1K #ascii #lucene #unicode #folding
null-terminated-str

FFI-friendly utf-8 string, enabling const null-terminated str and caching of the non-terminated string to avoid frequent allocation

v0.1.4 850 #string #c-str #ffi #compile-time
indented

Format data with indentation

v0.1.0 1.3K #indentation #indent #format
terminal-emoji

safely displaying emoji inside of terminals

v0.4.1 1.0K #emoji #terminals #displaying #safely
controlled-option

Custom Option type with explicit control over niches and memory layout

v0.4.1 850 #memory-layout #control #explicit #option #variant #niches #pattern
unic-idna-mapping

UNIC — IDNA — IDNA Mapping Table

v0.9.0 600 #unic #unicode #idna #character-property #unicode-text #text
posix-space

Pure Rust implementation of isspace for the POSIX locale

v1.0.4 850 no-std #posix #locale #space #isspace #no-alloc

