-
regex-syntax
A regular expression parser
-
regex-automata
Automata construction and matching using regular expressions
-
aho-corasick
Fast multiple substring searching
-
regex
regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
idna
IDNA (Internationalizing Domain Names in Applications) and Punycode
-
percent-encoding
Percent encoding and decoding
-
unicode-normalization
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
unicode-bidi
Unicode Bidirectional Algorithm
-
unicode-width
Determine displayed width of
char
andstr
types according to Unicode Standard Annex #11 rules -
textwrap
word wrapping, indenting, and dedenting strings. Has optional support for Unicode and emojis as well as machine hyphenation.
-
unicode-segmentation
Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules
-
convert_case
Convert strings into any case
-
unicode-xid
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
matchers
Regex matching on character and byte streams
-
ident_case
applying case rules to Rust identifiers
-
bstr
A string type that is not required to be valid UTF-8
-
encoding_rs
A Gecko-oriented implementation of the Encoding Standard
-
unicase
A case-insensitive wrapper around strings
-
indoc
Indented document literals
-
diff
An LCS based slice and string diffing implementation
-
unindent
Remove a column of leading whitespace from a string
-
ucd-trie
A trie for storing Unicode codepoint sets and maps
-
difflib
Port of Python's difflib library to Rust
-
fancy-regex
regexes, supporting a relatively rich set of features, including backreferences and look-around
-
ascii
ASCII-only equivalents to
char
,str
andString
-
finl_unicode
handling Unicode functionality for finl (categories and grapheme segmentation)
-
unicode_categories
Query Unicode category membership for chars
-
similar
A diff library for Rust
-
indenter
A formatter wrapper that indents the text, designed for error display impls
-
widestring
wide string Rust library for converting to and from wide strings, such as those often used in Windows API or other FFI libaries. Both
u16
andu32
string types are provided, including support for UTF-16 and UTF-32… -
const_format
Compile-time string formatting
-
pulldown-cmark
A pull parser for CommonMark
-
Inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
cesu8
Convert to and from CESU-8 encoding (similar to UTF-8)
-
regex-lite
A lightweight regex engine that optimizes for binary size and compilation time
-
utf-8
Incremental, zero-copy UTF-8 decoding with error handling
-
const_format_proc_macros
detail of the
const_format
crate -
deunicode
Convert Unicode strings to pure ASCII by intelligently transliterating them. Suppors Emoji and Chinese.
-
uncased
Case-preserving, ASCII case-insensitive, no_std string types
-
arrow-row
Arrow row format
-
unicode-id
Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31
-
gix-utils
gitoxide
utilities that don’t need feature toggles -
ascii-canvas
canvas for drawing lines and styled text and emitting to the terminal
-
shell-escape
Escape characters that may have a special meaning in a shell
-
unic-char-property
UNIC — Unicode Character Tools — Character Property taxonomy, contracts and build macros
-
slug
Convert a unicode string to a slug
-
compact_str
A memory efficient string type that transparently stores strings on the stack, when possible
-
unic-segment
UNIC — Unicode Text Segmentation Algorithms
-
pulldown-cmark-to-cmark
Convert pulldown-cmark Events back to the string they were parsed from
-
tendril
Compact buffer/string type for zero-copy parsing
-
onig_sys
onig_sys
crate contains raw rust bindings to the oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma… -
onig
Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.
-
tabled
An easy to use library for pretty print tables of Rust
struct
s andenum
s -
diffy
Tools for finding and manipulating differences between files
-
const-str
compile-time string operations
-
strip-ansi-escapes
Strip ANSI escape sequences from byte streams
-
encoding-index-tradchinese
Index tables for traditional Chinese character encodings
-
encoding-index-singlebyte
Index tables for various single-byte character encodings
-
encoding-index-japanese
Index tables for Japanese character encodings
-
encoding-index-korean
Index tables for Korean character encodings
-
encoding-index-simpchinese
Index tables for simplified Chinese character encodings
-
kstring
Key String: optimized for map keys
-
unicode-script
exposes the Unicode
Script
andScript_Extension
properties from UAX #24 -
levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
text-size
Newtypes for text offsets
-
lazy-regex
lazy static regular expressions checked at compile time
-
inflections
High performance inflection transformation library for changing properties of words like the case
-
ascii_utils
handle ASCII characters
-
regress
A regular expression engine targeting EcmaScript syntax
-
difference
text diffing and assertion library
-
indent_write
Write adapters to add line indentation
-
fuzzy-matcher
Fuzzy Matching Library
-
unicode-normalization-alignments
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
roff
ROFF (man page format) generation library
-
encoding
Character encoding support for Rust
-
newline-converter
Newline byte converter library
-
htmlescape
HTML entity encoding and decoding
-
rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
mdbook
Creates a book from markdown files
-
ucd-util
A small utility library for working with the Unicode character database
-
unescaper
Unescape strings with escape sequences written out as literal characters
-
tokenizers
today's most used tokenizers, with a focus on performances and versatility
-
unescape
Unescapes strings with escape sequences written out as literal characters
-
unicode-security
Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules
-
pretty
Wadler-style pretty-printing combinators in Rust
-
unicode-ccc
Unicode Canonical Combining Class detection
-
substring
method for string types
-
unic-ucd-ident
UNIC — Unicode Character Database — Identifier Properties
-
prettydiff
Side-by-side diff for two files
-
unicode-bidi-mirroring
Unicode Bidi Mirroring property detection
-
cruet
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
ammonia
HTML Sanitization
-
pulldown-cmark-escape
An escape library for HTML created in the pulldown-cmark project
-
utf16_lit
macro_rules to make utf-16 literals
-
any_ascii
Unicode to ASCII transliteration
-
utf8_iter
Iterator by char over potentially-invalid UTF-8 in &[u8]
-
yeslogic-fontconfig-sys
Raw bindings to Fontconfig without a vendored C library
-
grep-searcher
Fast line oriented regex searching as a library
-
pad
padding strings at runtime
-
glyph_brush_layout
Text layout for ab_glyph
-
utf16_iter
Iterator by char over potentially-invalid UTF-16 in &[u16]
-
case
A set of letter case string helpers
-
indent
Functions for indenting multiline strings
-
tabwriter
Elastic tabstops
-
byteyarn
hyper-compact strings
-
charset
Thunderbird-compatible character encoding decoding for email
-
punycode
Functions to decode and encode Punycode
-
lexical-sort
Sort Unicode strings lexically
-
unidecode
pure ASCII transliterations of Unicode strings
-
dwrote
Lightweight binding to DirectWrite
-
font-types
Scalar types used in fonts
-
write16
A UTF-16 analog of the Write trait
-
str_indices
Count and convert between indexing schemes on string slices
-
utf16string
String types to work directly with UTF-16 encoded strings
-
linkify
Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.
-
termimad
Markdown Renderer for the Terminal
-
codepage
Mapping between Windows code page numbers and encoding_rs character encodings
-
wezterm-bidi
The Unicode Bidi Algorithm (UBA)
-
stfu8
Sorta Text Format in UTF-8
-
entities
raw data needed to convert to and from HTML entities
-
comrak
A 100% CommonMark-compatible GitHub Flavored Markdown parser and formatter
-
unicode-reverse
Unicode-aware in-place string reversal
-
lopdf
PDF document manipulation
-
os_display
Display strings in a safe platform-appropriate way
-
text_io
really simple to use panicking input functions
-
text_lines
Information about lines of text in a string
-
unicode-vo
Unicode vertical orientation detection
-
pcre2
High level wrapper library for PCRE2
-
unicode_names2
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
harfbuzz-sys
Rust bindings to the HarfBuzz text shaping engine
-
line-index
Maps flat
TextSize
offsets to/from(line, column)
representation -
unicode-general-category
Fast lookup of the Unicode General Category property for char
-
str_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
ropey
A fast and robust text rope for Rust
-
byte_string
Wrapper types for outputting byte strings (b"Hello") using the Debug ({:?}) format
-
grep
Fast line oriented regex searching as a library
-
shell2batch
Coverts simple basic shell scripts to windows batch scripts
-
ngrams
Generate n-grams from sequences
-
swrite
Infallible alternatives to write! and writeln! for Strings
-
titlecase
Capitalize text according to a style defined by John Gruber for Daring Fireball
-
markdown-gen
generating Markdown files
-
chardetng
A character encoding detector for legacy Web content
-
uwl
A management stream for bytes and characters
-
man
Generate structured man pages
-
emojis
✨ Lookup emoji in *O(1)* time, access metadata and GitHub shortcodes, iterate over all emoji, and more!
-
snailquote
Escape and unescape strings with shell-inspired quoting
-
harfbuzz
Rust bindings to the HarfBuzz text shaping engine
-
jieba-rs
The Jieba Chinese Word Segmentation Implemented in Rust
-
sliceslice
A fast implementation of single-pattern substring search using SIMD acceleration
-
ucd-parse
parsing data files in the Unicode character database
-
lindera-decompress
A morphological analysis library
-
sublime_fuzzy
Fuzzy matching algorithm based on Sublime Text's string search
-
lindera-ipadic-builder
A Japanese morphological dictionary builder for IPADIC
-
lindera-dictionary
A Japanese morphological dictionary
-
select
extract useful data from HTML documents, suitable for web scraping
-
lindera-cc-cedict-builder
A Chinese morphological dictionary builder for CC-CEDICT
-
lindera-ko-dic-builder
A Korean morphological dictionary builder for ko-dic
-
lindera-unidic-builder
A Japanese morphological dictionary builder for UniDic
-
ripgrep
line-oriented search tool that recursively searches the current directory for a regex pattern while respecting gitignore rules. ripgrep has first class support on Windows, macOS and Linux.
-
const-str-proc-macro
compile-time string operations
-
chardet
rust version of chardet
-
ansi-to-tui
convert ansi color coded text into ratatui::text::Text type from ratatui library
-
lindera-ipadic-neologd-builder
A Japanese morphological dictionary builder for IPADIC NEologd
-
cow-utils
Copy-on-write string utilities for Rust
-
unified-diff
GNU unified diff format
-
pcre2-sys
Low level bindings to PCRE2
-
detone
Decompose Vietnamese tone marks
-
jetscii
A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes
-
filecheck
writing tests for utilities that read text files and produce text output
-
unicode-case-mapping
Fast lowercase, uppercase, and titlecase mapping for characters
-
flexstr
A flexible, simple to use, immutable, clone-efficient
String
replacement for Rust -
html2text
Render HTML as plain text
-
sanitizer
A collection of methods and macros to sanitize struct fields
-
unicode-truncate
Unicode-aware algorithm to pad or truncate
str
in terms of displayed width -
stringmatch
Allow the use of regular expressions or strings wherever you need string comparison
-
charabia
detect the language, tokenize the text and normalize the tokens
-
uuhelp_parser
A collection of functions to parse the markdown code of help files
-
lowcharts
draw low-resolution graphs in terminal
-
xlsxwriter
Write xlsx file with number, formula, string, formatting, autofilter, merged cells, data validation and more
-
bk-tree
A Rust BK-tree implementation
-
lindera-core
A morphological analysis library
-
escape-bytes
Escapes bytes that are not printable ASCII characters
-
pretty-xmlish
Pretty print XML-ish data with unicode art
-
grok
popular java & ruby grok library which allows easy text and log file processing with composable patterns
-
garde
Validation library
-
lindera-tokenizer
A morphological analysis library
-
lindera-compress
A morphological analysis library
-
lindera-ko-dic
A Japanese morphological dictionary for ko-dic
-
cedarwood
efficiently-updatable double-array trie in Rust (ported from cedar)
-
wchar
Procedural macros for compile time UTF-16 and UTF-32 wide strings
-
slugify
Macro for flexible slug generation
-
array_tool
Helper methods for processing collections
-
lexicmp
comparing and sorting strings lexicographically and naturally
-
minify-html-common
Common code and data for minify-html*
-
lipsum
lorem ipsum text generation library. It generates pseudo-random Latin text. Use this if you need filler or dummy text for your application. The text is generated using a simple Markov chain…
-
hyphenation
Knuth-Liang hyphenation for a variety of languages
-
ferris-says
flavored replacement for the classic cowsay
-
hyperscan
bindings for Rust with Multiple Pattern and Streaming Scan
-
cuid
An ipmlementation of CUID protocol in rust
-
commonregex
Rust port for CommonRegex. Find all times, dates, links, phone numbers, emails, ip addresses, prices, hex colors, and credit card numbers in a string. We did the hard work so you don't have to.
-
rutie
The tie between Ruby and Rust
-
print-positions
providing string segmentation on grapheme clusters and ANSI escape sequences for accurate length arithmetic based on visible print positions
-
unicode-blocks
contains a list of all unicode blocks and provides some functions to search across them
-
svgbobdoc
Renders ASCII diagrams in doc comments as SVG images
-
glob-match
An extremely fast glob matcher
-
ucd
Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…
-
suffix
arrays
-
fm
Non-backtracking fuzzy text matcher
-
hyperscan-sys
Hyperscan bindings for Rust with Multiple Pattern and Streaming Scan
-
textwrap-macros
procedural macros to use textwrap utilities at compile time
-
gh-emoji
Convert
:emoji:
to Unicode using GitHub’s emoji names -
printpdf
writing PDF files
-
caseless
Unicode caseless matching
-
nucleo-matcher
plug and play high performance fuzzy matcher
-
nu-utils
Nushell utility functions
-
lindera-unidic
A Japanese morphological dictionary for UniDic
-
unicode-casing
Titlecase helper function on characters
-
sd
An intuitive find & replace CLI
-
ansi-width
Calculate the width of a string when printed to the terminal
-
synoptic
low-level, syntax highlighting library with unicode support
-
lexis
Generates human-readable sequences from numeric values using a predefined word list
-
bwrap
A fast, lightweight, embedded systems-friendly library for wrapping text
-
console_static_text
Logging for text that should stay in the same place in a console
-
text-diff
text diffing and assertion library
-
pager
pipe your output through an external pager
-
precis-profiles
PRECIS Framework: Preparation, Enforcement, and Comparison of Internationalized Strings Representing Usernames and Passwords as defined in rfc8265; and Nicknames as defined in rfc8266
-
precis-tools
Tools and parsers to generate PRECIS tables from the Unicode Character Database (UCD)
-
imperative
Check for imperative mood in text
-
text-splitter
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
-
hyphenation_commons
Proemial code for the
hyphenation
library -
svgbob
Transform your ascii diagrams into happy little SVG
-
lindera-ipadic
A Japanese morphological dictionary for IPADIC
-
terminal-supports-emoji
Check whether the current terminal supports emoji
-
target_info
Get text strings of attributes concernign the build target
-
ra_ap_test_utils
TBD
-
unic-ucd-age
UNIC — Unicode Character Database — Age
-
typos-cli
Source Code Spelling Correction
-
mdbook-linkcheck
A backend for
mdbook
which will check your links for you -
pluralizer
Rust package to pluralize or singularize any word based on a count inspired on pluralize NPM package
-
fax
Decoder and Encoder for CCITT Group 3 and 4 bi-level image encodings used by fax machines TIFF and PDF
-
tracing-texray
Tracing layer to view a plaintext timeline of spans and events
-
prop-check-rs
A Property-based testing Library in Rust
-
simple-logging
logger for the log facade
-
ascii_tree
generates ascii trees
-
mdbook-pandoc
A mdbook backend that outsources most of the rendering process to pandoc
-
mdbook-svgbob
SvgBob mdbook preprocessor which swaps code-blocks with neat SVG
-
ra_ap_ide_ssr
Structural search and replace of Rust code
-
textnonce
Text based random nonce generator
-
evcxr
An Evaluation Context for Rust
-
mdbook-mermaid
mdbook preprocessor to add mermaid support
-
encoding_c_mem
C API for encoding_rs::mem
-
adobe-cmap-parser
parse Adobe CMap files
-
typos-dict
Source Code Spelling Correction
-
unic-bidi
UNIC — Unicode Bidirectional Algorithm
-
pdf-extract
extract content from pdfs
-
trigram
Trigram-based string similarity for fuzzy matching
-
scanlex
lexical scanner for parsing text into tokens
-
regex-cursor
regex fork that can search discontiguous haystacks
-
mdbook-preprocessor-boilerplate
Boilerplate code for mdbook preprocessors
-
textdistance
Lots of algorithms to compare how similar two sequences are
-
genpdf
User-friendly PDF generator written in pure Rust
-
compact_bytes
A memory efficient bytes container that transparently stores bytes on the stack, when possible
-
varcon-core
Varcon-relevant data structures
-
harfbuzz-traits
Rust Traits for the HarfBuzz text shaping engine
-
utf8-cstr
Type wrappers promising null termination and utf-8 validity. The intersection of
std::ffi::CStr
andstr
-
rustyline-async
A minimal readline with multiline and async support
-
quoted-string-parser
Quoted string parser for grammar defined in RFC3261
-
lingua-english-language-model
The English language model for Lingua, an accurate natural language detection library
-
qp-trie
An idiomatic and fast QP-trie implementation in pure Rust, written with an emphasis on safety
-
regex_mutator
The Nautilus regex_mutator
-
typos
Source Code Spelling Correction
-
typos-vars
Source Code Spelling Correction
-
hypher
separates words into syllables
-
sre-engine
A low-level implementation of Python's SRE regex engine
-
re_ui
Rerun GUI theme and helpers, built around egui
-
pact_matching
Pact-Rust support library that implements request and response matching logic
-
grep-pcre2
Use PCRE2 with the 'grep' crate
-
reword
some utility functions for human-readable formatting of words
-
dictgen
Compile-time case-insensitive map
-
lindera-cc-cedict
A Japanese morphological dictionary for CC-CEDICT
-
mdbook-toc
mdbook preprocessor to add Table of Contents
-
ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched
-
lingua-german-language-model
The German language model for Lingua, an accurate natural language detection library
-
text_trees
textual output for tree-like structures
-
mdxjs
Compile MDX to JavaScript in Rust
-
utfx
-
vaporetto
pointwise prediction based tokenizer
-
lindera
A morphological analysis library
-
uwuify
fastest text uwuifier in the west
-
keyvalues-parser
A parser/renderer for vdf text
-
easy_reader
easily navigating forward, backward or randomly through the lines of huge files
-
esl01-renderdag
Render a graph into ASCII or Unicode text
-
re_space_view_text_document
space view that shows a single text box
-
line-span
Find line ranges and jump between next and previous lines
-
svgbob_cli
Transform your ascii diagrams into happy little SVG
-
lingua-french-language-model
The French language model for Lingua, an accurate natural language detection library
-
lingua-spanish-language-model
The Spanish language model for Lingua, an accurate natural language detection library
-
harfbuzz_rs
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
frida-build
Rust bindings for Frida
-
wana_kana
checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji
-
sanitize-filename-reader-friendly
A filename sanitizer aiming to produce reader friendly filenames
-
sedregex
Sed-like regex library
-
file-size
a function formatting file sizes in 4 chars
-
fuzzt
Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.
-
hunspell-rs
Rust bindings to the Hunspell library
-
mini_paste
Fast-to-compile equivalent to
::paste
-
hyper-old-types
HTTP types from hyper 0.11.x
-
unicode-canonical-combining-class
Fast lookup of the Canonical Combining Class property
-
rapidfuzz
rapid fuzzy string matching library
-
tiny-gradient
Make your string colored in gradient
-
textcode
Text encoding/decoding library. Supports: UTF-8, ISO6937, ISO8859, GB2312
-
mdbook-graphviz
mdbook preprocessor to add graphviz support
-
pest_ascii_tree
Helper crates converting the parsing result of any pest grammar into an ascii tree
-
pinot
Fast, high-fidelity OpenType parser
-
unicode-joining-type
Fast lookup of the Unicode Joining Type and Joining Group properties
-
glyph-names
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
lingua-chinese-language-model
The Chinese language model for Lingua, an accurate natural language detection library
-
strings
String utilities, including an unbalanced Rope
-
cargo-spellcheck
Checks all doc comments for spelling mistakes
-
clippy_lints
A bunch of helpful lints to avoid common pitfalls in Rust
-
hunspell-sys
Bindings to the hunspell C API
-
line-numbers
Find line numbers in strings by byte offsets, quickly
-
lingua-japanese-language-model
The Japanese language model for Lingua, an accurate natural language detection library
-
encoding8
various 8-bit encodings
-
linkcheck
extracting and validating links
-
lingua-portuguese-language-model
The Portuguese language model for Lingua, an accurate natural language detection library
-
lingua-latvian-language-model
The Latvian language model for Lingua, an accurate natural language detection library
-
lingua-russian-language-model
The Russian language model for Lingua, an accurate natural language detection library
-
mdbook-admonish
A preprocessor for mdbook to add Material Design admonishments
-
lingua-italian-language-model
The Italian language model for Lingua, an accurate natural language detection library
-
lingua-turkish-language-model
The Turkish language model for Lingua, an accurate natural language detection library
-
lingua-arabic-language-model
The Arabic language model for Lingua, an accurate natural language detection library
-
lingua-hindi-language-model
The Hindi language model for Lingua, an accurate natural language detection library
-
secular
No Diacr!
-
lingua-thai-language-model
The Thai language model for Lingua, an accurate natural language detection library
-
lingua-vietnamese-language-model
The Vietnamese language model for Lingua, an accurate natural language detection library
-
pandoc
API that wraps calls to the pandoc 2.x executable
-
pulldown-cmark-mdcat
Render pulldown-cmark events to TTY
-
lingua-ukrainian-language-model
The Ukrainian language model for Lingua, an accurate natural language detection library
-
regex_generate
Use regular expressions to generate text
-
mdbook-katex
mdBook preprocessor rendering LaTeX equations to HTML
-
lingua-korean-language-model
The Korean language model for Lingua, an accurate natural language detection library
-
lingua-bokmal-language-model
The Bokmal language model for Lingua, an accurate natural language detection library
-
lingua-persian-language-model
The Persian language model for Lingua, an accurate natural language detection library
-
lingua-nynorsk-language-model
The Nynorsk language model for Lingua, an accurate natural language detection library
-
lingua-malay-language-model
The Malay language model for Lingua, an accurate natural language detection library
-
text_unit
Newtypes for text offsets
-
crop
A pretty fast text rope
-
readability
Port of arc90's readability project to rust
-
egui-dropdown
An actual dropdown list for egui
-
lingua-indonesian-language-model
The Indonesian language model for Lingua, an accurate natural language detection library
-
lingua-polish-language-model
The Polish language model for Lingua, an accurate natural language detection library
-
regex-macro
A macro to generate a lazy regex expression
-
lingua-mongolian-language-model
The Mongolian language model for Lingua, an accurate natural language detection library
-
sourceannot
render snippets of source code with annotations
-
lingua-dutch-language-model
The Dutch language model for Lingua, an accurate natural language detection library
-
no-comment
Remove rust-style line and block comments from a char iterator
-
lingua-lithuanian-language-model
The Lithuanian language model for Lingua, an accurate natural language detection library
-
lingua-hungarian-language-model
The Hungarian language model for Lingua, an accurate natural language detection library
-
lingua-danish-language-model
The Danish language model for Lingua, an accurate natural language detection library
-
lingua-slovak-language-model
The Slovak language model for Lingua, an accurate natural language detection library
-
lingua-romanian-language-model
The Romanian language model for Lingua, an accurate natural language detection library
-
lingua-swedish-language-model
The Swedish language model for Lingua, an accurate natural language detection library
-
lingua-finnish-language-model
The Finnish language model for Lingua, an accurate natural language detection library
-
lingua-greek-language-model
The Modern Greek language model for Lingua, an accurate natural language detection library
-
lingua-estonian-language-model
The Estonian language model for Lingua, an accurate natural language detection library
-
lingua-bulgarian-language-model
The Bulgarian language model for Lingua, an accurate natural language detection library
-
lingua-armenian-language-model
The Armenian language model for Lingua, an accurate natural language detection library
-
lingua-icelandic-language-model
The Icelandic language model for Lingua, an accurate natural language detection library
-
lingua-hebrew-language-model
The Hebrew language model for Lingua, an accurate natural language detection library
-
lingua-catalan-language-model
The Catalan language model for Lingua, an accurate natural language detection library
-
lingua-belarusian-language-model
The Belarusian language model for Lingua, an accurate natural language detection library
-
lingua-esperanto-language-model
The Esperanto language model for Lingua, an accurate natural language detection library
-
lingua-croatian-language-model
The Croatian language model for Lingua, an accurate natural language detection library
-
lingua-serbian-language-model
The Serbian language model for Lingua, an accurate natural language detection library
-
lingua-ganda-language-model
The Ganda language model for Lingua, an accurate natural language detection library
-
lingua-macedonian-language-model
The Macedonian language model for Lingua, an accurate natural language detection library
-
lingua-georgian-language-model
The Georgian language model for Lingua, an accurate natural language detection library
-
lingua-bosnian-language-model
The Bosnian language model for Lingua, an accurate natural language detection library
-
lingua-urdu-language-model
The Urdu language model for Lingua, an accurate natural language detection library
-
lingua-albanian-language-model
The Albanian language model for Lingua, an accurate natural language detection library
-
lingua-basque-language-model
The Basque language model for Lingua, an accurate natural language detection library
-
lingua-maori-language-model
The Māori language model for Lingua, an accurate natural language detection library
-
lingua-marathi-language-model
The Marathi language model for Lingua, an accurate natural language detection library
-
lingua-afrikaans-language-model
The Afrikaans language model for Lingua, an accurate natural language detection library
-
lingua-bengali-language-model
The Bengali language model for Lingua, an accurate natural language detection library
-
lingua-tsonga-language-model
The Tsonga language model for Lingua, an accurate natural language detection library
-
lingua-somali-language-model
The Somali language model for Lingua, an accurate natural language detection library
-
lingua-gujarati-language-model
The Gujarati language model for Lingua, an accurate natural language detection library
-
lingua-shona-language-model
The Shona language model for Lingua, an accurate natural language detection library
-
lingua-latin-language-model
The Latin language model for Lingua, an accurate natural language detection library
-
lingua-tagalog-language-model
The Tagalog language model for Lingua, an accurate natural language detection library
-
lingua-xhosa-language-model
The Xhosa language model for Lingua, an accurate natural language detection library
-
lingua-tswana-language-model
The Tswana language model for Lingua, an accurate natural language detection library
-
lingua-zulu-language-model
The Zulu language model for Lingua, an accurate natural language detection library
-
lingua-yoruba-language-model
The Yoruba language model for Lingua, an accurate natural language detection library
-
neo-mime
Strongly Typed Mimes
-
rasciigraph
function to plot ascii graphs
-
lingua-czech-language-model
The Czech language model for Lingua, an accurate natural language detection library
-
lingua-slovene-language-model
The Slovene language model for Lingua, an accurate natural language detection library
-
lingua-azerbaijani-language-model
The Azerbaijani language model for Lingua, an accurate natural language detection library
-
lingua-kazakh-language-model
The Kazakh language model for Lingua, an accurate natural language detection library
-
srx
A mostly compliant Rust implementation of the Segmentation Rules eXchange (SRX) 2.0 standard for text segmentation
-
lingua-irish-language-model
The Irish language model for Lingua, an accurate natural language detection library
-
unicode_reader
Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters
-
lingua-tamil-language-model
The Tamil language model for Lingua, an accurate natural language detection library
-
lingua-sotho-language-model
The Sotho language model for Lingua, an accurate natural language detection library
-
lingua-telugu-language-model
The Telugu language model for Lingua, an accurate natural language detection library
-
censor
text profanity filter
-
lingua-welsh-language-model
The Welsh language model for Lingua, an accurate natural language detection library
-
lingua-punjabi-language-model
The Punjabi language model for Lingua, an accurate natural language detection library
-
lingua-swahili-language-model
The Swahili language model for Lingua, an accurate natural language detection library
-
savvy
R extension interface
-
tauri-plugin-clipboard
A clipboard plugin for Tauri that supports text, files and image, as well as clipboard update listening
-
wkhtmltopdf
High-level bindings to wkhtmltopdf
-
xsv
A high performance CSV command line toolkit
-
float-pretty-print
Format f64 for showing to user, not for serialisation
-
lindera-analyzer
A morphological analysis library
-
soup
Inspired by the python library BeautifulSoup, this is a layer on top of html5ever that adds a different API for querying and manipulating HTML
-
xml2json-rs
converting to and from XML/JSON
-
marker
finding issues in CommonMark documents
-
atelier_test
Test and example models used within the other Atelier crates
-
pdf_composer
Create PDF documents from YAML Front Matter source documents
-
fast2s
A fast Traditional Chinese to Simplified Chinese conversion library. Built with FST, faster than most of other libraries.
-
text-colorizer
Transitionary package
-
inline_colorization
format!("Lets the user {color_red}colorize{color_reset} and {style_underline}style the output{style_reset} text using inline variables");
-
inkjet
A batteries-included syntax highlighting library for Rust, based on tree-sitter
-
opml
OPML library for Rust
-
confusables
around Unicode confusables/homoglyphs
-
terminal-clipboard
a minimal cross-platform clipboard
-
simple_excel_writer
Excel Writer
-
simple-string-patterns
Makes it easier to match, split and extract strings in Rust without regular expressions. The parallel string-patterns crate provides extensions to work with regular expressions via the Regex library
-
stringzilla
Faster SIMD-accelerated string search, sorting, fingerprints, and edit distances
-
words-count
Count the words and characters, with or without whitespaces
-
lean-sys
Bindings to Lean 4's C API
-
crlify
A std::io::Write wrapper that replaces with on Windows
-
smartcat
Putting a brain behind
cat
. CLI interface to bring language models in the Unix ecosystem 🐈⬛ -
fontconfig
Safe, higher-level wrapper around the Fontconfig library
-
stream-rate-limiter
A rate limiter for Tokio streams
-
vaporetto_rules
Rule-base filters for Vaporetto
-
lindera-filter
Character and token filters for Lindera
-
m_lexer
extensible regular expressions based lexer
-
local-encoding
encoding/decoding string with local charset. It usefull for work with ANSI strings on Windows.
-
decancer
that removes common unicode confusables/homoglyphs from strings
-
regex-split
split_inclusive for the regex crate
-
rustpython-sre_engine
A low-level implementation of Python's SRE regex engine
-
asciifolding
ascii folding library
-
codegenrs
Moving code-gen our of build.rs
-
focaccia
no_std implementation of Unicode case folding comparisons
-
qpdf
Rust bindings to QPDF C++ library
-
asciidork-ast
Asciidork AST nodes
-
tremor-kv
A logstash inspured key value extractor
-
capitalize
Change first character to upper case and the rest to lower case, and other common alternatives
-
epub-builder
generating EPUB files
-
numeric-sort
A zero-allocation, human-readable sorting library
-
char_reader
Safely read wild streams as chars or lines
-
vader_sentiment
Bindings for Rust from the original Python VaderSentiment analysis tool
-
tectonic_io_base
Basic types for Tectonic's pluggable I/O backend system
-
asciidork-parser
Asciidork parser
-
create_broken_files
Create broken files from other ones
-
symspell
Spelling correction & Fuzzy search
-
fuzzywuzzy
A pure-Rust clone of the incredibly useful fuzzy string matching python package, FuzzyWuzzy
-
bashtestmd
Compiles shell commands in .md files into Bash scripts for testing
-
ellipse
Truncate and ellipse strings in a human-friendly way
-
xmldecl
Extracts an encoding from an ASCII-based bogo-XML declaration in text/html in a Web-compatible way
-
wkhtmltox-sys
FFI bindings to wkhtmltox
-
mdbook-cmdrun
mdbook preprocessor to run arbitrary commands
-
safe-string
safe interface for interacting with multi-byte strings in Rust, namely IndexedStr, IndexedString, and IndexedSlice
-
stop-words
Common stop words in many languages
-
vibrato
viterbi-based accelerated tokenizer
-
mdbook-pagetoc
A mdbook plugin that provides a table of contents for each page
-
apidoc-attr
Apidoc attr
-
str-utils
some traits to extend types which implement
AsRef<[u8]>
orAsRef<str>
-
promkit
A toolkit for building your own interactive command-line tools
-
intuicio-data
Data module for Intuicio scripting platform
-
html2runes
An HTML to Text converter
-
asciidork-backend
Asciidork backend
-
unic-ucd-block
UNIC — Unicode Character Database — Unicode Blocks
-
tectonic_status_base
Basic types for reporting status messages to a user
-
null-terminated-str
FFI-friendly utf-8 string, enabling const null-terminated str and caching of the non-terminated string to avoid frequent allocation
-
unic-idna-mapping
UNIC — IDNA — IDNA Mapping Table
-
metatensor-sys
Bindings to the metatensor C library
-
doc-chunks
Clusters of doc comments and dev comments as coherent view
-
tectonic_bridge_flate
Exposing flate compression to the Tectonic C code
-
qpdf-sys
Rust bindings to QPDF C++ library via FFI and bindgen
-
tectonic_bridge_icu
Expose a subset of the ICU Unicode APIs to Rust/Cargo
-
diacritics
Remove diacritics from letters, for example when standardizing input for a search
-
md-tui
A terminal markdown viewer
-
tectonic_bridge_harfbuzz
Expose the Harfbuzz C/C++ APIs to Rust/Cargo
-
tectonic_bridge_graphite2
Expose the graphite2 library to Rust/Cargo
-
terminal-emoji
safely displaying emoji inside of terminals
-
sanitise-file-name
An unusually flexible and efficient file name sanitiser
-
unic-idna
UNIC — Unicode IDNA Compatibility Processing
-
basen
Convert binary data to ASCII with a variety of supported bases
-
utf8_slice
Lightweight UTF8 Slice Utilities
-
shingles
in rust
-
indented
Format data with indentation
-
tectonic_bridge_core
Exposing core backend APIs to the Tectonic C/C++ code
-
seal
Needleman-Wunsch & Smith-Waterman sequence alignment
-
tibco_ems
A high level API for the Tibco EMS