-
regex
regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
textwrap
word wrapping, indenting, and dedenting strings. Has optional support for Unicode and emojis as well as machine hyphenation.
-
unicode-segmentation
Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules
-
fancy-regex
regexes, supporting a relatively rich set of features, including backreferences and look-around
-
similar
A diff library for Rust
-
const_format
Compile-time string formatting
-
ascii
ASCII-only equivalents to
char
,str
andString
-
unicode-xid
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
tabled
An easy to use library for pretty print tables of Rust
struct
s andenum
s -
pulldown-cmark
A pull parser for CommonMark
-
unicase
A case-insensitive wrapper around strings
-
tokenizers
today's most used tokenizers, with a focus on performances and versatility
-
lazy-regex
lazy static regular expressions checked at compile time
-
Inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
convert_case
Convert strings into any case
-
comrak
A 100% CommonMark-compatible GitHub Flavored Markdown parser and formatter
-
onig
Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.
-
deunicode
Convert Unicode strings to pure ASCII by intelligently transliterating them. Suppors Emoji and Chinese.
-
mdbook
Creates a book from markdown files
-
unicode-width
Determine displayed width of
char
andstr
types according to Unicode Standard Annex #11 rules -
termimad
Markdown Renderer for the Terminal
-
lopdf
PDF document manipulation
-
diff
An LCS based slice and string diffing implementation
-
widestring
wide string Rust library for converting to and from wide strings, such as those often used in Windows API or other FFI libaries. Both
u16
andu32
string types are provided, including support for UTF-16 and UTF-32… -
rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
fuzzy-matcher
Fuzzy Matching Library
-
regress
A regular expression engine targeting EcmaScript syntax
-
sd
An intuitive find & replace CLI
-
html2text
Render HTML as plain text
-
printpdf
writing PDF files
-
chardetng
A character encoding detector for legacy Web content
-
mdxjs
Compile MDX to JavaScript in Rust
-
unicode-script
exposes the Unicode
Script
andScript_Extension
properties from UAX #24 -
comfy-table
An easy to use library for building beautiful tables with automatic content wrapping
-
emojis
✨ Lookup emoji in *O(1)* time, access metadata and GitHub shortcodes, iterate over all emoji, and more!
-
prettydiff
Side-by-side diff for two files
-
shell-escape
Escape characters that may have a special meaning in a shell
-
regex-syntax
A regular expression parser
-
lngcnv
linguistics: display pronunciation, translate between dialects, convert between orthographies; support for multiple languages: English, Latin, Polish, Quechua, Spanish, Tikuna
-
const-str
compile-time string operations
-
linkify
Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.
-
titlecase
that capitalizes text according to a style defined by John Gruber for post titles on his website Daring Fireball
-
decancer
A tiny package that removes common unicode confusables/homoglyphs from strings
-
unicode-normalization
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
lipsum
lorem ipsum text generation library. It generates pseudo-random Latin text. Use this if you need filler or dummy text for your application. The text is generated using a simple Markov chain…
-
textdistance
Lots of algorithms to compare how similar two sequences are
-
garde
Validation library
-
synoptic
low-level, syntax highlighting library with unicode support
-
unicode-bidi
Unicode Bidirectional Algorithm
-
mdbook-pdf
A backend for mdBook written in Rust for generating PDF based on headless chrome and Chrome DevTools Protocol
-
mdbook-katex
mdBook preprocessor rendering LaTeX equations to HTML
-
line-index
Maps flat
TextSize
offsets to/from(line, column)
representation -
slug
Convert a unicode string to a slug
-
unicode-id
Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31
-
unicode_names2
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
str_indices
Count and convert between indexing schemes on string slices
-
charabia
detect the language, tokenize the text and normalize the tokens
-
md-tui
A terminal markdown viewer
-
create_broken_files
Create broken files from other ones
-
stam
powerful library for dealing with stand-off annotations on text
-
epub-builder
generating EPUB files
-
pad
padding strings at runtime
-
languagetool-rust
LanguageTool API bindings in Rust
-
focaccia
no_std implementation of Unicode case folding comparisons
-
autocorrect
A linter and formatter for help you improve copywriting, to correct spaces, words, punctuations between CJK (Chinese, Japanese, Korean)
-
chamkho
Khmer, Lao, Myanmar, and Thai word segmentation/breaking library and command line
-
cruet
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
mdbook-admonish
A preprocessor for mdbook to add Material Design admonishments
-
entities
raw data needed to convert to and from HTML entities
-
rphonetic
Rust port of phonetic Apache commons-codec algorithms
-
text-size
Newtypes for text offsets
-
unindent
Remove a column of leading whitespace from a string
-
cuid
An ipmlementation of CUID protocol in rust
-
vesti
A preprocessor that compiles into LaTeX
-
substudy
Language-learning tools for working with parallel, bilingual subtitles and media files
-
hgrep
grep tool with human-friendly search output. This is similar to
-C
option ofgrep
command, but its output is enhanced with syntax highlighting focusing on human readable outputs. -
uncased
Case-preserving, ASCII case-insensitive, no_std string types
-
jieba-rs
The Jieba Chinese Word Segmentation Implemented in Rust
-
text-splitter
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens (when used with large language models).
-
uuhelp_parser
A collection of functions to parse the markdown code of help files
-
text_io
really simple to use panicking input functions
-
indenter
A formatter wrapper that indents the text, designed for error display impls
-
mktoc
Generate Table of Contents from Markdown files
-
pulldown-cmark-to-cmark
Convert pulldown-cmark Events back to the string they were parsed from
-
probly-search
A lightweight full-text search engine with a fully customizable scoring function
-
select
extract useful data from HTML documents, suitable for web scraping
-
roff
ROFF (man page format) generation library
-
unescaper
Unescape strings with escape sequences written out as literal characters
-
cow-utils
Copy-on-write string utilities for Rust
-
any_ascii
Unicode to ASCII transliteration
-
creature_feature
Composable n-gram combinators that are ergonomic and bare-metal fast
-
font-types
Scalar types used in fonts
-
norad
Read and write Unified Font Object files
-
pager
pipe your output through an external pager
-
clipcount
Counting words from the clipboard content
-
line-span
Find line ranges and jump between next and previous lines
-
bwrap
A fast, lightweight, embedded systems-friendly library for wrapping text
-
igrepper
The interactive grepper
-
pandoc
API that wraps calls to the pandoc 2.x executable
-
difflib
Port of Python's difflib library to Rust
-
glyph_brush_layout
Text layout for ab_glyph
-
fontconfig
Safe, higher-level wrapper around the Fontconfig library
-
aki-resort
sort lines of text. You can use regex to specify the KEY.
-
hyphenation
Knuth-Liang hyphenation for a variety of languages
-
byteyarn
hyper-compact strings
-
hypher
separates words into syllables
-
jayce
tokenizer 🌌
-
cargo-spellcheck
Checks all doc comments for spelling mistakes
-
percent-encoding
Percent encoding and decoding
-
rapidfuzz
rapid fuzzy string matching library
-
boreal
evaluate YARA rules, used to scan bytes for textual and binary pattern
-
llmvm-core
The core application for llmvm
-
ra_ap_test_utils
TBD
-
wana_kana
checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji
-
opml
OPML library for Rust
-
vmks-exam-generator
CLI program for pseudo-randomly generating different variants of an embedded programming exam
-
mlc
The markup link checker (mlc) checks for broken links in markup files
-
rustyline-async
A minimal readline with multiline and async support
-
substring
method for string types
-
r4d
Text oriented macro processor
-
uclanr
A random word picker that gives you actually useful words
-
simple-string-patterns
Makes it easier to match, split and extract strings in Rust without regular expressions. The parallel string-patterns crate provides extensions to work with regular expressions via the Regex library
-
rascii_art
Advanced ASCII Art Generator
-
shell2batch
Coverts simple basic shell scripts to windows batch scripts
-
lindera-cli
A morphological analysis command line interface
-
utf8_iter
Iterator by char over potentially-invalid UTF-8 in &[u8]
-
frawk
an efficient Awk-like language
-
unicode_titlecase
add Unicode titlecase and Turkish and Azeri locale upper/lowercase utilities to chars and strings
-
svgbob
Transform your ascii diagrams into happy little SVG
-
tengwar
Transliterate text into J.R.R. Tolkien's Tengwar.
-
mdbook-epub
An EPUB renderer for mdbook
-
mdbook-catppuccin
🎊 Soothing pastel theme for mdBook
-
hck
A sharp cut(1) clone
-
emojic
Emoji constants
-
mdopen
Preview markdown files in a browser
-
hyperlink
Very fast link checker for CI
-
readability
Port of arc90's readability project to rust
-
lgtmeow
🐾 —— 「本喵觉得很不错~」
-
chewing
(酷音) intelligent Zhuyin input method
-
ferris-says
flavored replacement for the classic cowsay
-
obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
near-facsimile
Find similar or identical text files in a directory
-
yffi
Bindings for the Yrs native C foreign function interface
-
crowbook-text-processing
some utilities functions for escaping text (HTML/LaTeX) and formatting it according to typographic rules (smart quotes, ellipsis, french typograhic rules)
-
pdf-extract
extract content from pdfs
-
mdbook-embedify
based mdbook preprocessor plugin that allows you to embed apps to your book, like youtube, codepen and some other apps
-
unidown
Convert Markdown to Unicode
-
mdcat
cat for markdown: Show markdown documents in terminals
-
jmemo
CUI tool for maintaining memos
-
nu_plugin_clipboard
A nushell plugin to copy text into clipboard or get text from it
-
omekasy
Decorate alphanumeric characters in your input with various font; special characters in Unicode
-
chinese_dictionary
A searchable Chinese / English dictionary with helpful utilities
-
nu-utils
Nushell utility functions
-
uwu_cli
uwuifying the terminal
-
scraps
A static site generator that builds a wiki from a set of markdown files. Inspired by https://scrapbox.io/.
-
prop-check-rs
A Property-based testing Library in Rust
-
repgrep
An interactive command line replacer for
ripgrep
-
gh-emoji
Convert
:emoji:
to Unicode using GitHub’s emoji names -
vaporetto
pointwise prediction based tokenizer
-
ab_glyph
API for loading, scaling, positioning and rasterizing OpenType font glyphs
-
slice-command
slice is a command-line tool that allows you to slice the contents of a file using syntax similar to Python's slice notation
-
wezterm-bidi
The Unicode Bidi Algorithm (UBA)
-
regex-cursor
regex fork that can search discontiguous haystacks
-
mdbook-quiz
Interactive quizzes for your mdBook
-
colonnade
format tabular data for display
-
unicode-general-category
Fast lookup of the Unicode General Category property for char
-
treegrep
A pattern matcher frontend or backend which displays results in a tree
-
qpdf
Rust bindings to QPDF C++ library
-
instant-segment
Fast English word segmentation
-
mdbook-linkcheck
A backend for
mdbook
which will check your links for you -
ncount
A word count tool intended to derive useful stats from markdown
-
unicode-blocks
contains a list of all unicode blocks and provides some functions to search across them
-
ultra-nlp
A NLP library
-
datafusion-physical-expr
Physical expression implementation for DataFusion query engine
-
codespan_preprocessed
Beautiful diagnostic reporting for M4 (or cpp) preprocessed text files
-
sf-api
API to send commands to the Shakes & Fidget servers and parse their responses into characters
-
mini_markdown
Small markdown renderer
-
minimo
terminal ui library combining alot of things from here and there and making it slightly easier to play with
-
snailquote
Escape and unescape strings with shell-inspired quoting
-
arf-strings
Encoding and decoding for ARF strings
-
reason-shell
Reason: A Shell for Research Papers
-
regex-literal
delimited regular expression literals
-
bashtestmd
Compiles shell commands in .md files into Bash scripts for testing
-
arrow-row
Arrow row format
-
skyspell
Fast and handy spell checker for the command line
-
rust-ai
A collection of 3rd-party AI APIs for Rust
-
cskk
C ABIから使う事を目的とした SKK(Simple Kana Kanji henkan)方式のかな漢字変換ライブラリ
-
tauri-plugin-clipboard
A clipboard plugin for Tauri that supports text, files and image, as well as clipboard update listening
-
text_analysis
Analyze text stored as *.txt in provided file or directory. Doesn't read files in subdirectories. Counting all words and then searching for every unique word in the vicinity (+-5 words)…
-
fm
Non-backtracking fuzzy text matcher
-
newdoc
Generate pre-populated module files formatted with AsciiDoc that are used in Red Hat and Fedora documentation
-
mdbook-yapp
A mdbook preprocessor for simple text replacements
-
uwc
Counts things in unicode text files
-
mdbook-graphviz
mdbook preprocessor to add graphviz support
-
mecab
Safe Rust wrapper for mecab a japanese language part-of-speech and morphological analyzer library
-
vi
An input method library for vietnamese IME
-
mdbook-toc
mdbook preprocessor to add Table of Contents
-
levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
asmfmt
A formatter designed for programs in assembly language with AT&T syntax
-
in_definite
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
-
subplot
tools for specifying, documenting, and implementing automated acceptance tests for systems and software
-
mdbook-typst
An mdBook backend to output Typst markup, pdf, png, or svg
-
diffy
Tools for finding and manipulating differences between files
-
sre-engine
A low-level implementation of Python's SRE regex engine
-
words-count
Count the words and characters, with or without whitespaces
-
zalgo-codec
converting an ASCII text string to a single unicode grapheme cluster and back. Provides a macro for embedding Rust source code that has been encoded in this way.
-
array_tool
Helper methods for processing collections
-
kathoey
text feminization using open corpus linguistics data
-
betacode
conversion
-
term-rustdoc
A TUI for Rust docs
-
fuzzt
Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.
-
simple_peg
A command-line peg parser implemented by Rust
-
mix
markdown compiler with 🚀 fuel
-
two_percent
Fuzzy Finder in rust!
-
mdbook-presentation-preprocessor
A preprocessor for utilizing an MDBook as slides for a presentation
-
savvy
R extension interface
-
pink_accents
Replacement of patterns in string to simulate speech accents
-
unic-char-property
UNIC — Unicode Character Tools — Character Property taxonomy, contracts and build macros
-
crowbook
Render a Markdown book in HTML, PDF or Epub
-
finl_unicode
handling Unicode functionality for finl (categories and grapheme segmentation)
-
unicode-ccc
Unicode Canonical Combining Class detection
-
cindex
CSV indexing library
-
idna
IDNA (Internationalizing Domain Names in Applications) and Punycode
-
ferret
A trigram-based tool for detecting similarity in groups of text documents or program code
-
sublime_fuzzy
Fuzzy matching algorithm based on Sublime Text's string search
-
bogrep
Full-text search for bookmarks from multiple browsers
-
domrs
Document builder and serializer
-
tremor-kv
A logstash inspured key value extractor
-
stringcase
Converts string cases between camelCase, COBOL-CASE, kebab-case, and so on
-
autotex
Continuously compile TeX and LaTeX
-
runiq
An efficient way to filter duplicate lines from input, à la uniq
-
oxford_join
Join string slices with Oxford Commas!
-
mdbook-pandoc
A mdbook backend that outsources most of the rendering process to pandoc
-
moto
motivated automation
-
tfidf-text-summarizer
extractive text summarization system which uses TF-IDF scores of words present in the text to rank sentences and generate a summary
-
utf64
encode utf-8 strings into utf-64, and decode them back
-
sliceslice
A fast implementation of single-pattern substring search using SIMD acceleration
-
bfom
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
mdbook-repl
based mdbook preprocessor plugin that allows you to live code in your markdown book
-
ib-pinyin
一个高性能拼音匹配库
-
string-patterns
Makes it easier to work with common string patterns and regular expressions in Rust, adding convenient regex match and replace methods (pattern_match and pattern_replace) to the standard…
-
aki-mcolor
mark up text with color
-
pinyin-parser
Parses a string of pinyin syllables. Covers marginal cases such as
ẑ
,ŋ
andê
. -
async-utf8-decoder
Convert AsyncRead to incremental UTF8 string stream
-
jetscii
A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes
-
utf8-io
Traits and types for UTF-8 I/O
-
mdbook-theme
A preprocessor and a backend to config theme for mdbook, especially creating a pagetoc on the right and setting full color themes from the offical ace editor
-
docket
markdown to HTML documentation rendering
-
dicexp
A Dice Expression Interpreter program and library for parsing (and rolling) role-playing game style dice notations (e.g. "2d8+5")
-
os_display
Display strings in a safe platform-appropriate way
-
mdbook-angular
mdbook renderer to run angular code samples
-
arabic_reshaper
Reconstruct Arabic sentences to be used in applications that doesn't support Arabic script
-
unicode-reverse
Unicode-aware in-place string reversal
-
linebreak
breaking a given text into lines within a specified width
-
txt_to_md
Command converting from a txt file to a markdown file
-
nucleo-matcher
plug and play high performance fuzzy matcher
-
somedoc
A very simple document model and markup generator
-
galm
pattern matching library
-
markov_namegen
Random text generators based on Markov models
-
mdbook-chess
An mdbook preprocessing plugin to generate chess boards
-
zspell
Native Rust library for spellchecking
-
stop-words
Common stop words in many languages
-
inkjet
A batteries-included syntax highlighting library for Rust, based on tree-sitter
-
text2num
Parse and convert numbers written in English, Spanish or French into their digit representation
-
marker
finding issues in CommonMark documents
-
console_static_text
Logging for text that should stay in the same place in a console
-
ident_case
applying case rules to Rust identifiers
-
grok
popular java & ruby grok library which allows easy text and log file processing with composable patterns
-
evcxr
An Evaluation Context for Rust
-
nom-grapheme-clusters
Adapter that allows nom to account for unicode grapheme clusters
-
kanjidic_server
A REST API server for Kanjidic and EDRDG radical decompositions
-
divvunspell
Spell checking library for ZHFST/BHFST spellers, with case handling and tokenization support
-
hyperscan
bindings for Rust with Multiple Pattern and Streaming Scan
-
chocodye
changing the color of the chocobos' plumage in Final Fantasy XIV
-
mdbook-tailor
mdbook preprocessor for image-tailor
-
advreader
Split text file into text sequences, strings and (line) comments
-
wordshk_tools
A combination of parsers and other tools for words.hk (粵典)
-
mdbook-combiner
combine mdbook summaries from multiple source into one mdbook
-
decasify
A CLI utility and library to cast strings to title-case according to locale specific style guides including Turkish support
-
hangeul
Korean alphabet manipulation library
-
tossicat
입력된 단어에 맞게 같이 입력된 토시(조사)를 적절하게 변환하는 라이브러리
-
toolstr
formatting data into String's
-
mdbook-indexing
mdbook preprocessor for index generation
-
ruby_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
ik-rs
chinese segment, ik-analyzer for rust
-
pluralizer
Rust package to pluralize or singularize any word based on a count inspired on pluralize NPM package
-
relog
Strong String Normalization
-
align
aligning text
-
stringsext
find multi-byte-encoded strings in binary data
-
harfbuzz_rs
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
mdbook-dtmo
Creates a book from markdown files with added plugins
-
indefinite
Prefix a noun with an indefinite article - a or an - based on whether it begins with a vowel
-
asciigraph
Ascii Graph
-
mut-str
A toolkit for working with mutable string slices (&mut str)
-
heckcheck
A heckin small test case generator
-
lindera-tantivy
Lindera Tokenizer for Tantivy
-
whichlicense_detection
detect licenses used by the WhichLicense project
-
labyr
A PDDL benchmarker
-
pretty-xmlish
Pretty print XML-ish data with unicode art
-
two-face
Extra syntax and theme definitions for syntect
-
pragmatic-segmenter
Rust port of pySBD v3.1.0
-
dcsv
Dyanmic csv reader,writer,editor
-
grader
Stream-based CLI for binary sorting text files via a given shell command
-
regex_generate
Use regular expressions to generate text
-
line-numbers
Find line numbers in strings by byte offsets, quickly
-
multimarkov
generic tool for training and using multi-order Markov chains for procedural generation applications such as generating randomized but authentic-sounding names for characters and locations
-
twas
A text substitution application for using random look-up tables to generate text in a manner similar to the Mad Libs game
-
hello-phext
A rust-native implementation of phext
-
chisel-lexers
Chisel backend lexers/scanners
-
yozuk
Chatbot for Programmers
-
cbfr
A buffer that run on stack, focusing on performance and speed
-
dnd_spellbook_maker
making pdf documents of spells that ressemble 5th edition D&D official source book spell descriptions
-
wikidot-normalize
provide Wikidot-compatible string normalization
-
typed-oid
Typed Object IDs
-
escape-bytes
Escapes bytes that are not printable ASCII characters
-
ra_ap_sourcegen
TBD
-
rsonpath-lib
Blazing fast JSONPath query engine powered by SIMD. Core library of
rsonpath
. -
gspell
Rust bindings for gspell
-
mdi
markdown include
-
gosyn
Golang syntax parser in Rust
-
stream-rate-limiter
A rate limiter for Tokio streams
-
lindera-tokenizer
A morphological analysis library
-
stego_wps
text-based steganography using the Words Per Sentence (WPS) method. It allows encoding and decoding hidden messages within the structure of a given text.
-
mdbook-svgbob
SvgBob mdbook preprocessor which swaps code-blocks with neat SVG
-
flatpage
file system based markdown flat page
-
stylin
Convert markdown to pandoc markdown with custom styles
-
mdbook-callouts
mdBook preprocessor to add Obsidian Flavored Markdown's Callouts to your book
-
alien_temple
This program is so hyperspecific to me that I'm not going to even try to make it work for others
-
yeslogic-fontconfig-sys
Raw bindings to Fontconfig without a vendored C library
-
stringzilla
Faster SIMD-accelerated string search, sorting, fingerprints, and edit distances
-
xi-rope
A generic rope data structure built on top of B-Trees
-
choco
markup language for dialogue systems
-
stam-python
STAM is a library for dealing with standoff annotations on text
-
quranize
Encoding transliterations into Quran forms
-
reword
some utility functions for human-readable formatting of words
-
djotters
Djot (Markdown) parser that runs at hyper speeds!
-
mdbook-open-on-gh
mdbook preprocessor to add a open-on-github link on every page
-
mdbook-pagetoc
A mdbook plugin that provides a table of contents for each page
-
ssbm_utils
A collection of useful tools for working with data from Super Smash Bros Melee
-
jpreprocess
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
datatroll
a robust and user-friendly Rust library for efficiently loading, manipulating, and exporting data stored in CSV files
-
vndb_tags_get
convert VNDB tag list (JSON to markdown)
-
dnd-character
A Dungeons and Dragons character generator
-
roe
Unicode case conversion
-
amongify
A very ඞ sus ඞ program
-
aki-xcat
concatenate files that are plain, gzip, xz and zstd
-
srx
A mostly compliant Rust implementation of the Segmentation Rules eXchange (SRX) 2.0 standard for text segmentation
-
mudder
Generate lexicographically-spaced strings between two strings from pre-defined alphabets
-
unfurl
expanding links in text
-
mdbook-davids_cooking
A preprocesor for whatever https://davidsotomarchena.gitlab.io/davids-cooking/ needs
-
quixote
Quizzes and tests in Markdown
-
mdbook-kroki-preprocessor
render kroki diagrams from files or code blocks in mdbook
-
wa
WebAssembly and Rust utility library
-
thesaurus
An offline thesaurus library for Rust
-
collclean
Clean up collaboration commands in LaTeX files
-
human_regex
A regex library for humans
-
text_layout
Text layout algorithms
-
indented_text_writer
IndentedTextWriter
-
miHoYo-API
A API-Wrapper of miHoYo games in Rust
-
twilight-mention
working with mentions in the Twilight ecosystem
-
unicode-display-width
Unicode 15.1.0 compliant utility for determining the number of columns required to display an arbitrary string
-
screen_printer
displaying rectangular blocks of text to a terminal
-
unicode-bidi-mirroring
Unicode Bidi Mirroring property detection
-
fmtt
A stupid text formatter that tries to break lines on sensible punctuations
-
hr-id
A human-readable unique ID which supports Unicode, safe for URLs and file paths
-
groop
Outline-aware grep with syntax highlighting
-
wchar
Procedural macros for compile time UTF-16 and UTF-32 wide strings
-
image-to-ascii
Converts images and gifs to ascii art
-
harfbuzz
Rust bindings to the HarfBuzz text shaping engine
-
litua
Read a text document, receive its tree in Lua and manipulate it before representing it as string
-
egg-mode-text
Text parsing for Twitter: character counting, hashtag/mention extraction
-
sayit
String replacements using regex
-
html-auto-p
function like
wpautop
in Wordpress. It uses a group of regex replaces used to identify text formatted with newlines and replace double line-breaks with HTML paragraph tags. -
zhconv
Traditional/Simplified and regional Chinese variants converter based on MediaWiki & OpenCC rulesets and powered by AC automata 轉換简体、繁體及兩岸、新馬中文地區詞,基於MediaWiki和OpenCC之字詞轉…
-
utf16_iter
Iterator by char over potentially-invalid UTF-16 in &[u16]
-
timeblok
A language for event scheduling in plain text
-
wikipedia_prosesize
Count Wikipedia prose size
-
dekor
styling and character repository in Rust
-
unicode-security
Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules
-
wkhtmlapp
Convert html to pdf or image
-
zet
zet finds the union, intersection, set difference, etc of files considered as sets of lines
-
crlify
A std::io::Write wrapper that replaces with on Windows
-
analiticcl
approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation
-
linoleum
but ergonomic line editor
-
str_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
mdbook-cat-prep
a preprocessor for mdbook which provides teacher, subject, material and tag functionality
-
streampager
pager for command output or large files
-
textcode
Text encoding/decoding library. Supports: UTF-8, ISO6937, ISO8859, GB2312
-
prom-write
CLI for writing metrics to Prometheus over the remote write API
-
sbert
Sentence Bert (SBert)
-
gematria_rs
Gematria, a traditional Hebrew numerology system
-
rewrite
Safely rewrite file contents from stdin, even when file is open as an input
-
divvunspell-bin
Spellchecker for ZHFST/BHFST spellers, with case handling and tokenization support
-
cloc
Count, or compute differences of, lines of source code and comments
-
egui-dropdown
An actual dropdown list for egui
-
malachi
A domain specific pattern matching language made for defining bot commands
-
numeric-sort
A zero-allocation, human-readable sorting library
-
onig_sys
onig_sys
crate contains raw rust bindings to the oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma… -
chord3
Create pdf songbooks from chopro source
-
utf16_lit
macro_rules to make utf-16 literals
-
morph-rs
Dictionary Morphologizer for Russian language
-
techlead
CLI is a command-line interface that enables developers to chat with an AI assistant powered by the OpenAI GPT language model, designed specifically to help with your Rust project
-
moon-phases
Fast command-line application to show the moon phase
-
filecheck
writing tests for utilities that read text files and produce text output
-
vibrato
viterbi-based accelerated tokenizer
-
poriborton
Interconversion between Unicode and various Bengali ANSI encodings
-
line-straddler
Determine how lines through text (underlines, strikethroughs, etc) should be rendered
-
minify-html-common
Common code and data for minify-html*
-
precis-profiles
PRECIS Framework: Preparation, Enforcement, and Comparison of Internationalized Strings Representing Usernames and Passwords as defined in rfc8265; and Nicknames as defined in rfc8266
-
indent
Functions for indenting multiline strings
-
mdbook-codeblocks
A mdbook preprocessor to prepend customizable vignette to code blocks
-
chargrid
Text UI component framework
-
mdbook-keeper
An improved testing experience for mdbook
-
slicestring
slicing Strings
-
meaningsearch
package that helps you find meaningful lines of any given input. Especially useful in CTFs.
-
g2-unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
presenterm
A terminal slideshow presentation tool
-
clarifai_grpc
The official Clarifai gRPC Rust client
-
wimbd
A CLI for inspecting and analyzing large text datasets
-
kataru
YAML-based dialogue engine for game development
-
catalog-of-markdown
Generate the catalog of markdown file
-
rustic_print
A versatile Rust library for enhancing console output. It offers a range of features to create a more engaging and informative command-line interface.
-
text-to-ascii-art
program to convert text to ASCII art
-
mdbook-aquascope
Interactive Aquascope editor for your mdBook
-
kelp
A convert tool for Japanese
-
rust-tfidf
calculate TF-IDF (Term Frequency - Inverse Document Frequency) for generic documents
-
cgi2
Create CGI programmes
-
mdbook-variables
mdBook proprocessor for risolve variables configured from book.toml
-
etch
Not just a text formatter, don't mark it down, etch it
-
csml_interpreter
The CSML Interpreter is the official interpreter for the CSML programming language, a DSL designed to make it extremely easy to create rich and powerful chatbots
-
hmd
Custom Markdown Engine for my personal blog
-
rslint_errors
Pretty error reporting library based on codespan-reporting built for the RSLint project
-
jawk
JSON AWK
-
regex_mutator
The Nautilus regex_mutator
-
mdtrans
Markdown parser and transformer using
pest.rs
, focused on flexibility to a project’s needs -
lsp-textdocument
A LSP text documents manager that map of text document
-
orly
Download O'Reilly books as EPUB
-
hh-highligh
hh: The text highlighter
-
man
Generate structured man pages
-
unicode_categories
Query Unicode category membership for chars
-
text_trees
textual output for tree-like structures
-
lister-cli
Lister: Navigate Markdown Lists
-
asciimath-unicode
Convert asciimath to unicode
-
syllabize-es
Syllabize Spanish text, and much more
-
glk
Bindings for the Glk I/O interface for hosting interactive fiction interpreters
-
ungoliant
The pipeline for the OSCAR corpus
-
jcalendar
Japanese Calendar for Rust
-
inflections
High performance inflection transformation library for changing properties of words like the case
-
wz
Count words, fast
-
dtex
Better TeX
-
lcat
lolcat in rust! Full unicode support, escapes for ANSI escape sequences, hue shift in Cubehelix
-
tre-regex
Rust safe bindings to the TRE regex module
-
gulagcleaner_rs
Ad removal tool for PDFs
-
unicode-truncate
Unicode-aware algorithm to pad or truncate
str
in terms of displayed width -
sanitize-filename-reader-friendly
A filename sanitizer aiming to produce reader friendly filenames
-
loc
Count lines of code (cloc) fast
-
webreg
A CLI tool for testing regexes against web pages
-
strloin
copy on write slices of a string
-
docx-rust
parsing and generating docx files
-
cglue-bindgen
cleanup cbindgen headers for CGlue
-
repvar
A tiny CLI tool that replaces variables of the style
${KEY}
in text with their respective value. It can also be used as a rust library -
intuicio-frontend-simpleton
Simpleton frontend module for Intuicio scripting platform
-
grepox
Minimalist's grep written in Rust
-
gregex
Regex solver utilizing NFA
-
stylish-html
stylish
helpers for writing styles as HTML elements -
scanlex
lexical scanner for parsing text into tokens
-
aprilasr
High-level wrapper for the april-asr C api (libaprilasr) using aprilasr-sys
-
pandoc_types
Rust port of pandoc-types
-
rust-texas
generate latex documents
-
tagalyzer
A CLI tool to gather statistics on collections of plaintext-adjacent files
-
string_wizard
manipulate string like wizards
-
fast_aug
Fast data augmentation for text
-
detect-newline-style
Determine a string's preferred newline character
-
character_converter
Turn Traditional Chinese script ot Simplified Chinese script and vice-versa and tokenize
-
bcdown
Bilibili漫画下载器,written in Rust,支持epub pdf zip格式
-
clafrica
This application allows you to type most of the characters in the african alphabet in any text field
-
punycode
Functions to decode and encode Punycode
-
readability-text-cleanup
cleaning up text from HTML using Mozilla's Readability algorithm
-
koji
An interactive CLI for creating conventional commits
-
mdbook-pikchr
A mdbook preprocessor to render pikchr code blocks as images in your book
-
compute-file-server-cli
Uploads files to Fastly for serving directly from within Fastly Compute applications. Upload any type of file: images, text, video etc and serve directly from Fastly. It is ideal for…
-
distrs
PDF, CDF, and percent-point/quantile functions for the normal and Student’s t distributions
-
unic-ucd-ident
UNIC — Unicode Character Database — Identifier Properties
-
symspell
Spelling correction & Fuzzy search
-
sms_splitter
An SMS message splitter and part calculator with support for GSM and Unicode
-
uwu-rs
uwuifying library
-
nu-explore
Nushell table pager
-
lowcharts
draw low-resolution graphs in terminal
-
soundex-rs
that calculates soundex value
-
runanum
Существительные с правильными окончаниями после чисел
-
rep-grep
wgrep/write-grep CLI
-
tectonic_engine_bibtex
The
bibtex
program as a reusable crate -
extract-words
Extracts words from text without allocation
-
syllarust
quickly counting syllables
-
dotrain
.rain to rainlang composer
-
akai
Next-generation, functional and simple ID system
-
regex-cli-test
A command line tool for testing against regex patterns
-
md-localizer
Localize markdown with remote links
-
plsfix
Text cleaner upper
-
zipcodes
Query US zipcodes without SQLite
-
aki-gsub
substitude text command, replace via regex
-
dispnet-hash
Hash Library used for Dispnet
-
ncase
Enforce a case style
-
kasedenv
Read environment variables by lower, upper case or case-insensitive keys
-
mdbook-plantuml
A preprocessor for mdbook which will convert plantuml code blocks into inline SVG diagrams
-
gret
command line tool to search for patterns and show matches in a tree structure
-
bibliofile
A TUI epub reader inspired by DOS-era programs
-
schmfy
Schmfication library
-
markdown-formatter
Flavored Markdown (ZH) content formatter
-
broken-md-links
A command-line tool and library to detect broken links in Markdown files
-
gaze
small utility library with the goal of making it easier to scan/lex text and collections
-
dptran
run DeepL translations on command line written by Rust
-
easy_reader
easily navigating forward, backward or randomly through the lines of huge files
-
khat
A cat clone, nothing more nothing less
-
mdbook-gitbook
mdBook preprocessor to properly render GitBook specific syntax
-
emojicon
Find Emoji by using Emoticons and GitHub's, Bengali emoji names
-
aki-mcycle
mark up text with cycling color
-
slack_update
app to set Slack status, emoji and photo
-
pprint
Flexible and lightweight pretty printing library for Rust
-
bitranslit
Bi-directional transliterator for Rust. Transliterates strings according to the rules specified in the language packs.
-
shortguid
Short URL-safe Base64 encoded UUIDs
-
octor
rmd combines all readmes into one
-
textwrap-macros
procedural macros to use textwrap utilities at compile time
-
unfuck
Python 2.7 bytecode deobfuscator
-
royal_road_archiver
An archival program and library for the webnovel site RoyalRoad
-
print-positions
providing string segmentation on grapheme clusters and ANSI escape sequences for accurate length arithmetic based on visible print positions
-
str-utils
some traits to extend types which implement
AsRef<[u8]>
orAsRef<str>
-
secular
No Diacr!
-
termdiff
Write a diff with color codes to a string
-
text_manipulation_rs
generating random placeholder text in different languages
-
heatseeker
A fast, robust, and portable fuzzy finder
-
nlpo3
Thai natural language processing library, with Python and Node bindings
-
ngrammatic
Character-oriented ngram generator and fuzzy matching library
-
ttaw
talking to a wall, a piecemeal natural language processing library
-
string_manip_rust
Demo of managing projects
-
uulm-sopra23-group03-core
Core library for implementing Battle of the Centerländ
-
simple-word-count
word count function, try to get same result with Microsoft Office Word application
-
product-os-content
Product OS : Content provides a complete solution for content management for the purpose of serving content via Product OS : Server
-
patterns
no_std compatible pattern scan library using simd
-
indentation
Formatter
-
normalize-hebrew-rs
package that normalizes special symbols within Hebrew string used in the Qumran-Digital project
-
sedregex
Sed-like regex library
-
dodo-zh
manipulate the cedict dictionary and pinyin
-
mdbook-private
An mdbook preprocessor that controls visibility of private chapters and sections within them
-
float-pretty-print
Format f64 for showing to user, not for serialisation
-
unicode-casing
Titlecase helper function on characters
-
cellumina
configure and run cellular automata
-
newbase60
that implements Tantek Çelik's New Base 60
-
leven-distance
Compute operational differences between two sequences using the Levenshtein algorithm
-
pangu2
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
stringmatch
Allow the use of regular expressions or strings wherever you need string comparison
-
hyphertool
Hypertool is a command-line tool for syllabification and hyphenisation
-
speech-synthesis
Common traits for speech synthesis
-
ut1_blocklist
UT1 blocklist URL/domain filters
-
mdbook-fs-summary
Summary generator for mdbook
-
help_crafter
help message generator without hussle
-
case
A set of letter case string helpers
-
read_chars
An iterator over characters read from some I/O source
-
flowistry_ide
Information Flow in the IDE for Rust
-
matchers
Regex matching on character and byte streams
-
block-list
A minimalist hosts-based tool for managing block lists and ad-blocking
-
gdnative-doc
Documentation tool for gdnative
-
hns
Human numeric sorting program — does what
sort -h
is supposed to do! -
inflector-plus
Adds String based inflections for Rust. Snake, kebab, camel, word, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
redpatterns
a list of patterns for scanners 📟
-
highlight-pulldown
Process pulldown-cmark events to apply syntax highlighting to code blocks
-
regex-chunker
Iterate over the data in a
Read
type in a regular-expression-delimited way -
lindera
A morphological analysis library
-
ansi-width
Calculate the width of a string when printed to the terminal
-
recursive-file-loader
recursively load files via references in the files
-
prefix
A customizable pretty printer for FIX messages
-
markov
A generic markov chain implementation in Rust
-
anslatortray
translate from English to Pig Latin!
-
genpdf
User-friendly PDF generator written in pure Rust
-
string_utils
Some tools that are 'internal' for now because they are insufficiently refined and unstable, but which are used by other 'public' crates
-
texcore
Create LaTeX documents using native Rust types
-
whitespace-sifter
Sift duplicate whitespaces away!
-
gecliht
A disparate collection of text manipulation and formatting algorithms
-
character_text_splitter
splitting text into chunks with overlap, designed for handling large amounts of text efficiently. Implementation is identical to langchain's CharacterTextSplitter
-
cermic
Print .png images in your terminal as ascii art!
-
rsclp
A lib crate to parse proccess arguments
-
kansuji
漢数字と数字の相互変換のためのライブラリ
-
notan_glyph
glyph's support for Notan
-
easy-regex
Make long regular expressions like pseudocodes
-
grace-cli
CLI tool for processing files and strings
-
tibco_ems-sys
Rust bindings for the Tibco EMS C library
-
ini_puga
manage INI files
-
dingtalk
Robot Util, Send text/markdown/link messages using DingTalk robot, 钉钉机器人
-
mdbook-morsels
Morsels plugin for Mdbook
-
poppler
Wrapper for the GPL-licensed Poppler PDF rendering library
-
rs-wordcloud
☁ Flexible WordCloud implementation for ⚙Rust
-
pulldown-cmark-escape
An escape library for HTML created in the pulldown-cmark project
-
correct_word
A No brainer 'did you mean' library for Rust
-
rscx-mdx
MDX support for RSCx
-
rust_iso15924
ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts (a "set of graphic characters used for the written form of one or more languages")…
-
azusa
String index transformer for Rust utf8 to JavaScript utf16
-
ab-radix-trie
A compressed radix trie implementation supporting matching rules
-
mdbook-ocirun
mdbook preprocessor to run arbitrary commands and code snippets inside containers
-
grammalecte_client
Grammalecte HTTP client
-
presciidoc
Preprocessing AsciiDoc for other tools
-
svgbob_cli
Transform your ascii diagrams into happy little SVG
-
re_space_view_text_document
Space View that shows a single text box
-
minigrep_empty
读取文件中的指定字符串信息
-
mdbook-svgbob2
Alternative mdbook preprocessor for svgbob
-
ufofmt
A fast, flexible UFO source file formatter based on the Norad library
-
refac
Transform some text given a generic natural language prompt
-
slugify
Macro for flexible slug generation
-
unidoc
Unite all Markdown
-
char_trie
Text segmentation based on trie tree, High performance, support for custom dictionary
-
keyvalues-parser
A parser/renderer for vdf text
-
easy_io
Fast and dead-simple IO for competitive programming in Rust
-
ryaspeller
lib for searching typos in text, files and websites
-
emoji
Every emoji, their metadata, and localized annotations
-
mdbook-chapter-list
mdbook preprocessor for listing subchapters
-
dmntk-recognizer
DMNTK | Decision Table Recognizer
-
delay_writer
Wraps a writer and delays its output after each newline
-
rusk
a Specification Language
-
uniquewords-rs
Count the frequencies of words in text file(s)
-
kana
-
markdown-gen
generating Markdown files
-
pdf_encoding
Font related encodings
-
text2art
translate text to ascii art
-
sanitizer
A collection of methods and macros to sanitize struct fields
-
md-include
include any file in markdown files
-
slugify-rs
generate slugs from strings
-
senax-inflector
Pluralizes and singularizes English nouns. This crate is forked from https://github.com/whatisinternet/Inflector.
-
seshat-unicode
A Unicode Library for Rust. Unicode 15.1.0 ready. XID_Start and XID_Continue are also available.
-
csv-groupby
execute a sql-like group-by on arbitrary text or csv files
-
tagsearch
Filter plaintext files based on @keyword tags
-
fast-str
A flexible, easy-to-use, immutable, efficient
String
replacement for Rust -
match-pinyin-with-hanzi
Checks whether the sentence in Chinese characters (汉字) matches with the sentence in pinyin (拼音). Erhua is supported.
-
any2utf8
Convert any enncoding to utf-8
-
stardict_wrapper
Rust implement stardict
-
dsntk-gendoc
DSNTK | Documentation generator
-
refalizer
Make sure your branch names are good to drive
-
tectonic_io_base
Basic types for Tectonic's pluggable I/O backend system
-
rasciigraph
function to plot ascii graphs
-
jellybean
Syntax highlighting with tree-sitter. Sweet colors.
-
spoilers
High-level Rust bindings for CTranslate2
-
path2regex
Express style path to RegExp utility
-
pomsky-bin
Compile pomsky expressions, a new regular expression language
-
json_to_table
pretty print JSON as a table
-
ferrissay
cowsay
-
pulldown-cmark-mdcat
Render pulldown-cmark events to TTY
-
cfasttext-sys
fastText ffi binding
-
mdbook-bib
mdbook plugin allowing to load and present a bibliography in BibLaTex format in your books and cite its references
-
jira-clean
clean up Jira task description that is an output of jira-cli tool
-
kl-hyphenate
Knuth-Liang hyphenation for a variety of languages
-
hitori
Generic compile-time regular expressions
-
soft-ascii-string
char/str/string wrappers which add a "is-ascii" soft constraint
-
detect-indent
Detect the indentation of code
-
pandoc-ac
pandoc filter for converting acronym codes to LaTeX
-
lindera-dictionary
A Japanese morphological dictionary
-
cyrconv
A funny faux cyrillic character mapper
-
fancy-regex-fork-pb
A custom fork of the fancy-regex crate. You probably don't want to use this.
-
CFDTCP
Short text clustering for determining center points and correlation judgment [one-hot,Jenks Natural Breaks]
-
rust_book_code
The code of rust book
-
mdbook-quiz-schema
Schema for quizzes used in mdbook-quiz
-
indent_write
Write adapters to add line indentation
-
mdbook-mermaid
mdbook preprocessor to add mermaid support
-
tradukisto
Kinda useful natural language translation library and utility
-
sc_decode
Special character decoding
-
translitrs
Transliteration utility for Serbian language
-
rsnltk
Rust-based Natural Language Toolkit
-
csi
processing a directory of text files. Particularly useful for building static websites.
-
proc-macro-regex
A proc macro regex library
-
mdbook-preprocessor-boilerplate
Boilerplate code for mdbook preprocessors
-
stylish-stringlike
API for string-like objects that have styles applied
-
lingua-english-language-model
The English language model for Lingua, an accurate natural language detection library
-
kas-text
Text layout and font management
-
tracery
Text-expansion library
-
jput
puts and putc on unicode-width align for Rust
-
mdxbook
Fork of mdBook, with more customizations and flexibility for programmers
-
chardet
rust version of chardet
-
ranting
Linguistic formatting placeholder extensions for rust
-
bbcode-tagger
BBCode tree parser and tagger
-
markdown-extract
Extract sections of a markdown file
-
notmecab
tokenizing text with mecab dictionaries. Not a mecab wrapper.
-
java_string
Java strings, tolerant of invalid UTF-16 encoding
-
bk-tree
A Rust BK-tree implementation
-
dmntk-gendoc
DMNTK | Documentation generator
-
syntax-error
Syntax error report
-
didaskalia
over-complicated language to store text entries and sequences across different languages
-
cesu8
Convert to and from CESU-8 encoding (similar to UTF-8)
-
RustMqtt
Mqtt Encoding and decoding package
-
ob
A Blog and RSS system written in Rust
-
slidedeck
Create an HTML slide deck from Markdown
-
bytepiece_rs
The Bytepiece Tokenizer Implemented in Rust
-
bstr
A string type that is not required to be valid UTF-8
-
pullup
Convert between markup formats
-
montchik_minigrep
Testdrive of the Rust program
-
bfom-lib
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
ipa-translate
translating between IPA and ASCII text
-
lexmatch
lexicon matching tool that, given a lexicon of words or phrases, identifies all matches in a given target text. Uses suffix arrays.
-
chisel-parsers
Chisel parser front ends
-
jp_utils
Utils for working with Japanese text
-
mandown
Markdown to groff (man page) converter
-
lindera-ko-dic-builder
A Korean morphological dictionary builder for ko-dic
-
r-matrix
Rust port of cmatrix
-
grep-reader
short text for crates.io
-
cattocol
Combine two text into one text as columns
-
pascii
image => text
-
spark_sdk
Rust sdk of the Spark
-
fiberplane-markdown
convert Fiberplane Notebooks to and from Markdown
-
aki-xtee
copy standard input to each files and standard output
-
rustplexity
bigram-based perplexity calculator, useful for filtering out boilerplate or other abnormal text
-
group-similar
Group similar values based on Jaro-Winkler distance
-
hh-highlight
hh: The text highlighter
-
lexical-sort
Sort Unicode strings lexically
-
enum-ts
TypeScript Enum pattern matcher codegen
-
owned_chars
Owned iterators with the same output as Chars and CharIndices
-
LatexImgGen-lib
generating Latex images of equations with an edition number to avoid cache flushing and timestamp attributes
-
roxy_syntect
Roxy plugin for providing syntax highlighting via Syntect
-
pulldown-cmark-toc
Generate a table of contents from a Markdown document
-
waifu-calendar
fetch your favorite characters' birthdays from AniList
-
abjad
Calculate the numerical abjad value of Arabic-script text
-
mdbook-curly-quotes
mdBook preprocessor that replaces straight quotes with curlyquotes, except within code blocks or code spans
-
mdbook-bibfile-referencing
An mdBook preprocessor to add bibfile referencing to each page
-
tet_rs
A third-party implementation of Text Entry Throughput (ref. https://doi.org/10.1145/3290605.3300866) for Rust
-
ruSTLa
A reStructuredText → LarST ⊂ LaTeX transpiler
-
wcrs
GNU wc in Rust
-
unescape
Unescapes strings with escape sequences written out as literal characters
-
snp_sminigrep
my practice
-
recase
Changes the convention case of input text
-
rusty-x
A snippet manager in Rust
-
wit_owo
Finally, an adequate library for wit.ai in Rust
-
aho-corasick
Fast multiple substring searching
-
commonregex
Rust port for CommonRegex. Find all times, dates, links, phone numbers, emails, ip addresses, prices, hex colors, and credit card numbers in a string. We did the hard work so you don't have to.
-
linurgy
Manipulate the output of multiple newlines. Replace/Insert/Append newlines with text. Input and output from stdio/files/buffers
-
cutters
Rule based sentence segmentation library
-
dr
Command-line data file processing in Rust
-
ultra
Cryptanalysis of the Engima
-
campfire
A tiny static site generator, greatly inspired by Zola
-
mupdf
Safe Rust wrapper to MuPDF
-
textos
Texts, strings, formatting, unicode…
-
mdbook-auto-gen-summary
A preprocessor and cli tool for mdbook to auto generate summary
-
env2toml
Convert env vars to toml text
-
pdf_seekers
parser and information extractor from PDF documents based on keyword search functionality (powered by Rust)
-
convert_string
A trait to convert Strings to safe non-keywords and/or convert a Strings case (snake_case, PascalCase, ...)
-
godotdoc
documentation generator for GdScript
-
codevis
turning your code into one large image
-
bubble-bath
Small and quick HTML sanitizer
-
unveil-rs
Unveil Rs is a tool to create presentations from markdown files
-
ascii-hangman-backend
customizable Hangman game with ASCII-art rewarding for children (backend)
-
show-bytes
Display bytes as printable ascii with escape sequences as needed
-
manchu-converter
Converts transcripted Manchu text to Manchu script with Manchu alphabet
-
mdbook-alerts
mdBook preprocessor to add GitHub Flavored Markdown's Alerts to your book
-
remove-markdown-links
Turns
[example](https://example.com)
intoexample
. That’s it -
august
& program for converting HTML to plain text
-
spellabet
Convert characters into spelling alphabet code words
-
markdown-toc
Markdown Table of Contents generator
-
mdbook-external-links
Open external links inside your mdBooks in a different tab
-
mathematica-notebook-filter
mathematica-notebook-filter
parses Mathematica notebook files and strips them of superfluous information so that they can be committed into version control systems more easily -
rust_stringz
string manipulation library for Rust that you should probably don't need. It's just for learning purposes.
-
unflow
DSL to convert design to code
-
hello_lib
Demonstrate Generics Function
-
matrix_display
display a matrix in command line with pretty unicode separators
-
cha-rs
Extract specific characters from an input
-
mdict
Rust implement mdict
-
md-designer
A CLI tool for creating design docs in Markdown
-
letter-sequence
A method to create sequence displayed as uppercase or lower letters, or digits
-
spongebob
A command to convert text to the Spongebob meme case
-
unicode-canonical-combining-class
Fast lookup of the Canonical Combining Class property
-
mdbook-post
A CLI for add post to mdbook
-
markovish
Markov chain implementation for text generation
-
wordmarkov
but flexible Markov chain library, specifically for text sentences, which handles punctuation and whitespace
-
autoruby-cli
CLI to easily generate furigana for various document formats
-
testcall
companinon crate to bintest, implements test facilities
-
cargo-markdown
Local crates.io readme development server with ultra-fast hot reloading goodness
-
text-transliterate
transliterate texts using the SO iconv from POSIX
-
igo-rs
Pure Rust port of the Igo, a POS(Part-Of-Speech) tagger for Japanese (日本語 形態素解析)
-
text-colorizer
Transitionary package
-
morc
Dead simple, minimal markdown generator library written in Rust
-
readable-regex
Regex made for humans. Wrapper to build regexes in a verbose style.
-
loe
Very fast and yet another line ending (CRLF <-> LF) converter written in Rust
-
ascii-canvas
canvas for drawing lines and styled text and emitting to the terminal
-
whitespacers
A whitespace JIT-compiler, for when your whitespace just isn't executing fast enough
-
ccase
Command line interface to convert strings into any case
-
boringascii
Strings that can only be constructed to contain non-whitespace, non-control ASCII characters
-
clima
A minimal Markdown reader in the terminal
-
pulldown-cmark-fork
A pull parser for CommonMark
-
wool
Preview Github Markdown Offline
-
udp-logger-rs
Log macro for log's kv-unstable backend and a UDP socket logger
-
slicedisplay
Simplistic Display implementation for Vecs and slices
-
char-ranges
Iterate chars and their start and end byte positions
-
story-dl
Story web scraping
-
tgrep
Toy grep that honors .gitignore
-
awabi
A morphological analyzer using mecab dictionary
-
event_parser
Rust NLP library for parsing English natural language into icalendar events
-
ragegun
Performs lexica based analysis on text (i.e. age, gender, PERMA, OCEAN personality traits, ..)
-
korrektor
work with Uzbek language text processing
-
markdown-to-html
Markdown parser that runs at hyper speeds!
-
pact_matching
Pact-Rust support library that implements request and response matching logic
-
html-to-pulldown-cmark-events
Parse HTML to pulldown-cmark's events
-
fiat-lux
Offline terminal-accessible Bible
-
charmap
one-to-(none/one/many) character mapping
-
ascii_converter
converting between different ascii representations
-
c6o-obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
nu_plugin_ron
A Nushell plugin for parsing RON text
-
wattle
tools for rust
-
alpino-tokenizer
Wrapper around the Alpino tokenizer for Dutch
-
fuzzywuzzy
A pure-Rust clone of the incredibly useful fuzzy string matching python package, FuzzyWuzzy
-
trie-match
Fast match macro
-
moguls
Let the words of financial moguls inspire and guide you in your quest for financial excellence and understanding
-
chargrid_common
Common utilities for making text UIs with chargrid
-
twitter-text
in Rust
-
ipset_lookup
ipset is a command-line tool that takes networks or IPs and searches through a lot of different threat feeds quickly. It can also download the feed data necessary to perform the queries…
-
kanjidic_types
A collection of types encompassing the variety of data about kanji available from Kanjidic
-
bilingual
A cmdline tool used for markdown translation via calling Chinese translation api cloud services
-
unicode-language
detect language coverage given a list of codepoints
-
csvsc
Build processing chains for CSV files
-
unicode-normalization-alignments
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
subject-classifier
classifying a commit by it's subject
-
lines_lossy
extension to
BufRead
with a functionlines_lossy
that works likeBufRead::lines
but with lossy UTF-8 decoding -
terminal_cli
A standalone library with no-std support for command line terminal interfaces. With autocomplete support, helpers for commands and properties and a prompt implementation.
-
acorns
Generate an AsciiDoc release notes document from tracking tickets
-
unicode-case-mapping
Fast lowercase, uppercase, and titlecase mapping for characters
-
mdbook-d2
D2 diagram generator plugin for MdBook
-
null-terminated-str
FFI-friendly utf-8 string, enabling const null-terminated str and caching of the non-terminated string to avoid frequent allocation
-
regexgrep
ripgrep tool that suports regular expressions
-
spongemock
CLI-tool for random text capitalization
-
sluggify
slug or clean url generator for rust. With default settings, you will get an hyphenized, lowercase, alphanumeric version of any string you please, with any diacritics removed, whitespace and dashes collapsed…
-
beary
augment Bear, the note-taking app
-
mdbook-webinclude
Preprocessor for mdBook that includes content from URLs
-
hulk
An ultra simple no hassle static site generator
-
kspconfigtool
KSP1 ConfigNode parser and block removal tool
-
aki-mline
match line, regex text filter like a grep of linux command
-
fast_symspell
Spelling correction & Fuzzy search
-
mdzk
Plain text Zettelkasten based on mdBook
-
static_format
Format strings with no runtime overhead
-
mdbook-cmdrun
mdbook preprocessor to run arbitrary commands
-
regex-split
split_inclusive for the regex crate
-
cdragon-rst
Work with Riot translation files
-
dsntk-recognizer
DSNTK | Decision Table Recognizer
-
bitflip
functions to generate bitflips of binary and UTF-8 strings
-
uiuifree-dictionary
My MecabDic Util
-
ngram
Iterator adaptors for n-grams and k-skip-n-grams
-
mdbook-last-changed
mdbook preprocessor to add the last modification date per page
-
text_lines
Information about lines of text in a string
-
zspell-cli
Command line interface for the ZSpell spellchecking library
-
skill-tree
generate graphviz files to show roadmaps
-
tcalc-rustyline
A fork of Rustyline for use specifically with tcalc
-
sejong
Buffer is a buffer that can receive ASCII bytes different from keyboard and send out UTF-32 Hangul string. This buffer allows deletion by Jamo.
-
dwrote
Lightweight binding to DirectWrite
-
spinning_cube
Perspective projection of a spinning cube, using just ASCII characters
-
unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
frida-build
Build support for Frida Rust bindings
-
syntastica-themes
A collection of themes for syntastica
-
rex-regex
A pedestrian library for regular expressions: Slow, but easy to understand
-
uwuify
fastest text uwuifier in the west
-
computergeneration
compgen but all wrong
-
imperative
Check for imperative mood in text
-
markov_strings
A simplistic Markov chain text generator
-
zalgo-codec-common
converting an ASCII text string to a single unicode grapheme cluster and back
-
mutf8
Basic support for mutf8 strings
-
thesauromatic
command-line thesaurus that returns related words when given a word. The output words are one per line, making it easy to process in shell pipelines.
-
spacey
a tiny yet performant whitespace vmm/interpreter
-
falcom-sjis
Falcom-compatibile Shift JIS implementation
-
charwise
This lightweight, dependency-free rust library provides a convenient way to read characters from different resources
-
notedown_ast
Notedown Abstract Syntax Tree
-
reddish
A Rust utility library
-
character_frequency
counting character frequencies in a string concurrently
-
literumilo
A spell checker and morphological analyzer for Esperanto
-
untanglr
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies
-
terminal-clipboard
a minimal cross-platform clipboard
-
mdbook-open-gh-issue
mdbook preprocessor to add a open-on-github link on every page
-
imstr
Cheaply clonable and slicable immutable strings
-
pi_ucd
unicode字符函数,获得字符的语言区间段;及根据文字排版的需要,判断字符是否为单字字符或字母字符
-
spacemod
A easy to understand and powerful text search-and-replace tool
-
synox
Program synthesis of string transformations from input-output examples
-
rustrings
Strings manipulation for Rust
-
unicode_reader
Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters
-
simdutf
Unicode validation and transcoding at billions of characters per second
-
gimme
Pull useful data out of your clipboard
-
encoding-next
Character encoding support for Rust
-
polyvalid
A validator for package names, namespace, username and app name on wasmer
-
vape
full width aesthetics
-
rsmorphy
Morphological analyzer / inflection engine for Russian and Ukrainian (soon) languages (WIP)
-
soup
Inspired by the python library BeautifulSoup, this is a layer on top of html5ever that adds a different API for querying and manipulating HTML
-
bookrafter
This repository contains code related to bookrafter rendering
-
ripgrep
line-oriented search tool that recursively searches the current directory for a regex pattern while respecting gitignore rules. ripgrep has first class support on Windows, macOS and Linux.
-
text_distance
A collection of approximate string matching algorithms
-
forbidden-bands
8-bit string handling library
-
boss
Baseball Open Source Software
-
svgbobdoc
Renders ASCII diagrams in doc comments as SVG images
-
mask-text
mask text with multiple masking options
-
nfa_regex
NFA regex engine for text processing
-
text-tokenizer
Custom text tokenizer
-
marktask
A CLI tool for parsing and manipulating Markdown tasks
-
filenamify
Convert a string to a valid safe filename
-
vader_sentiment
Bindings for Rust from the original Python VaderSentiment analysis tool
-
mdbook-mathpunc
An mdbook preprocessor that prevents line breaks between inline math blocks and punctuation marks when using katex
-
ftrace
trace files and paths
-
doc-chunks
Clusters of doc comments and dev comments as coherent view
-
notegraf
Core library for building a graph-oriented notebook
-
mdbook-files
Preprocessor for mdbook which renders files from a directory as an interactive widget
-
confusables
around Unicode confusables/homoglyphs
-
case_insensitive_hashmap
A HashMap that uses case-insensitive strings as keys
-
sortuniq
Find or count unique values in an input stream
-
rupantor
A Bengali Phonetic Parser which is very flexible and supports Avro Phonetic
-
monkey-printer
infinite nr of monkeys you could write Shakespeare
-
pink-runtime
A very minimal replecement based DSL, intended for math
-
unicode_converter
CLI tool to convert data between various Unicode encodings
-
stfu8
Sorta Text Format in UTF-8
-
grammateus
facilitate working with Ancient Greek words
-
umlauts
text transformation of german umlauts
-
zalgo-text
A command line tool for generating zalgo text
-
uwl
A management stream for bytes and characters
-
anon-csv-cli
anonymise CSV files, providing various options to substitute real data with plausable fake data
-
intuicio-data
Data module for Intuicio scripting platform
-
mdbook-asciidoc
mdBook backend for AsciiDoc generation
-
neo-mime
Strongly Typed Mimes
-
character-set
High performance
set.contains(char)
-
prototty_render
Traits for defining a renderer and a view
-
guarding
guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.
-
yozuk-helper-english
English NLP utilities for Yozuk
-
armnod
random string generator
-
hyphenation_commons
Proemial code for the
hyphenation
library -
esl01-drawdag
Parse an ASCII DAG into parent relations
-
graphannis-cli
command-line interface to the new backend implementation of the ANNIS linguistic search and visualization system
-
bzzz
Bee-Speak Translator
-
valid_rust_char
A tiny library to check if a char it's valid in a rust file
-
streamparser
Adapt simple stream parsers for different types of streams
-
uwildmat
wildmat/uwildmat matching
-
catdream
Sleeping cat dreams your text
-
cheetah
An extremely fast and flexible static site generator
-
caser
Change text between PascalCase, camelCase, and snake_case
-
smart-date
Parses relative and absolute dates in natural language text
-
kana-converter
converter for half-width/full-width Japanese language characters (katakana, hiragana, and ASCII)
-
raekna
combines the other crates in the workspace to make a functional program
-
genkit
A common generator kit for static site generator
-
unicode-joining-type
Fast lookup of the Unicode Joining Type and Joining Group properties
-
pulldown_mdbook
A pull parser for mdBook
-
mdtranslation
prepare multi-lingual Markdown documents
-
macro_colors
colorful printing macros
-
trigram
Trigram-based string similarity for fuzzy matching
-
strizer
minimal and fast library for text tokenization
-
swot
community-driven or crowdsourced library for verifying that domain names and email addresses are tied to a legitimate university of college
-
gzip-cmp
that makes a distance measurement between binary data based on the difference of the compressed data length
-
zw
encoding and decoding text using zero-width characters
-
ctf-brute
Brute-force utilities for Rust
-
m4b/dryad
An almost-parallel, semi-functioning, dynamic linker experiment, written in Rust
-
strcursor
string cursor type for seeking through a string whilst respecting grapheme cluster and code point boundaries
-
ellipse
Truncate and ellipse strings in a human-friendly way
-
playin-cards
♠ Library for French-suited playing cards
-
iconv-compat-win-sys
iconv bindings for Rust
-
mdbook-open-git-repo
mdbook preprocessor to add a open-on-git-repo link on every page
-
repub-rs
binary for converting mhtml webpages into remarkable-style summarized epubs
-
bidi
Unicode Bidirectional Algorithm (UBA)
-
texting
string helpers
-
parattice
Recursive paraphrase lattice generator
-
STKLR
STKLR: pronounced 'stickler'. Is a cli tool to automatically link functions, enums, structs, traits etc in rust-doc docstrings. I couldn't find a tool like this when I needed it so... here we are.
-
small-bwt
BWT construction in small space
-
hxgm30-noise
Noise-generation for use in hexagram30 projects
-
glcp
Small library to help you set and get text from a global clipboard
-
charisma
Generate ASCII Art using AI 🦾🧠
-
wordfreq
port of wordfreq for looking up the frequencies of words in many languages
-
pinot
Fast, high-fidelity OpenType parser
-
glifnames
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
json-peek
Amature JSON parser library designed for my specific need
-
math-text-transform
Transform greek letters, latin letters, or decimal digits into certain variants from the mathematical alphanumeric symbols Unicode block (U+1D400–U+1D7FF). For example to bold, italic, script or double-struck.
-
nibi
static site generator. (WIP, implementing)
-
codes-iana-charset
This package contains an implementation of the IANA Character Set registry
-
text-diff
text diffing and assertion library
-
yeslogic-ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched with support for additional tables
-
mdbook-rust
Enhanced Rust support for MDBook
-
cabocha
Safe Rust wrapper for cabocha a japanese language dependency structure analyzer library
-
snekdown
A parser for the custom snekdown markdown syntax
-
enc-check
inspect utf-8 and utf-16 character encodings
-
unicode-vo
Unicode vertical orientation detection
-
libgtp
implmenting the gtp protocol
-
indentasy
Indent like a party
-
gbx
GBX (Grundbuch-Exchange) Dateiformat
-
dictcc
Rust API for reading and querying the dict.cc offline translation database
-
mdbook-unlink
A mdBook backend that validates local links
-
fmty
Composable
core::fmt
utilities -
veryfi
Module for communicating with the Veryfi OCR API
-
afrim-memory
handle of sequential codes easier for an input method
-
xpanda
Unix shell-like parameter expansion/variable substitution
-
md-ulb-pwrap
Markdown paragraph wrapper using Unicode Line Breaking Algorithm
-
text_alignment
Align your text in Rust in the CLI
-
esperanto-text
Convert Esperanto text between UTF-8, x-system and h-system transliterations
-
english-language-parser
parser of English sentences created for KMA Rust course
-
tectonic_engine_xetex
The XeTeX engine as a reusable crate
-
str-block
Remove the initial newline and common indentation from string literals
-
rex-regextract
extracts key value pairs out of text
-
ocr_b_checksum
Generates OCR B Checksums
-
destiny_rs
An async library to interact with Bungie's destiny 2 api
-
clparser
A command line tool for parsing CHANGELOG.md files that use the Keep A Changelog format. (fork marcaddeo/clparse)
-
runiq-lib
An efficient way to filter duplicate lines from input, à la uniq
-
mdbook-footnote
mdbook preprocessor for footnotes
-
unindenter
unindent text
-
clippers
Cross-platform clipboard management library
-
tradu
command to traslate using DeepL
-
faster-chars-count
counting length of chars faster than Chars::count()
-
rutie
The tie between Ruby and Rust
-
simdnbt
an unnecessarily fast nbt decoder
-
ucfirst
Uppercase the first letter of a string
-
mle
The markup link extractor (mle) extracts links from markup files (Markdown and HTML)
-
rl-sys
Native bindings to libreadline
-
whitespace_text_steganography
A steganography strategy that uses whitespace to hide text in other text
-
common_substrings
Finding all common strings
-
tpt
Pure Rust implementation of the Unix concatenate (cat), word-count (wc) and echo command
-
cowonverter
owo-ify text files. provides a simple function for this purpose.
-
mdbook-twiki
twiki backend for mdbook
-
rcut-lib
rcut is a Rust replacement for GNU cut that supports UTF-8
-
naromat
Convert text to narou novel format
-
cyrla
two-way conversion between latin and cyrillic script
-
the-book-tui
A TUI application to load, read, and search The Rust Book
-
markdown-composer
composing markdown documents
-
idna-cli
Encode/decode Unicode domain names to/from IDNA ASCII
-
text-scanner
A UTF-8 char-oriented, zero-copy, text and code scanning library
-
ucd-raw
Uninterpreted access to the unicode UCD
-
lingua-german-language-model
The German language model for Lingua, an accurate natural language detection library
-
kirjat-rs
prices for finnish textbooks from multiple stores
-
mini_paste
Fast-to-compile equivalent to
::paste
-
kvu
The simplest command line tool to manage key-value pair lines
-
braille_pics
producing text-art pictures using Braille characters
-
minigrepdemothorndal
A mini grep exercise from rust book
-
libxdiff
Rust bindings for the libxdiff C library
-
enso-lazy-reader
An efficient buffered reader
-
archetype
Lightweight golden testing library
-
pdf-min
Very minimal crate for writing PDFs
-
runestr-pancjkv
rune
-based Pan-CJKV support -
mdbook-obsidian
mdBook preprocessor to render Obsidian specific syntax
-
afrim-translator
Manage the predication system of the afrim input method
-
matchable
Check if text is matching against string or regex in one API
-
goodname
assist you with cool naming of your methods and software
-
koelner-phonetik
koelner_phonetik or cologne phonetics is a phonetic algorithm like soundex, but specialized for german words
-
solstice_glyph
A fast text renderer for solstice, powered by glyph_brush
-
lexa-wildcard-matching
Correspondance de caractères génériques sur les chaînes de caractères
-
uwubot
discord bot for uwuifying text
-
encoding-index-singlebyte
Index tables for various single-byte character encodings
-
looking-glass
reflection & type-erasure library for Rust
-
winrt-globalization
-
tectonic_status_base
Basic types for reporting status messages to a user
-
quickner-core
A fast and simple NER tool
-
repub
convert markdown documents to epub
-
mdbook-infisearch
InfiSearch plugin for Mdbook
-
todo-to-issue
CLI tool that converts forgotten TODO comments into actionable GitHub issues
-
economic_indicator_finder
A finder for extracting economic indicators from paragraphs
-
is-vowel
Heuristically test whether a character is a vowel letter
-
rosie
Interface for the Rosie Pattern Language, for efficient and maintainable text pattern matching and search
-
detone
Decompose Vietnamese tone marks
-
mdcat-http-reqwest
Reqwest-based HTTP resource handler for mdcat
-
porter-stemmer
Flexible and unicode friendly, Porter stemmer implementation
-
rnltk
Natural Language Toolkit for Rust
-
flatlake
In the works
-
istr
a fast string interning library based off of ustr
-
word_filter
A Word Filter for filtering text
-
mdbook-scientific
Enables inline equations for mdbook to set by $..$ signs and $$..$$
-
markdown_to_html_parser
parses Markdown syntax into HTML
-
rust_readability
A package to assess the complexity of texts using a variety of readability formulas
-
humnum
Human numeric sorting program — does what
sort -h
is supposed to do! -
mdbook-multicode
Allows you to give multilanguage code examples, toggled by a spinner
-
console-traits
Basic console traits, such as setting char position and writing strings
-
markdown-it-autolink
A markdown-it plugin for parsing GFM autolinks
-
lf_lint
Linter that checks that all files in <PATH> ends with a newline
-
xhtmlchardet
Character set detection for XML and HTML
-
lindera-unidic-builder
A Japanese morphological dictionary builder for UniDic
-
mail-internals-ng
[mail-api] _internal_ parts for the mail-api crates
-
pomsky-macro
Macro for converting pomsky expressions to regexes
-
hyper-old-types
HTTP types from hyper 0.11.x
-
ctrl-z
A composable reader to treat
0x1A
as an end-of-file marker -
pithy
Ultra-fast, spookily accurate text summarizer that works on any language
-
founder
The package provides a font toolbox
-
str_overlap
Methods for finding the overlap between two string slices
-
csv_to_table
pretty print CSV as a table
-
inslice
A command-line utility for filtering text input by columns and rows
-
mdbook-najan
Preprocessor for the Najan mdBook
-
jg
Jeff Goldblum (jg) is a command-line JSON processor. jg searches for structural patterns in json input and prints each json object that matches the pattern.
-
git-blamediff
A program to automatically annotate changes to a file in git(1)
-
igpay-atinlay
Translate text to Pig Latin
-
flw
Process text via configurable tasks
-
rustrawi
Rust port of the original PHP Sastrawi
-
aki-unbody
output first or last n lines, like a head and tail of linux command
-
changecase
A trait and implementation for changing the case of Strings and &str. It currently supports uppercase, lowercase, alternating case, and inverting case. Title case is in the works.
-
ansi-to-tui
convert ansi color coded text into ratatui::text::Text type from ratatui library
-
maud-pulldown-cmark
An adapter between maud and pulldown-cmark
-
neshan-rs
neshan.org map client in rust
-
mdbook-nix-eval
mdbook preprocessor for evaluating nix expressions
-
text_searcher
text searcher tool
-
wtf8-rs
WTF-8 encoding
-
file-search
File indexing and search
-
utf-cli
A UTF code viewer
-
mdbook-bash-tutorial
A mdbook preprocessor that allows embedding Bash scripts as tutorials
-
ucd
Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…
-
csr
encrypting and decrypting many types using caesar cipher. This shouldn't be used for any serious security but was more of an exercise in generics and creating a rust crate.
-
wildflower
Wildcard matching against strings
-
rustex
auto-generated LaTeX files in Rust
-
markdown-includes
Include other documents, table of content, or rust-doc in Markdown using a simple template system
-
shutil
Shell utility helper library
-
icu_casemapping
Unicode case mapping algorithms
-
mdx
in Rust
-
alass-ffi
FFI bindings for subtitle synchronization with alass
-
polyhorn-ios
Polyhorn for iOS
-
graphql-minify
Minify GraphQL queries
-
fastchr
Faster memchr using SIMD intrinsics
-
diffy-fork-filenames
Fork of https://docs.rs/diffy that allows specifiying filenames
-
hsk
Return HSK Level for Simplified Chinese Characters
-
mdbook-superimport
Import code/text from other files into your mdbook - without the link rot
-
mdbook-quiz-validate
Input validation for quizzes used in mdbook-quiz
-
treelight
A syntax highlighter for the web using tree-sitter
-
skribo
low-level text layout
-
plagiarismbasic_lib
Basic plagiarism checker written in Rust
-
md2gemtext
for converting Markdown into gemtext
-
zp
Copy the contents of the source file or the std output buffer to the clipboard, allowing users to easily paste the contents into another file or program
-
csv-sanity
Sanitize and transform large CSVs with millions of records quickly and efficiently
-
make_it_braille
A CLI tool and library to manipulate unicode braille images
-
rst_parser
a reStructuredText parser
-
libwizard
creating a simple http web server or a (somewhat static) API
-
md_converter
My markdown to html converter
-
wcounter
Give the word and count the appearance
-
uchr
translate char values to the characters themselves, and vice versa
-
textblocks
working with text blocks
-
genex
Text-expansion library
-
whitespace-conf
Key-value configuration file delimited with whitespaces
-
mdbook-fix-cjk-spacing
mdbook preprocess that fixes CJK line breaks
-
univstring
The Universal String(Rust String, CString, Wide(UTF-16/UTF-32) CString) trait
-
textedit-merge
sorting and merging textedits
-
pillar
small tool to format lines into columns
-
csmlinterpreter
The CSML (Conversational Standard Meta Language) is a Domain-Specific Language developed for creating conversational experiences easily
-
unic-segment
UNIC — Unicode Text Segmentation Algorithms
-
whitespace
Encode arbitrary data whitespaces and vice versa
-
genere
randomization of text respecting grammatical gender of sentences
-
untex
Understand and manipulate TeX files with ease
-
transcript
A transcriber for European scripts
-
strmatch
Conditionally match strings in Rust using regex without much boilerplate
-
minigrep5
grep implementation in Rust
-
crawdad
ChaRActer-Wise Double-Array Dictionary
-
kbremap
Custom keyboard layouts for windows
-
compact_str
A memory efficient string type that transparently stores strings on the stack, when possible
-
utfx
-
learning_fromdocs_minigrep
Project made on the path toward learning Rust from its docs. Completely personal and to test how to upload to crates.io.
-
regex-automata
Automata construction and matching using regular expressions
-
minigrep-carlossgv
grep implementation in Rust
-
rammer
A Baysian Text Classifier for Spam/Ham
-
cw
Count Words, a fast wc clone
-
password-characters
help with the "enter the 12th, 35th, and 63rd characters from your password" situations
-
ropey
A fast and robust text rope for Rust
-
qm-role-build
Role builder from markdown tables
-
cowsay
program that generates ASCII art pictures of a cow with a message
-
unicopy
program to open a textbox that you can type one unicode on and it will be copied to your clipboard
-
shoebill
A Wadler/Leijen style pretty-printer
-
aki-stats
output the statistics of text, like a wc of linux command
-
cur
that will hunt for your regular expression
-
smartcat
Putting a brain behind
cat
. CLI interface to bring language models in the Unix ecosystem 🐈⬛ -
verba
working with Latin words
-
lindera-ipadic-builder
A Japanese morphological dictionary builder for IPADIC
-
mdbook-skill-tree
mdbook plugin to show roadmaps
-
tectonic_bridge_flate
Exposing flate compression to the Tectonic C code
-
roxy_markdown_parser
Roxy plugin for parsing Markdown
-
argot
Parse documentation from codebases into Markdown for easy doc creation
-
re_space_view_text_box
Space View that shows a single text box
-
mdbook-image-size
A mdbook preprocessor which support image size syntax
-
text-sanitizer
convert text to plain ASCII text
-
dumbfuzz
dumb library for fuzzy search
-
cautious-octo-funicular
Test: shipping an mdbook with API docs
-
bionic-ebooks
Takes an EPUB file and generate a copy with bionic like font applied
-
subscript-compiler
A modern LaTeX rendition
-
fsays
flavored replacement for the classic cowsay
-
to-pdf
Export & Preview as PDF
-
string-overlap
A helper crate for "layering" ASCII art
-
bebop-lang
LISP based Markdown preprocessor for creating html documents!
-
modeling
tools to analysis different languages by Ctags
-
insim_core
Core types and traits for the insim crate
-
unidok
A powerful, readable, easy-to-learn markup language
-
mqtt-analyzer
Help analyzing MQTT messages
-
rreplace
designed to streamline string replacements. It can handle multiple unique replacements and iterates the string only once.
-
strip_markdown
remove markdown syntax from markdown files
-
sola
A 16-bit stack machine and forth language
-
linkcheck
extracting and validating links
-
mdplayscript
An extension of Markdown for play scripts
-
ansi-cut
cutting a string while preserving colors
-
precis-tools
Tools and parsers to generate PRECIS tables from the Unicode Character Database (UCD)
-
rspanphon
rough Rust port of the Python PanPhon library, extracts articulatory features from IPA strings and implements operations on them
-
encoding_rs_transcode
Transcode text within writers using encoding_rs
-
pretok
A string pre-tokenizer for C-like syntaxes
-
truncate_string_at_whitespace
Truncate a &str at the closest whitespace to a specified length with unicode safety
-
text_styled
Trait to style String and &str
-
glyph-names
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
ra_rustc_lexer
lexer used by rustc, zero stability gurantees
-
dprint-cli-core
Core dprint CLI library for reuse
-
fmt-extra
Extra formaters not shipped in the standard library
-
strange
A static website generator
-
annoyify
TeXt To LoOk LiKe ThIs
-
charx
A replacement for char::is_ascii*
-
whisperer
将文本编码为简短的中文字符, 防和谐
-
folia
High-performance library for handling the FoLiA XML format (Format for Linguistic Annotation)
-
cli_utils_hoijui
A tiny CLI utilities library, providing functions and constants useful in many CLI tools
-
uiuifree-normalize
uiuifree text normalize
-
llmvm-core-lib
llmvm core application
-
hl_core
Syntax highlighting in Rust
-
string-box
Create Rust string from UTF-8 string, byte string or wide string
-
codes-iso-15924
This package contains an implementation of the ISO 15924 Codes for the representation of names of scripts specification
-
caribon
A repetition detector program and library
-
decline-word
Choose word form based on given number
-
encoded-words
Encoded Words for usage in MIME headers
-
pulldown_typst
A pull parser for Typst markup
-
mdbook-compress
Compress an mdBook project into a single PDF file
-
chinese2digits
The Best Tool of Chinese Number to Digits. A useful tool in NLP and robot project.
-
noodler
A port of the python-ngram project that provides fuzzy search using N-gram
-
unicode-intervals
Search for Unicode code points intervals by including/excluding categories, ranges, and custom characters sets
-
twitter_text_parser
Parser for twitter-text in Rust
-
clippy_lints
A bunch of helpful lints to avoid common pitfalls in Rust
-
e_book_sync_library
Synchonize e-book with your local e-library
-
scoundrel
An engine for working with ASCII roguelikes
-
token-read
reading whitespace delimited files intended for competitive programming
-
swrite
Infallible alternatives to write! and writeln! for Strings
-
text-utils
Text utils for unescaping and align
-
trim_lines
An extremely simple and tiny library which provides an iterator over the lines of a string, trimmed of whitespace. It is a simple wrapper around the Lines iterator in std::str which trims the whitespace from each line.
-
sastrawi
stemming and stopword removal for Bahasa Indonesia based on PHP sastrawi project by Andy Librian
-
arbitrator
Format text based on a set of rules and regexes
-
zummi
fun lib that produces spoonerisms
-
framework
detector for different frameworks in one projects
-
gfm-autolinks
Parse GitHub Flavored Markdown autolinks
-
charname
Incredibly simple library that just gives you the Unicode name for a character
-
timfmt
A small utility for formatting code as Tim likes it
-
llmvm-outsource-lib
outsource backend for llmvm
-
synctex_sys
Rust Bindings to libsynctex's C API
-
yeslogic-unicode-script
Fast lookup of the Unicode Script property
-
lsp-document
Helpers to convert between LSP documents and Rust strings
-
libflagup
Display a country's flag as an emoji
-
write16
A UTF-16 analog of the Write trait
-
crypto-invert
Unicode Upside-Down Mapping
-
aki-txpr-macro
the more easy to use libaki-*
-
spandex-hyphenation
Knuth-Liang hyphenation for a variety of languages
-
pathmut
Command line utility for extracting, removing, and replacing components from path strings
-
resrap
A parser library
-
spdkit-surface
short text for crates.io
-
qpdf-sys
Rust bindings to QPDF C++ library via FFI and bindgen
-
haoxue-dict
Chinese dictionary and word segmenter
-
rmw-utf8
Short text compression algorithm for utf-8 (optimized for Chinese , developed based on rust programming language). 面向utf-8的短文本压缩算法(为中文压缩优化,基于rust编程语言开发)。
-
tabwriter
Elastic tabstops
-
crustword
Crusty Crosswords
-
trim
trim whitespaces from files
-
corpus-count
Util to count words and character ngrams in a corpus
-
lindera-analyzer
A morphological analysis library
-
text_unit
Newtypes for text offsets
-
mime_4
Strongly Typed Mimes
-
text-to-json
Convert text to json in rust
-
ascii-hangman-webapp
customizable Hangman game with ASCII-art rewarding for children (webapp version)
-
scripter
A screenplay compiler
-
ogrep
searching in indentation-structured texts
-
ferris-finder
Unicode character finder with fuzzy search
-
yarner-block-links
A Yarner plugin that adds to each code block a list of links to all referenced and all referencing blocks
-
marko
Programmtically format text with Markdown syntax
-
moscato
Outline scaler for OpenType glyphs
-
h_hangul
Korean Characters
-
simple_csv
CSV parsing implementation
-
jpreprocess-njd
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
homoglyph-core
Domain layer for the homoglyph cli and service. Compute homoglyphs for an input sentence.
-
chars_data
Build-dependency for chars, the unicode character information CLI
-
hina
:]
-
fgr-rs
A Find & Grep command line utility with SQL-like syntax
-
vndb_rs
Bare-bones VNDB api
-
mdbookshelf
Create epubs from a list of mdbook repositories
-
kincaid
A word statistics library in Rust
-
mdtransform
program that renders Markdown as HTML for websites
-
analisar
A Lua parser for Rust
-
wordfreq-model
Model loaders for wordfreq-rs
-
mocker
transforming text with uppercase and lowercase letters, resembling the Spongebob Mock Meme
-
morse-nostd
A nostd version of the morse crate
-
quill_delta_pdf
Convert Quill Delta to PDF
-
mdrend-this-is-only-a-beta-demo
Renders markdown to HTML
-
cli-animate
Enrich your Rust CLIs with stunning, easy-to-implement animations
-
markdown-table
Creating markdown tables with Rust!
-
pdfutil
PDF document manipulation
-
s_minigrep
project from rust book
-
collapse
Trim and collapse consecutive whitespace to a single space
-
mdbook-hide
A preprocessor for mdbook that adds support for hidden chapters
-
tectonic_bridge_harfbuzz
Expose the Harfbuzz C/C++ APIs to Rust/Cargo
-
arabic-script
An expressive API for the characters of the Arabic script
-
rew
A text processing CLI tool that rewrites FS paths according to a pattern
-
hunspell-sys
Bindings to the hunspell C API
-
boxy
Declarative builder for Unicode box-drawing characters
-
darts
A double array trie, A Forward Maximum Matching Searcher
-
old_icelandic_zoega
Old Icelandic dictionary for Rust. From "A Concise Dictionary of Old Icelandic" by Geir Zoëga
-
sixbit
Small packed strings
-
struckdown
A structured markdown / commonmark library for Rust
-
twitter_text_conformance
twitter-text in Rust
-
rsonpath-test-codegen
Blazing fast JSONPath query engine powered by SIMD. TOML-based test codegen for rsonpath-lib.
-
strings
String utilities, including an unbalanced Rope
-
irssi-sys
Automatically generated bindings to irssi
-
milligrep
Custom simplified implementation of grep
-
unic-ucd-age
UNIC — Unicode Character Database — Age
-
markdown-linkify
Markdown preprocessor for substiting link shorthands to valid links according to configurable regexes and custom substitution implementations
-
glob-match
An extremely fast glob matcher
-
p4d-mdproof
Markdown to PDF converter
-
outerspace
Methods for prefixing and suffixing the non-whitespace characters in a string
-
zawk
an efficient Awk-like language with stdlib
-
lindera-cc-cedict-builder
A Chinese morphological dictionary builder for CC-CEDICT
-
darn
Extras for the Book on Data Analysis with Rust Notebooks
-
ezemoji
Catigoryized Emoji's
-
tre-regex-sys
Rust bindgen bindings to the TRE regex module
-
saurus
Markdown to LaTeX Transpiler
-
morsels_lang_ascii
Basic ascii tokenizer for morsels
-
hashmoji
hash functions that returns emojis
-
is_utf8
functions to determine if a sequence of bytes is valid utf-8
-
mdbook-translation
prepare multi-lingual mdBook books
-
finl-charsub
Character substitution module for in-line replacement of fixed strings
-
tfidf-summarizer
Basic tf-idf compute for documents
-
pygmentize
wrapper for syntax highlighting
-
chinese-ner
A CRF based Chinese Named-entity Recognition Library written in Rust
-
aprilasr-sys
Low-level FFI bindings for the april-asr C api (libaprilasr)
-
gen3-charset
Pokemon Generation 3 Character Set Support (GBA)
-
kryptor
command-line cryptography tool
-
mdbook-embed
A preprocessor that simplifies embedded URL
-
ewin-com
editor for Window(GUI) users.No need to remember commands
-
readwise
wrapper for the Readwise API
-
yozuk-model
NLP model generator for Yozuk
-
comment-strip
Remove comments out of text files
-
casaubon
A feature-poor static site generator
-
ngrams
Generate n-grams from sequences
-
rodarte-minigrep
CLI for simple plain-text searching
-
mdbook-wordcount
Word count for mdbook, inspired by the mdbook tutorial
-
mediawiki_parser
A strict parser for MediaWiki markdown
-
diff_match_patch
Diff Match Patch is a high-performance library in Rust-lang that manipulates plain text
-
case-macro
Macros for changing the casing of things
-
veloci_levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
indoc
Indented document literals
-
jpreprocess-jpcommon
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
casile
The command line interface to the CaSILE toolkit, a book publishing workflow employing SILE and other wizardry
-
simple-localization
localization library for text translation
-
ellie_tokenizer
Tokenizer for ellie language
-
lingua-latvian-language-model
The Latvian language model for Lingua, an accurate natural language detection library
-
translation-api-cn
Some useful structs for calling Chinese translation api cloud services. A helper tool for
bilingual
cmdline tool. -
allsorts_no_std
Font parser, shaping engine, and subsetter for OpenType, WOFF, and WOFF2
-
mdbook-playscript
Preprocessor for mdBook, which styles stage play scripts
-
difference
text diffing and assertion library
-
spellcheck_toy
a basic spellchecking library based on edit distance
-
emoji-printer
Replace emoji shortcodes in string with emoji unicode (":sushi:" -> 🍣)
-
mdbook-latex
An mdbook backend for generating LaTeX and PDF documents
-
intname
Full English name for any integer of any primitive integer type
-
encoding-next-index-tradchinese
Index tables for traditional Chinese character encodings
-
slideck
Create an HTML slide deck from Markdown
-
fwuffgrep
Basic implementation of a grep command written in rust
-
fontconfig-rs
Safe, higher-level wrapper around the fontconfig library
-
mdbook-numeq
An mdbook preprocessor for automatically numbering centered equations
-
kakasi
Romanize hiragana, katakana and kanji (Japanese text)
-
gen-epub-book
Generate an ePub book from a simple plaintext descriptor
-
termwrap
Wrap Unicode text with ANSI color codes
-
basic-text-internals
Basic Text string literal implementation details
-
catmark
Console printer for CommonMark
-
tzgrep
grep tar.gz
-
textalyzer
Analyze key metrics like number of words, readability, and complexity of any kind of text
-
gqlog
👾 filter your json logs with graphql 👾
-
every-range
Iterator for interleaving included and excluded ranges
-
shallow
long text
-
minigrep-danidim13
A test crate based on Rust book examples
-
assert-text
the testing macro tools
-
ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched
-
sudachiclone
sudachiclone-rs is a Rust version of Sudachi, a Japanese morphological analyzer
-
const-utf16
Utf8 to utf16 conversion functions for use in const contexts
-
hexhex_impl
use hexhex instead of this
-
compute-file-server
used to serve files from Fastly Object Store
-
libpcre-sys
Native bindings to libpcre
-
pdf-create
low-level, strongly-typed PDF creation library
-
moenarchbook
Creates a book from markdown files
-
mdlint-cli
tbd
-
ed_join
Implemtation of Ed-Join Algorithm for string similarity join
-
node-emoji
Convert
:emoji:
to Unicode using GitHub’s and EmojiDB’s emoji names -
fancy_mocker
transforming text into alternating uppercase and lowercase letters, resembling the Spongebob Mock Meme
-
varcon-core
Varcon-relevant data structures
-
literate
programming tool that extracts code written in your Markdown files
-
pencil_case
but effective library for writing GUI in bevy
-
top-english-words
retrieve top words from the English language
-
romulus
a stream editor like sed
-
rustscii
ASCII game framework
-
furigana
Map furigana to a word given its reading
-
markdown-rs
mdast-compliant Markdown parser in Rust with ASTs and extensions
-
omgwtf8
Optimized-Matching-Generalized Wobbly Transformation Format — 8-bit
-
lines
Utililities for iterating readers efficiently line-by-line
-
utf
UTF-8
-
yozuk-sdk
Types used in the Yozuk ecosystem
-
spacebar
An anti-plagiarism tool based on null width characters
-
topfew
CLI to find high frequency occurrences in structured text files
-
txtframe
Creates a frame for text
-
portmanteau
create portmanteaux
-
code-span
Add additional infomation to code character
-
mmgrep
search in files
-
case_convert
Converts the first letter of a Rust String to uppercase
-
mul
Bengali stemmer
-
jmdict
The free Japanese dictionary database JMdict, pre-digested for use in Rust libs and apps
-
rust_lemmatizer
A lemmatizing package for use with a .csv dictionary of lemmas and their corresponding words
-
caseless
Unicode caseless matching
-
sauron-markdown
parsing markdown into sauron node
-
grep-rezza
An app for search text in file
-
fifthtry-mdbook
fork of mdbook, only for ft-cli
-
ron_to_table
pretty print RON as a table
-
lyt
A static site generator written in Rust
-
ignis
editing various 3DSFE formats
-
wkhtmltopdf
High-level bindings to wkhtmltopdf
-
yozuk-core-skillset
Set of default Yozuk skills
-
leven
Make your own blog!
-
blockcounter
Counts the blocks in a stream
-
rigrep
grep from Rust Book
-
justify
plaintext while handling Unicode gracefully
-
bottomify
Fantastic (maybe) CLI for translating between bottom and human-readable text
-
autoruby
Easily generate furigana for various document formats
-
html_to_epub
A command line converts .html file to .epub file
-
esl01-renderdag
Render a graph into ASCII or Unicode text
-
latin1str
Windows-1252 string types
-
summertail
Summarize your tail -f output, by reporting how many lines match regexes
-
informa
font compiler for digital typography
-
ammonia
HTML Sanitization
-
file-size
a function formatting file sizes in 4 chars
-
gestalt_ratio
Calculate the gestalt pattern matching ratio between two strings
-
hashtag-regex
regex matching hashtags accoding to the unicode spec: http://unicode.org/reports/tr31/#hashtag_identifiers
-
owoify
Text owoification library
-
recode_rs
Command-line tool for converting between the character encodings defined in the Encoding Standard
-
pcre2
High level wrapper library for PCRE2
-
ptero-cli
A text steganography CLI tool for Social Media
-
ghimlink
Generates a Github-Flavoured-Markdown formatted image link to an image in the same repository
-
image_ascii
transform images into ASCII art. Both into text and images consisting of letters.
-
lithe
A Slim template engine by using Pest
-
character-stream
Helper data structures for reading UTF-8 characters from a stream
-
dhoni
converting Bengali text into their phonetic counterpart
-
pdf_form
programatically filling out pdf forms
-
am-parse
converting asciimath to other formats
-
sparklet
small flashcards library
-
seq2xypic
Turn a text sequence diagram into a LaTeX xypic diagram
-
stringutils
A collection of various and (hopefully) useful String utility functions
-
cologne_phonetics
generate phonetic cologne codes for utf8 strings
-
lindera-ipadic-neologd-builder
A Japanese morphological dictionary builder for IPADIC NEologd
-
diacritics
Remove diacritics from letters, for example when standardizing input for a search
-
rustextile
Textile markup language parser for Rust
-
halfcaps
tRaNslAtE aNy TeXt To ThIs
-
eliza
natural language processing program developed by Joseph Weizenbaum in 1966
-
single_source
Generate code files from snippets in md tutorial files
-
termbook
behind the
termbook-cli
-
color-convert
Support RGB,RGBA,HEX,HSL,HSLA,HSV,CMYK to convert each other, write by rust
-
ansi-to-tui-forked
convert ansi color coded text into ratatui::text::Text type from ratatui library
-
byte_string
Wrapper types for outputting byte strings (b"Hello") using the Debug ({:?}) format
-
csvre
replacing data in CSV columns with regular expressions
-
txt_processor
A little library for text processing
-
latex_snippet
Convert even erroneous LaTeX snippets into HTML
-
demoji
Remove all emojis from a string
-
markdown-table-formatter
Markdown table formatter fully compliant with Unicode 15.1.0
-
bitfont
Takes an ASCII string and generates a vector containing a bitmap font, for easy overlay into images
-
tb_normalization
normalization utf8 string, loc dau vietnamese and some language
-
cistring
A string type that preserve case, but compares insensitiveley
-
hunspell-rs
Rust bindings to the Hunspell library
-
scie
research about how to build simple code identify engine for different languages
-
mdbook-snips
Markers for hidden lines in rust blocks within an mdbook
-
ultron
A web base text-editor with strong focus on displaying multi-width characters correctly
-
askama-filters
Extra template filters for Askama
-
strainer
Command-line tool for finding duplicate lines in text files
-
anagram
A collection of anagram utility functions
-
ttf_word_wrap
Wraps text based on character width
-
find_unicode
Find Unicode characters, the easy way!
-
pdf-merger-rs
app for merge PDF files
-
encoding-index-korean
Index tables for Korean character encodings
-
utf8_slice
Lightweight UTF8 Slice Utilities
-
chinese_segmenter
Tokenize Chinese sentences using a dictionary-driven largest first matching approach
-
encoding-index-tradchinese
Index tables for traditional Chinese character encodings
-
case-conv
Faster case conversion crate
-
yarner-lib
build plugins for the Literate Programming tool Yarner
-
dialogi
A dialog parser
-
mdtohtml
markdown to html renderer (with a couple of missing features)
-
minigrep_macano953
A fun grep tool written as my first incursion into Rust. Not intended to be inherited and used in production code
-
const_format_proc_macros
detail of the
const_format
crate -
ugrip
A i/o command that searches for word in a file
-
toml_document
Document-style TOML parser. Allows editing TOML document in a way which preserves whitespace and comments.
-
lexicmp
comparing and sorting strings lexicographically and naturally
-
goya
morphological analyzer for Rust and WebAssembly
-
syllable
counter for use with reading level calculations
-
pattern-3
Needle API (née Pattern API 3.0), generalization of
std::str::pattern
-
mdbook-wikilink
Support for wikilinks on mdBook
-
mdbook-checklist
An mdBook preprocessor for generating checklists and indexes
-
quoted-string-parser
Quoted string parser for grammar defined in RFC3261
-
demoji-rs
demoji implementation for Rust
-
aklat
create books from markdown files (like Gitbook)
-
doccy
brace based markup language
-
conllu-utils
working with the CoNLL-U dependency format
-
kindleclip
CLI tool to export a file per book in kindle 'My Clippings.txt'
-
tweak
when/then clauses to run
-
names-changer
Convert a names of sql schemes from camelcase to snake case
-
uniwhat
Display the unicode characters text
-
snakecase
general purpose snakecase implementation supporting both ascii and unicode
-
kaomoji-ru
The full http://kaomoji.ru database, encoded and escaped as byte literals
-
wordbreaker
A Unicode-aware no_std crate (requires alloc) that rapidly finds all sequences of dictionary words that concatenate to a given string
-
bible-reference-rs
Extract Bible references from plain text
-
spongedown
Converts markdown to html with svgbob support
-
pikchr-cli
PIC-like diagramming language to SVG converter
-
minigrep_v1
A lib from rust book, search word in contents
-
hemtt-tokens
A token library for hemtt
-
briefcase
Convert to any case!
-
mdbook-tectonic
An mdbook backend for generating LaTeX and PDF documents
-
mdbook-preprocessor-utils
writing mdBook preprocessors
-
m_lexer
extensible regular expressions based lexer
-
wordcut-engine
Word segmentation/breaking library
-
pangu
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
filecount
A modern high-performance open source file analysis library for automating localization tasks
-
kma-rustlang-vadym-polishchuk-english-parser
parser of English sentences created for KMA Rust course
-
terminal-supports-emoji
Check whether the current terminal supports emoji
-
asciimath-text-renderer
Render asciimath in terminal
-
zhconv-cli
Convert Traditional/Simplified Chinese and regional words of Taiwan/Hong Kong/mainland China/Singapore based on Wikipedia and OpenCC rulesets 轉換中文簡體、繁體及兩岸、新馬地區詞,基於維基百科和…
-
moenster
mønster (n) - pattern. simple glob-style pattern matching for strings
-
textcat
detect text categories. It can be used to detect the language of a given text
-
tectonic_xetex_layout
XeTeX's font loading and layout interface encapsulation, as a crate
-
rust-cedar
efficiently-updatable double-array trie in Rust (ported from cedar)
-
lean-sys
Bindings to Lean 4's C API
-
x1b
State tracker for 0x1b terminal escape codes. Also gives random characters names.
-
html2runes
An HTML to Text converter
-
bibutils-sys
Rust bindings for bibutils, a program for bibliography format interconversion
-
thfst-tools
Support tools for DivvunSpell - convert ZHFST files to BHFST
-
platina
parameterized golden testing library
-
lingua-french-language-model
The French language model for Lingua, an accurate natural language detection library
-
export-kindle
Export Kindle book list to JSON or YAML
-
jellybean-pack-1
Sweet syntax highlighting with tree-sitter
-
tectonic_bridge_icu
Expose a subset of the ICU Unicode APIs to Rust/Cargo
-
mdbook-all-the-markdowns
Render all markdown files in a given folder structure
-
dynfractal
A program to render any equation (in terms of z and c) into an ASCII fractal
-
unic-bidi
UNIC — Unicode Bidirectional Algorithm
-
txtpic
Generate Unicode art from images
-
nipah_tokenizer
A powerful yet simple text tokenizer for your everyday needs!
-
web-grep
A Grep Tool for HTML or XML
-
bullshit-rs
A BullshitGenerator implementation in Rust
-
local-encoding
encoding/decoding string with local charset. It usefull for work with ANSI strings on Windows.
-
bocu1
BOCU-1 compressed unicode encoding
-
pdf_forms
programatically filling out pdf forms
-
encoding-next-index-korean
Index tables for Korean character encodings
-
retest
Command-line regular expression tester
-
guarding_parser
Guarding is a guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.
-
trevordmiller
Personal CLI
-
scrambler
command line tool to scramble letters
-
typed-dialogflow
typed Google Dialogflow client
-
corg
A cog-like tool
-
adobe-cmap-parser
parse Adobe CMap files
-
rckive-genpdf
User-friendly PDF generator written in pure Rust
-
regex-lite
A lightweight regex engine that optimizes for binary size and compilation time
-
xim-ctext
compound text en/decoder
-
mojimoji-rs
fast converter between Japanese hankaku and zenkaku characters, mojimoji
-
terminal-emoji
safely displaying emoji inside of terminals
-
fast2s
A fast Traditional Chinese to Simplified Chinese conversion library. Built with FST, faster than most of other libraries.
-
lexer-generator
Lexer derived from Regex patterns with user customizeable tokens
-
bookkeeping
Minimal, safe and flexible in-memory bookkeeping API
-
mdbook-yml-header
mdBook preprocessor for removing yml header
-
daveparr_minigrep
command line tool to search for a string in a file based on 'The Rust Programming Language' book
-
mdbook-newday
Command line utility to add a new line for today to the top of an mdbook SUMMARY.md book
-
nb2fr
Convert integer number to literal French text
-
hashmoji_generator
Code generation tool for hashmoji
-
highlights
Convert kindle highlights into different formats
-
github-slugger
A slugger for GitHub headings
-
stardict_lib
Rust implement stardict
-
route-pattern
A parser and matcher for route patterns
-
suffix
arrays
-
wtf8
WTF-8 encoding. https://simonsapin.github.io/wtf-8/
-
rexpaint
functionality for reading and writing .xp files of the Grid Sage Games REXPaint ASCII art editor
-
geml
Generator-orientated ML parser
-
datex
package to automatically process your data and make it look pretty in LaTeX
-
prettify-markdown
Format Markdown at the speed of Rust
-
math_test
Generates math questions for elementary school kids
-
sesdiff
Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings in column A to the strings in column B. Also provides the edit distance (levenshtein).
-
rslnp
A configurable parser for scopes list notation (SLN)
-
basic-text
Basic Text strings and I/O streams
-
dictionary-1024
a mnemonic dictionary with 1024 words, 10 bits per word
-
typos-dict
Source Code Spelling Correction
-
noneifempty
Convert T to None if empty
-
jp-location-relation
隣接する市区町村の一覧を取得
-
font-index
Static index for system fonts
-
reap-grep
wgrep/write-grep CLI
-
ascii_tree
generates ascii trees
-
aki-json-pick
The json pick out command
-
react-html
react html components for frender
-
simple-text-pattern
Matching simple text patterns
-
bos_books_codes
that handles 3-character Bible Books Codes
-
ucd-util
A small utility library for working with the Unicode character database
-
emojito
Find all the Emoji in a string. Supports composed emoji.
-
vaporetto_rules
Rule-base filters for Vaporetto
-
vextractor
vextractor-cli
-
latex-to-html
Latex to html converter
-
unicode_categories_zheyuye
Query Unicode category membership for chars
-
mojibake
Encode/Decode bytes as emoji base2048
-
hline
a grep-like tool that highlights lines in files
-
rsplitter
Splitter is a tool to split words from text that contains no spaces
-
lyn
A scanner useful for creating recursive descent parsers
-
case_style
Converts from and to different case styles
-
trexter
Text progression tracking library
-
anagrambot
find anagrams of words
-
interpol
String interpolation macros
-
basen
Convert binary data to ASCII with a variety of supported bases
-
readable-readability
Really fast readability
-
hex_d_hex
HexDHex is a Rust Crate that encodes and decodes byte data to and from its hexidecimal representation. For instance, one may wish, on ocasion that is, to translate a utf8 or ASCII string…
-
latexify
Shared definition for turn a rust object into latex code
-
fingers
a finger client library
-
rosie-sys
build or link to librosie to access the Rosie Pattern Language
-
japhonex
Japanese phone number checker for Rust
-
naming_clt
Extract and convert the naming format(case|notation) of identifiers from files or stdin. Use this tool to prepare identifier name strings for further operations (matching,replacing...) on relative files
-
tblcompare
A tabular data comparison utility
-
pseudolocalize
Pseudolocalization crate for Rust
-
lindera-compress
A morphological analysis library
-
kanabake
Turn Romaji strings into Hiragana or Katakana
-
re2
Wrapper for the re2 C++ regex library
-
hoedown
bindings for the Hoedown markdown processor
-
seven_seg
Seven-segment digital display for terminal
-
tablers
create tables and export them into CSV or PDF. Without external deps
-
tex
The νTeX typesetting engine
-
indented
Format data with indentation
-
toc
generate toc from markdown or html
-
kytea-tokenizer
Wrapper of tokenization by KyTea
-
fnew
A Unicode-aware line-oriented drop-in replacement for coreutils' fold
-
shingles
in rust
-
scenic
Presentation tool that utilizes LaTex Beamer package
-
twemoji-rs
A word-cloud image generation crate
-
company_hr
My approach to the third problem from chapter 8.3 (Common Collections) of 'The Book'
-
songww-harfbuzz-rs
Rust bindings to the HarfBuzz text shaping engine
-
wordninja
port of the Word Ninja English word splitting library
-
bgrep
grep tailored to handle binary patterns and files
-
atm_parser_helper_common_syntax
Parsing utilities for the common syntax specification
-
mercurius
Extended pulldown-cmark
-
fancy_print
Easily print animated ASCII text to the terminal!
-
indentation_flattener
From indented input, generate plain output with indentation PUSH and POP codes
-
encoding_c_mem
C API for encoding_rs::mem
-
mystem
Wrapper around Yandex Mystem for Rust
-
munemo-rs
Turn an integer into a more rememberable word, or vice-versa
-
mdtranslation-cli
Command-line tools for using mdTranslation, which can be used to prepare multi-lingual Markdown documents
-
encoding8
various 8-bit encodings
-
psa
PSA(Project structure analysis) is a analyzer for analysis project struct
-
spyglass
Search engine for documents, inspired by bioinformatics
-
unidades
playing with physical units
-
grepy
search files
-
unicode-utf8
that converts utf-8 bytes to a unicode scalar value, and vice versa
-
mepple
English dictionary as a library
-
iterlower
Final-sigma-correct lowercasing iterator adapter with option for Turkish/Azeri I behavior
-
utf8reader
wrapper around Reader that returns a stream of UTF-8 characters
-
xsv
A high performance CSV command line toolkit
-
hex_table
Format a byte slice into a hex table
-
codam-mdbook
Creates a book from markdown files
-
mdbook-numthm
An mdbook preprocessor for automatically numbering theorems, lemmas, etc
-
cp437_constants
Defined constants covering the entirety of ASCII code page 437 (https://en.wikipedia.org/wiki/Code_page_437)
-
ced
Dead easy csv editor
-
stylometry
A package to perform stylometry operations
-
rut
A small UTF-8 parsing library for applications that need to parse individual
char
s -
homoglyph-service
homoglyph-cli
-
jpreprocess-dictionary
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
forgiving-htmlescape
HTML entity encoding and decoding, with support for leaving malformed entities intact
-
lingua-spanish-language-model
The Spanish language model for Lingua, an accurate natural language detection library
-
markx
markdown parser
-
budgetinvoice
Very easy to use pdf invoice generator
-
poetry-book
Create a poetry book in latex, starting from plain text
-
conveyance
A stop-gap CLI for conveyancing
-
text_art_cn
A text art generator can include any kind of characters depending on the font. The art is composed of Braille characters like ⠐ ⠝ ⠿ ⣫ ⣷
-
kase
Convert from one case to another. Input case is determined on a best-guess basis, but can be overridden with the --from flag.
-
aqua_parse
A parser library
-
jpreprocess-naist-jdic
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
deface
Lightweight markup to HTML converter
-
kradical_static
Ready-to-use EDRDG radical decompositions
-
vl-convert-pdf
convert SVG to PDF with embedded text
-
notedown-rs
Notedown Text parser and formatter
-
table_to_html
interface to convert a
tabled::Table
into a HTML table (<table>
) -
hashlogs
Command-line utility that hashes the part before a space on each line from stdin with blake2b keyed with an ephemeral randomly-generated key and writes to stdout
-
mdbook-chapter-number
A mdBook preprocessor that adds chapter numbers to the each page header
-
goya-ipadic
IPA dictionary for Goya
-
mdbook-to-github-wiki
Turns an mdbook book into a github wiki
-
newline-converter
Newline byte converter library
-
betaconvert
CLI for Betacode conversion
-
glyphana
Quickly find, inspect & collect unicode glyps
-
kth-lines
Command line tool for filtering stdin lines that just work
-
char_reader
Safely read wild streams as chars or lines
-
charclass
define and modify unicode character classes
-
wz-conf
Configuration options for wz
-
pest_ascii_tree
Helper crates converting the parsing result of any pest grammar into an ascii tree
-
indexrs
inefficient multi-language search index
-
mdbook-trace
A traceable document preprocessor for mdbook
-
jellybean-pack-2
Sweet syntax highlighting with tree-sitter
-
overlap
shows overlap text in files
-
rusty_code_code_for_book
my book_rusty code
-
mdbook-inline-highlighting
mdBook preprocessor that enables support for inline highlighting
-
fum
fum finds fuzzy matches to a literal search pattern, searching recursively through all the files in the current directory and respecting gitignore rules
-
strng
Owned UTF-8 string
-
luaparse
A Lua 5.3 parser
-
unicode-line-stacker
Tiny library for stacking Unicode line-drawing characters on top of each other
-
easyinvoice
Very easy to use pdf invoice generator
-
mdbook-readme-summary
A mdbook Preprocessor that generates table for content for each subdirectory README files
-
stopwords
from popular text processing frameworks
-
kaomji-ru
The full http://kaomoji.ru database, encoded and escaped as byte literals
-
nib
static site generator
-
rdftk_iri
IRI and URI specifications
-
ra_ap_ide_ssr
Structural search and replace of Rust code
-
unic-idna-mapping
UNIC — IDNA — IDNA Mapping Table
-
rep-cli
Replace text file in bulk
-
corpus-preproc
A preprocessor for text and HTML corpora
-
encoding
Character encoding support for Rust
-
vaporetto_tantivy
Vaporetto Tokenizer for Tantivy
-
slicer
that slices string slices into smaller string slices
-
texrender
Thin wrapper around running
latexmk
to render LaTeX documents. Also supports generating Tex documents. -
stfu
Shut The Ferris Up - profanity filtering for Rust
-
cjieba-sys
unsafe ffi to cppjieba
-
tectonic_bridge_graphite2
Expose the graphite2 library to Rust/Cargo
-
spellcheck
A spell-checker based on the statistical algorithm described by Peter Norvig in http://norvig.com/spell-correct.html
-
pdftotext
High-level library that binds to Poppler to extract text from a PDF
-
encoding-index-japanese
Index tables for Japanese character encodings
-
nutrimatic
Tools for reading Nutrimatic (https://nutrimatic.org) index files
-
contractions
expand contractions in English
-
exit-with-code
A CLI which returns the exit code of your choice
-
panduck-latex
Use panduck to generate XeLaTeX
-
deepfrog
A deep learning NLP suite (PoS,lemmatiser,NER) with FoLiA XML support
-
regex-cli
A command line tool for debugging, ad hoc benchmarking and generating regular expressions
-
unicode_clusters
variable width unicode characters as single items, allowing for array like indexing etc
-
unicode_names
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
chanoma
Characters Normalization library. 文字列正規化処理用のライブラリです。
-
minigrep_maybe
Just another minigrep from Rust book
-
base_emoji
Convert everything to Emojis (and back)! - 🐫🔦🔥🎲🐬🍀🍟🔋🐬🎲🐬🍀🎁🍟💧👂🔥🚪🔋🍟🔦🔋🚗👣🔦🌍👂🍤🐗
-
infisearch_lang_ascii
Basic ascii tokenizer for InfiSearch
-
code-tour
Enhanced example-based learning, i.e. awesome examples user experience
-
ucd-parse
parsing data files in the Unicode character database
-
str_to_bytes
parse a string representations of numbers or ascii text and return a Vec<u8>
-
scannedpdf
A fast, low memory footprint, simple images to PDF crate
-
lindera-ipadic
A Japanese morphological dictionary for IPADIC
-
random_access_unicode
Provide random access to the unicode characters in a file
-
saku
efficient rule-based Japanese Sentence Tokenizer
-
unicode-character-database
Unicode character database tables (Unicode Standard Annex #44) generated using
ucd-generate
-
mdoc
Modern PDF creation through Markdown and LaTeX
-
lindera-decompress
A morphological analysis library
-
unicode_types
A mapping of all the unicode characters into convenience types (one enum per block of characters with one variant per character)
-
mdbook-treesitter
mdBook preprocessor for html adding tree-sitter highlighting support
-
esc
Escape characters in strings
-
alphabet-encoder
A quick and dirty way to deal with escape characters
-
lindera-ko-dic
A Japanese morphological dictionary for ko-dic
-
static_table
creates pretty tables at compiler time
-
equt-md-ext
Extend event iterator
-
ansinator_ansi_image
defining specific structures for representing and converting an image in ansi
-
japanese-ruby-filter
Japanese ruby notation parser
-
simplearrayhash
-
lingua-chinese-language-model
The Chinese language model for Lingua, an accurate natural language detection library
-
encoding-next-index-japanese
Index tables for Japanese character encodings
-
tpng
A small tool that prints truecolor png renderings to the terminal using unicode block characters
-
mdtable-cli
that makes creating tables in markdown much easier!
-
ruby-parser
A parser for the Ruby language
-
asciifolding
ascii folding library
-
translation_api
translation API
-
mdbook-to-example
Turns an mdbook book into a Rust example
-
lazy-char-iter
Lazily iterate over Unicode characters from a u8 slice in Rust
-
mdtoc
generate table of contents for markdown
-
debug_canvas
Easily print complicated debug strings
-
enchode
ASCII enchoder
-
unicode_skeleton
detects unicode strings that look nearly identical once rendered, but do not compare as equal. It defines "confusable" and "skeleton" based on Unicode Standard Annex #39
-
backslash
parsing escape characters
-
unic-ucd-block
UNIC — Unicode Character Database — Unicode Blocks
-
grep-searcher
Fast line oriented regex searching as a library
-
cjk_entity_extractor
Entity Extractor for CJKV Text
-
unic-idna
UNIC — Unicode IDNA Compatibility Processing
-
yeslogic-ucd-parse
parsing data files in the Unicode character database
-
encoding-index-simpchinese
Index tables for simplified Chinese character encodings
-
jellybean-pack-0
Sweet syntax highlighting with tree-sitter
-
nib-cli
A cli for a yet another static site generator Nib
-
conv-rs
CLI converter for multi extensions
-
unidecode
pure ASCII transliterations of Unicode strings
-
forming
lightweight architecture as code language. 架构描述语言
-
flag_by_code
Easiest way to get an emoji flag from BCP-47 language tags
-
mdbook-extended-markdown-table
Preprocessor for mdBook that generates tables with merged cells from ASCII text
-
djot
Djot parser written in pure Rust
-
tectonic_engine_xdvipdfmx
The
xdvipdfmx
program as a reusable crate -
fst-subseq-ascii-caseless
An automaton that matches if the input contains a specific subsequence ignoring ASCII case to be used with fst
-
asciis
ASCII base on RFC20. Just ord() and chr()
-
simplecc
Chinese Convert library (partially) compatible with OpenCC's dictionaries
-
uecho
The unicode of the echo command
-
crabfetcher
A fetcher written in rust
-
dechode
Dechoder to ASCII
-
ascii-alphabetic-char
Traits for ASCII alphabetic characters
-
text-tables
A terminal/text table prettifier with no dependencies
-
xmldecl
Extracts an encoding from an ASCII-based bogo-XML declaration in text/html in a Web-compatible way
-
chinese-number-uppercase
Convert Arabic figures to Chinese uppercase number
-
swappy
An anagram generator
-
brainfuck_interpreter
brainfuck interpreter written in Rust
-
minigrep_rustbook_2023
grep tool project from the Rust Book
-
boilerpipe
text extraction from HTML documents
-
jpreprocess-dictionary-builder
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
epubparse
Parse epub and convert to text-only Book structure
-
lingua-portuguese-language-model
The Portuguese language model for Lingua, an accurate natural language detection library
-
lindera-unidic
A Japanese morphological dictionary for UniDic
-
ascii-read
BufRead
-like methods for reading into anAsciiString
-
json-event-parser-witespace
A json event parser with whitespace
-
korrektor-utils
Utils library for korrektor-rs
-
ascii_help
help you quickly convert ASCII codes
-
convert_encoding
Convert encoding of text files in batch
-
unic-char
UNIC — Unicode Character Tools
-
jpreprocess-window
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
pcre2-sys
Low level bindings to PCRE2
-
mdlynx
Small, fast utility to find broken file links in Markdown documents
-
yeslogic-fontconfig
RENAMED: use the fontconfig crate instead
-
writedown
format parser
-
markdown2unicode
Converter from markdown notation to unicode characters
-
rustyword
An anagram finder
-
chiselj
JSON command line utility
-
owlz
"Owlz" ascii emoojis, created randomly or by design
-
lindera-cc-cedict
A Japanese morphological dictionary for CC-CEDICT
-
mon-artist
Render ASCII art to SVG, with user-customizable rendering rules
-
emojicons
Parse :emoji: notation to unicode representation
-
uniaxe
replace Unicode letters with Ascii equivalents
-
grep-pcre2
Use PCRE2 with the 'grep' crate
-
replace-all
Cli to quickly replace occurences of a word in a file
-
pdf-annotations-converter
Converts annotations found in PDF files to different formats
-
tpaint
Terminal based unicode and ascii diagrams and drawings
-
xsystem
Conversion between the Esperanto x-system and Unicode circumflexes
-
fountain
Parser and formatter for the Fountain screenplay markup language
-
rusty_word_builder
Syllable and Word generation library written fully in Rust
-
toml_to_table
pretty print TOML as a table