-
regex
regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
textwrap
word wrapping, indenting, and dedenting strings. Has optional support for Unicode and emojis as well as machine hyphenation.
-
encoding_rs
A Gecko-oriented implementation of the Encoding Standard
-
unicode-normalization
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
similar
A diff library for Rust
-
fancy-regex
regexes, supporting a relatively rich set of features, including backreferences and look-around
-
heck
case conversion library
-
const_format
Compile-time string formatting
-
convert_case
Convert strings into any case
-
ropey
A fast and robust text rope for Rust
-
lazy-regex
lazy static regular expressions checked at compile time
-
pulldown-cmark
A pull parser for CommonMark
-
unicase
A case-insensitive wrapper around strings
-
unicode-segmentation
Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules
-
deunicode
Convert Unicode strings to pure ASCII by intelligently transliterating them. Suppors Emoji and Chinese.
-
scraper
HTML parsing and querying with CSS selectors
-
unicode-bidi
Unicode Bidirectional Algorithm
-
rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
html2text
Render HTML as plain text
-
termimad
Markdown Renderer for the Terminal
-
emojis
✨ Lookup emoji in *O(1)* time, access metadata and GitHub shortcodes, iterate over all emoji, and more!
-
ammonia
HTML Sanitization
-
widestring
wide string Rust library for converting to and from wide strings, such as those often used in Windows API or other FFI libaries. Both
u16
andu32
string types are provided, including support for UTF-16 and UTF-32… -
lopdf
PDF document manipulation
-
mdbook
Creates a book from markdown files
-
lngcnv
linguistics: display pronunciation, translate between dialects, convert between orthographies; support for multiple languages: English, Latin, Polish, Quechua, Spanish, Tikuna
-
strip-ansi-escapes
Strip ANSI escape sequences from byte streams
-
prettydiff
Side-by-side diff for two files
-
fuzzy-matcher
Fuzzy Matching Library
-
unicode-general-category
Fast lookup of the Unicode General Category property for char
-
regress
A regular expression engine targeting EcmaScript syntax
-
linkify
Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.
-
pulldown-cmark-to-cmark
Convert pulldown-cmark Events back to the string they were parsed from
-
text-splitter
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
-
finl_unicode
handling Unicode functionality for finl (categories and grapheme segmentation)
-
lindera
A morphological analysis library
-
printpdf
reading and writing PDF files
-
onig
Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.
-
garde
Validation library
-
titlecase
Capitalize text according to a style defined by John Gruber for Daring Fireball
-
font-kit
A cross-platform font loading library
-
charabia
detect the language, tokenize the text and normalize the tokens
-
roff
ROFF (man page format) generation library
-
unicode-script
exposes the Unicode
Script
andScript_Extension
properties from UAX #24 -
synoptic
low-level, syntax highlighting library with unicode support
-
const-str
compile-time string operations
-
unescaper
Unescape strings with escape sequences written out as literal characters
-
diff
An LCS based slice and string diffing implementation
-
nucleo
plug and play high performance fuzzy matcher
-
mkrs
Build automation tool
-
os_display
Display strings in a safe platform-appropriate way
-
diffy
Tools for finding and manipulating differences between files
-
edit
Open a file in the default text editor
-
chardetng
A character encoding detector for legacy Web content
-
stringsext
find multi-byte-encoded strings in binary data
-
inlinable_string
inlinable_string
crate provides theInlinableString
type – an owned, grow-able UTF-8 string that stores small strings inline and avoids heap-allocation – and theStringExt
trait… -
smartcat
Putting a brain behind
cat
. CLI interface to bring language models in the Unix ecosystem 🐈⬛ -
hyperlink
Very fast link checker for CI
-
cruet
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
ascii
ASCII-only equivalents to
char
,str
andString
-
Inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
line-index
Maps flat
TextSize
offsets to/from(line, column)
representation -
xan
The CSV magician
-
wana_kana
checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji
-
uuhelp_parser
A collection of functions to parse the markdown code of help files
-
whyq
jq wrapper
-
str_indices
Count and convert between indexing schemes on string slices
-
mdxjs
Compile MDX to JavaScript in Rust
-
ferris-says
flavored replacement for the classic cowsay
-
unicode_names2
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
stringzilla
Faster SIMD-accelerated string search, sorting, fingerprints, and edit distances
-
autocorrect
A linter and formatter for help you improve copywriting, to correct spaces, words, punctuations between CJK (Chinese, Japanese, Korean)
-
entities
raw data needed to convert to and from HTML entities
-
jieba-rs
The Jieba Chinese Word Segmentation Implemented in Rust
-
mdbook-katex
mdBook preprocessor rendering LaTeX equations to HTML
-
google-translate2-cli
A complete library to interact with Translate (protocol v2)
-
epub-builder
generating EPUB files
-
unicode-case-mapping
Fast lowercase, uppercase, and titlecase mapping for characters
-
ncount
A word count tool intended to derive useful stats from markdown
-
mdbook-pdf
A backend for mdBook written in Rust for generating PDF based on headless chrome and Chrome DevTools Protocol
-
two_percent
Fuzzy Finder in rust!
-
scraps
static site generator based on Markdown files written with simple Wiki-link notation. It can be used primarily for personal or team knowledge management.
-
decancer
that removes common unicode confusables/homoglyphs from strings
-
regex-cursor
regex fork that can search discontiguous haystacks
-
hck
A sharp cut(1) clone
-
llmvm-core
The core application for llmvm
-
pdf-extract
extract content from pdfs
-
prop-check-rs
A Property-based testing Library in Rust
-
hgrep
grep tool with human-friendly search output. This is similar to
-C
option ofgrep
command, but its output is enhanced with syntax highlighting focusing on human readable outputs. -
matchers
Regex matching on character and byte streams
-
unicode-blocks
contains a list of all unicode blocks and provides some functions to search across them
-
unindent
Remove a column of leading whitespace from a string
-
unicode-xid
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
omekasy
Decorate alphanumeric characters in your input with various font; special characters in Unicode
-
mdbook-admonish
A preprocessor for mdbook to add Material Design admonishments
-
arrow-string
String kernels for arrow arrays
-
bfom
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
unicode_categories
Query Unicode category membership for chars
-
blockwatch
Linter that tracks changes between dependent blocks of code
-
cargodisttest
💬 a CLI for learning to distribute CLIs in rust
-
font-types
Scalar types used in fonts
-
rust-stemmers
some popular snowball stemming algorithms
-
regex-syntax
A regular expression parser
-
aki-resort
sort lines of text. You can use regex to specify the KEY.
-
text_io
really simple to use panicking input functions
-
trivet
Parser Library
-
uncased
Case-preserving, ASCII case-insensitive, no_std string types
-
netidx
Secure, fast, pub/sub messaging
-
stop-words
Common stop words in many languages
-
stringcase
Converts string cases between camelCase, COBOL-CASE, kebab-case, and so on
-
uncomment
A cli tool to remove comments from code. Supports multiple languages.
-
difflib
Port of Python's difflib library to Rust
-
dmos-cli
Djot HTML renderer with advanced features - CLI
-
languagetool-rust
LanguageTool API bindings in Rust
-
html2md
binary to convert simple html documents into markdown
-
xi-unicode
Unicode utilities useful for text editing, including a line breaking iterator
-
unicode-id
Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31
-
vaporetto
pointwise prediction based tokenizer
-
schemat
A code formatter for Scheme, Lisp, and any S-expressions
-
wildcard
matching
-
rumdl
A fast Markdown linter written in Rust (Ru(st) MarkDown Linter)
-
apisnip
A terminal user interface (TUI) tool for trimming OpenAPI specifications down to size ✂️
-
tossicat
입력된 단어에 맞게 같이 입력된 토시(조사)를 적절하게 변환하는 라이브러리
-
mdbook-yapp
A mdBook preprocessor for simple text replacements
-
slice-command
slice is a command-line tool that allows you to slice the contents of a file using syntax similar to Python's slice notation
-
wezterm-bidi
The Unicode Bidi Algorithm (UBA)
-
file-organiser
Command line file manager to list, move or delete large numbers of files in nested folders filtered by age, file extension, file name pattern and/or size range
-
stam
powerful library for dealing with stand-off annotations on text. This is the Rust library.
-
cow-utils
Copy-on-write string utilities for Rust
-
bundle_repo
Pack a local or remote Git Repository to XML for LLM Consumption
-
gemini-map
A command-line tool to run files in parallel through Google Gemini
-
stylin
Convert markdown to pandoc markdown with custom styles
-
any_ascii
Unicode to ASCII transliteration
-
lindera-tantivy
Lindera Tokenizer for Tantivy
-
sliceslice
A fast implementation of single-pattern substring search using SIMD acceleration
-
newdoc
Generate pre-populated module files formatted with AsciiDoc that are used in Red Hat and Fedora documentation
-
zawk
An efficient Awk-like language implementation by Rust with stdlib
-
diff-match-patch-rs
The fastest implementation of Myer's diff algorithm to perform the operations required for synchronizing plain text
-
nom-unicode
Unicode extensions for Nom
-
charset
Character encoding decoding for email
-
hyphenation
Knuth-Liang hyphenation for a variety of languages
-
pad
padding strings at runtime
-
vesti
A preprocessor that compiles into LaTeX
-
mdbook-catppuccin
🎊 Soothing pastel theme for mdBook
-
tiefdownconverter
A CLI tool to manage and convert Markdown-based projects
-
collclean
Clean up collaboration commands in LaTeX files
-
tau-engine
A document tagging library
-
near-facsimile
Find similar or identical text files in a directory
-
hypher
separates words into syllables
-
norad
Read and write Unified Font Object files
-
nu_plugin_regex
nu plugin to search text with regex
-
tkrar
Count frequency of words in a file or a directory
-
trans-epub
Translate EPUB with CLI
-
unicode-reverse
Unicode-aware in-place string reversal
-
unicode-bidi-mirroring
Unicode Bidi Mirroring property detection
-
presenterm
A terminal slideshow presentation tool
-
quixote
Quizzes and tests in Markdown
-
topiary-cli
CLI app for Topiary, the universal code formatter
-
chewing
(酷音) intelligent Zhuyin input method
-
allms
One Library to rule them aLLMs
-
htmd-cli
The command line tool for htmd
-
sigrs
Interactive grep (for streaming)
-
sile
Simon’s Improved Layout Engine
-
uwc
Counts things in unicode text files
-
rsrpp-cli
project for research paper pdf
-
nmd
Official NMD CLI and compiler
-
graphannis
new backend implementation of the ANNIS linguistic search and visualization system
-
igrepper
The interactive grepper
-
mdbook-epub
An EPUB renderer for mdbook
-
wchar
Procedural macros for compile time UTF-16 and UTF-32 wide strings
-
regex-literal
delimited regular expression literals
-
sd
An intuitive find & replace CLI
-
mdbook-environment
A preprocessor for MdBook for working with environment variables
-
boreal
evaluate YARA rules, used to scan bytes for textual and binary pattern
-
molybdenum
Recursive search and replace CLI application
-
line-ending
Detect, normalize, and convert line endings across platforms, including support for character streams. Ensures consistent handling of LF, CRLF, and CR line endings in text processing.
-
asciidork-cli
Asciidork CLI
-
matcher_rs
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
tbll
tbll outputs data in tabular format
-
promptify
A plaintext directory formatting tool for interacting with LLMs on the command line
-
probly-search
A lightweight full-text search engine with a fully customizable scoring function
-
mdcat
cat for markdown: Show markdown documents in terminals
-
unicode-joining-type
Fast lookup of the Unicode Joining Type and Joining Group properties
-
moonwave
generating documentation from comments in Lua source code
-
smeagol-wiki
A personal wiki webserver. Work in progress.
-
mdbook-pandoc
A pandoc-powered mdbook backend
-
tree-sitter-stack-graphs-typescript
Stack graphs definition for TypeScript & TSX using tree-sitter-typescript
-
mdbook-combiner
combine mdbook summaries from multiple source into one mdbook
-
string-offsets
Converts string offsets between UTF-8 bytes, UTF-16 code units, Unicode code points, and lines
-
idna
IDNA (Internationalizing Domain Names in Applications) and Punycode
-
etradeTaxReturnHelper
Parses etrade and revolut financial documents for transaction details (income, tax paid, cost basis) and compute total income and total tax paid according to chosen tax residency (currency)
-
arf-strings
Encoding and decoding for ARF strings
-
nvl-cli
A program to download webnovels
-
reword
some utility functions for human-readable formatting of words
-
fm
Non-backtracking fuzzy text matcher
-
minimizer
Minimize files to find minimal test case
-
uuid25
25-digit case-insensitive UUID encoding
-
mdbook-quiz
Interactive quizzes for your mdBook
-
wrapr
wrap your code for ai
-
obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
lindera-ko-dic-builder
A Korean morphological dictionary builder for ko-dic
-
unicode_titlecase
add Unicode titlecase and Turkish and Azeri locale upper/lowercase utilities to chars and strings
-
mdbook-typst
An mdBook backend to output Typst markup, pdf, png, or svg
-
heatseeker
A fast, robust, and portable fuzzy finder
-
latex-thebib
Clean and sort legacy TeX bibliographies written using ‘thebibliography’ via the
refactor
sub-command. Compile BibTeX files to legacythebibliography
TeX code using thecompile
sub-command… -
fiat-lux
Offline terminal-accessible Bible
-
indefinite
Prefix a noun with an indefinite article - a or an - based on whether it begins with a vowel
-
freetype-rs
Bindings for FreeType font library
-
bashdoc
generating documentation/help menu for user defined bash functions
-
yake-rust
Yake (Yet Another Keyword Extractor) in Rust
-
yara-x-parser
A parsing library for YARA rules
-
wordcut-engine
Word segmentation/breaking library
-
say-rust
command-line tool which is an alternative to echo
-
ricat
A Rust-Based implemenation of classic UNIX
cat
command -
scru64
Sortable, Clock-based, Realm-specifically Unique identifier
-
skyspell
Fast and handy spell checker for the command line
-
simple-string-patterns
Makes it easier to match, split and extract strings in Rust without regular expressions. The parallel string-patterns crate provides extensions to work with regular expressions via the Regex library
-
utf16_iter
Iterator by char over potentially-invalid UTF-16 in &[u16]
-
iepub
epub、mobi电子书读写
-
levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
document_tree
reStructuredText’s DocumentTree representation
-
line-numbers
Find line numbers in strings by byte offsets, quickly
-
byteyarn
hyper-compact strings
-
qpdf
Rust bindings to QPDF C++ library
-
rustc-literal-escaper
code to unescape string literals
-
fuzzy-muff
Fuzzy Matching Library
-
svgdx-pandoc
pandoc filter for svgdx codeblocks in Markdown
-
repgrep
An interactive command line replacer for
ripgrep
-
mdbook-mermaid
mdbook preprocessor to add mermaid support
-
aneubeck-daachorse
Daachorse: Double-Array Aho-Corasick
-
roman-numerals-rs
Manipulate well-formed Roman numerals
-
aki-mcolor
mark up text with color
-
rhai-autodocs
Custom documentation generator for the Rhai scripting language
-
mdfried
A markdown viewer for the terminal that renders images and big headers
-
epub
support the reading of epub files
-
mdbook-tailor
mdbook preprocessor for image-tailor
-
zhconv
Traditional/Simplified and regional Chinese variants converter based on MediaWiki & OpenCC rulesets and powered by AC automata 轉換简体、繁體及兩岸、新馬中文地區詞,基於MediaWiki和OpenCC之字詞轉…
-
unicode-truncate
Unicode-aware algorithm to pad or truncate
str
in terms of displayed width -
affinidi-messaging-text-client
Affinidi Messaging SDK
-
htop
HTML to PDF converter
-
mdbook-graphviz
mdbook preprocessor to add graphviz support
-
instant-segment
Fast English word segmentation
-
character_converter
Turn Traditional Chinese script ot Simplified Chinese script and vice-versa and tokenize
-
charasay
The future of cowsay 🐮! Colorful characters saying something 🗨️
-
sgrep
grep util for those lazy to remember many command line options
-
lexicmp
comparing and sorting strings lexicographically and naturally
-
frizbee
Fast fuzzy matching via SIMD smith waterman, similar algorithm to FZF/FZY
-
vmks-exam-generator
CLI program for pseudo-randomly generating different variants of an embedded programming exam
-
clipcount
Counting words from the clipboard content
-
snakit
Command-line tool that recursively renames all files and folders within a specified directory to snake_case
-
lipsum
lorem ipsum text generation library. It generates pseudo-random Latin text. Use this if you need filler or dummy text for your application. The text is generated using a simple Markov chain…
-
nanohtml2text
A zero-dependency library to convert HTML to plain text
-
adrs
Architectural Decision Record command line tool
-
asimov-dataset-cli
ASIMOV Dataset Command-Line Interface (CLI)
-
grok
popular java & ruby grok library which allows easy text and log file processing with composable patterns
-
pathmut
Command line utility for manipulating path strings
-
inkjet
A batteries-included syntax highlighting library for Rust, based on tree-sitter
-
mdbook-theme
A preprocessor and a backend to config theme for mdbook, especially creating a pagetoc on the right and setting full color themes from the offical ace editor
-
substudy
Language-learning tools for working with parallel, bilingual subtitles and media files
-
textwrap-cli
Command line interface for textwrap
-
mdbook-alerts
mdBook preprocessor to add GitHub Flavored Markdown's Alerts to your book
-
llguidance
Super-fast Structured Outputs
-
precis-tools
Tools and parsers to generate PRECIS tables from the Unicode Character Database (UCD)
-
kelp
A convert tool for Japanese
-
tabprinter
creating and printing formatted tables in the terminal. It supports various table styles and offers both color and non-color output options.
-
regexml
XPath compatible regex engine
-
mut-str
A toolkit for working with mutable string slices (&mut str)
-
yeslogic-ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched with support for additional tables
-
trprvr
TRanslate PRogress VieweR
-
hauchiwa
Incredibly flexible static site generator library with incremental rebuilds and cached image optimization
-
crankshaft-config
Configuration facilities for Crankshaft
-
stfu8
Sorta Text Format in UTF-8
-
unicode-security
Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules
-
rst
a reStructuredText parser and renderer for the command line
-
text2num
Parse and convert numbers written in English, Dutch, Spanish, Portuguese, German, Italian or French into their digit representation
-
addbib
An app to add linked bibliographies to markdown files
-
thoth-note
note-taking app written in Rust
-
fuzzt
Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.
-
plsfix
Text cleaner upper
-
sapling-streampager
streampager is a pager for command output or large files
-
codetypo-vars
Source Code Spelling Correction
-
rapidfuzz
rapid fuzzy string matching library
-
case_insensitive_hashmap
A HashMap that uses case-insensitive strings as keys
-
madato
command line tool for reading and writing tabular data (XLS, ODS, CSV, YAML), and Markdown
-
whitespace-sifter
Sift duplicate whitespaces away!
-
rust-persian-tools
Official Rust implementation of Persian Tools
-
asciimath-unicode
Convert asciimath to unicode
-
jx
An interactive JSON explorer for the command line
-
vader-sentimental
A faster Rust version from the original Python VaderSentiment analysis tool
-
jetscii
A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes
-
colornames
An enum of color names, with a catchall RGB variant
-
ipset_lookup
ipset is a command-line tool that takes networks or IPs and searches through a lot of different threat feeds quickly. It can also download the feed data necessary to perform the queries…
-
bogrep
Full-text search for bookmarks from multiple browsers
-
herring-automata
Automata construction for Herring
-
hyphertool
Hypertool is a command-line tool for syllabification and hyphenisation
-
mdmodels
generate models, code and schemas from markdown files
-
galm
pattern matching library
-
typst-ansi-hl
highlights your Typst code using ANSI escape sequences
-
veg
Flexible tables
-
quranize
Encoding transliterations into Quran forms
-
swift-check
High-performance, robust, and expressive searching and validation (uses SIMD on x86_64, aarch64, and WASM)
-
patchkit
parsing and manipulating patch files
-
tesseract-rs
Rust bindings for Tesseract OCR with optional built-in compilation
-
COXave
Instruments for codings
-
rustdoc-stripper
manipulate rustdoc comments
-
picodiff
Tiny GUI app to compare text easily
-
zet
zet finds the union, intersection, set difference, etc of files considered as sets of lines
-
vidyut-kosha
A Sanskrit key-value store
-
rust_string_utils
String utilities for rust based on org.apache.commons.lang3
-
uast
Unicode Aware Saṃskṛta Transliteration in Rust 🦀
-
ripsecrets
A command-line tool to prevent committing secret keys into your source code
-
tantivy-common
common traits and utility functions used by multiple tantivy subcrates
-
spellbook
A spellchecking library compatible with Hunspell dictionaries
-
subconverter
A more powerful utility to convert between proxy subscription format
-
autumnus
Syntax highlighter powered by Tree-sitter and Neovim themes
-
asciisavers
A small collection of ascii screensavers
-
babel
Provide Rust enums for Groq, SambaNova, Openrouter's llm model names
-
unicode-properties
Query character Unicode properties according to UAX #44 and UTR #51
-
mdbook-d2
D2 diagram generator plugin for MdBook
-
unic-ucd-ident
UNIC — Unicode Character Database — Identifier Properties
-
mandown
Markdown to groff (man page) converter
-
mdbook-cmdrun
mdbook preprocessor to run arbitrary commands
-
vlazba
Lojban words generator and analyzer
-
text-to-ascii-art
program to convert text to ASCII art
-
pretty-xmlish
Pretty print XML-ish data with unicode art
-
rustkorean
processing Korean characters. It provides functionalities to check if a character is Korean, classify Korean characters, verify if a character is a leading consonant (choseong), a medial vowel (jungseong)…
-
furigana
Map furigana to a word given its reading
-
harfbuzz_rs
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
gh-emoji
Convert
:emoji:
to Unicode using GitHub’s emoji names -
dprint-plugin-markdown
Markdown formatter for dprint
-
mdka
HTML to Markdown converter
-
mfmt
Meta formatter library
-
hashmoji_generator
Code generation tool for hashmoji
-
emojic
Emoji constants
-
oxford_join
Join string slices with Oxford Commas!
-
kas-text
Text layout and font management
-
autotex
Continuously compile TeX and LaTeX
-
see-cat
A cute cat(1)
-
figlet-comment
quickly create banner to use as comments
-
capitalize
Change first character to upper case and the rest to lower case, and other common alternatives
-
textalyzer
Analyze key metrics like number of words, readability, and complexity of any kind of text
-
sublime_fuzzy
Fuzzy matching algorithm based on Sublime Text's string search
-
the_rock
A command line King James bible viewer
-
rustic_print
A versatile Rust library for enhancing console output. It offers a range of features to create a more engaging and informative command-line interface.
-
nlpo3
Thai natural language processing library, with Python and Node bindings
-
ticker-sniffer
extracting multiple stock ticker symbols from a text document
-
quagga
CLI tool that combines multiple text files into a single prompt suitable for Large Language Models
-
stam-tools
Command-line tools for working with stand-off annotations on text (STAM)
-
mdbook-angular
mdbook renderer to run angular code samples
-
what-rs
Identify what something is! A pyWhat reimplementation in Rust
-
misanthropy
An interface to the Anthropic API
-
libharu_ng
Easily generate PDFs from your Rust app
-
ast-grep-language
Search and Rewrite code at large scale using precise AST pattern
-
creature_feature
Composable n-gram combinators that are ergonomic and bare-metal fast
-
unitil
EUC-JPの全角チルダを波ダッシュに変換するツール
-
cskk
C ABIから使う事を目的とした SKK(Simple Kana Kanji henkan)方式のかな漢字変換ライブラリ
-
pulldown-cmark-toc
Generate a table of contents from a Markdown document
-
mdbook-codeblocks
A mdbook preprocessor to prepend customizable vignette to code blocks
-
timug
It has been created for personal blog creation purpose. Timus has its limits, but it fulfills the purposes for which it was created.
-
zp
Copy the contents of the source file or the standard output buffer to the clipboard, with support for maintaining a history of copied content, allowing users to easily paste into another file or program
-
float-pretty-print
Format f64 for showing to user, not for serialisation
-
pager
pipe your output through an external pager
-
dcsv
Dyanmic csv reader,writer,editor
-
clima
A minimal Markdown reader in the terminal
-
mdbook-private
An mdbook preprocessor that controls visibility of private chapters and sections within them
-
in_definite
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
-
hlight
dedicated to delivering exceptional syntax highlighting capabilities
-
fastn-jdebug
fastn: Full-stack Web Development Made Easy
-
four-char-code
A string of 4 ascii chars represented by an u32
-
ib-pinyin
一个高性能拼音匹配库
-
sk-skimmer
Fuzzy Finder in rust!
-
secular
No Diacr!
-
reggy
friendly, resumable regular expressions for text analytics
-
deindent
A command line utility and Rust library to format overly-indented text
-
seshat-unicode
A Unicode Library for Rust. Unicode 16.0.0 ready. XID_Start and XID_Continue are also available.
-
nu-utils
Nushell utility functions
-
pukram-formatting
A type to represent the formatting of the pukram markup language
-
mdbook_rash
Binary to create doc from
rash
code -
diagnostic
Pretty diagnostic report
-
indent
Functions for indenting multiline strings
-
kathoey
text feminization using open corpus linguistics data
-
pandoc
API that wraps calls to the pandoc 2.x executable
-
inlet_manifold
A general purpose highlighting library
-
diacritics
Remove diacritics from letters, for example when standardizing input for a search
-
mktoc
Generate Table of Contents from Markdown files
-
charx
A replacement for char::is_ascii*
-
iotext_rs
IoText data protocol
-
scrunch
full-text-searching compression
-
dmos
Djot HTML renderer with advanced features
-
crate2bib-cli
A CLI tool for the crate2bib crate
-
mdbook-callouts
mdBook preprocessor to add Obsidian Flavored Markdown's Callouts to your book
-
frawk
an efficient Awk-like language
-
scanix
search a text or pattern in files. A fast and lightwight text tool.
-
unbom
Remove UTF-8 BOM from files
-
string-patterns
Makes it easier to work with common string patterns and regular expressions in Rust, adding convenient regex match and replace methods (pattern_match and pattern_replace) to the standard…
-
hi-doc-jumprope
fast rope (fancy string) library built on top of Skiplists - hi-doc fork
-
nu_plugin_emoji
a nushell plugin called emoji
-
rust-ai
A collection of 3rd-party AI APIs for Rust
-
vi
An input method library for vietnamese IME
-
string-auto-indent
Normalizes multi-line string indentation while preserving platform-specific line endings
-
regex-charclass
Manipulate and convert regex character classes
-
glyph_brush_layout
Text layout for ab_glyph
-
shell2batch
Coverts simple basic shell scripts to windows batch scripts
-
mdbook-pikchr
A mdbook preprocessor to render pikchr code blocks as images in your book
-
ctreg
Compile-time regular expressions the way they were always meant to be
-
mnm
Mnemonic sentences for BitTorrent info-hashes
-
agentai
designed to simplify the creation of AI agents
-
easy_reader
easily navigating forward, backward or randomly through the lines of huge files
-
mdbook-aquascope
Interactive Aquascope editor for your mdBook
-
html-compare
compare html files
-
svgbob_cli
Transform your ascii diagrams into happy little SVG
-
flowquad
that helps you build UI stuff with Macroquad
-
stroka
Small String optimization
-
charname
Incredibly simple library that just gives you the Unicode name for a character
-
cai
The fastest CLI tool for prompting LLMs
-
mdbook-tocjs
A mdbook preprocessor which adds extra js and css file for ToC hydration
-
quicksilverx
easy to use grep clone
-
turtlefmt
Auto-formatter for RDF Turtle
-
mdbook-open-on-gh
mdbook preprocessor to add a open-on-github link on every page
-
mdbook-nice
A mdbook plugin to add nice css to your book
-
catalog-of-markdown
Generate the catalog of markdown file
-
substring
method for string types
-
reflexo-typst
Bridge Typst to Web Rendering, with power of typst
-
gen-mdbook-summary
generate SUMMARY.md for mdbook project
-
regexnight
Command-line tool to print syntax-highlighted versions of regular expressions and spot errors
-
v_escape
The simd optimized escaping code
-
krafna
terminal-based alternative to Obsidian's Dataview plugin, allowing you to query your Markdown files using standard SQL syntax
-
armnod
random string generator
-
extract_anchors
Утилита для извлечения из исходных кодов всех помеченных отрывков
-
fimdoc
Firendship is Magic Document, converts Markdown into FIMFiction BBCode
-
mdbook-linkcheck2
A backend for
mdbook
which will check your links for you -
async-utf8-decoder
Convert AsyncRead to incremental UTF8 string stream
-
ncase
Enforce a case style
-
rake
Rapid Automatic Keyword Extraction (RAKE) algorithm
-
px-wsdom-ts-convert
wsdom crate
-
mdbook-chess
An mdbook preprocessing plugin to generate chess boards
-
ewts
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (lib)
-
minspan
a package for determining the minimum span of one vector within another
-
md-tui
A terminal markdown viewer
-
minimo
terminal ui library combining alot of things from here and there and making it slightly easier to play with
-
md-ulb-pwrap
Markdown paragraph wrapper using Unicode Line Breaking Algorithm
-
utf8_iter
Iterator by char over potentially-invalid UTF-8 in &[u8]
-
n_gram
training n-gram language models
-
rsonpath-lib
Blazing fast JSONPath query engine powered by SIMD. Core library of
rsonpath
. -
mdbook-tabs
mdBook plugin for rendering content in tabs
-
utf64
encode utf-8 strings into utf-64, and decode them back
-
cronus_spec
The definitions for cronus API spec
-
mdbook-hints
mdBook preprocessor to add hover hints to your book
-
dialogi
A dialog parser
-
simstring_rust
A native Rust implementation of the SimString algorithm
-
lister-cli
Lister: Navigate Markdown Lists
-
puppet-fmt
Automatic code formatter for puppet manifests
-
mdbook-external-links
Open external links inside your mdBooks in a different tab
-
word-tally
Output a tally of the number of times unique words appear in source input
-
loki_text
advanced string manipulation with pattern searching and replacement capabilities
-
hyperscan
bindings for Rust with Multiple Pattern and Streaming Scan
-
to_markdown_table
An easy way to format any data structure into a Markdown table
-
rustash
CLI tool to manage your notes
-
treegrep
A pattern matcher frontend or backend which displays results in a tree
-
libchai
汉字编码优化算法
-
indent_write
Write adapters to add line indentation
-
jayce
tokenizer 🌌
-
cliche
Dead simple static site generator
-
eliza
natural language processing program developed by Joseph Weizenbaum in 1966
-
reason-shell
Reason: A Shell for Research Papers
-
slack-blocks-render
Slack blocks render is a Rust library to render Slack blocks as Markdown
-
doxygen-bindgen
Converts Doxygen comments into Rustdoc markdown
-
filename-refactor
Command to refactor file names
-
encoding-next
Character encoding support for Rust
-
botanical-latin
Decliner / conjugator / inflector for classical / botanical Latin
-
replaxe
A command-line tool to replace text in files with easy patterns
-
textpod
Local, web-based notetaking app inspired by 'One Big Text File' idea
-
fcowsay
working with cowsay
-
nucleo-matcher
plug and play high performance fuzzy matcher
-
rustclock
a stopwatch or timer cli made in rust
-
gulagcleaner_rs
Ad removal tool for PDFs
-
wit-bindgen-markdown
Markdown generator for WIT and the component model, typically used through the
wit-bindgen-cli
crate -
mdbook-embedify
based mdbook preprocessor plugin that allows you to embed apps to your book, like youtube, codepen and some other apps
-
dcss-api
A DCSS Webtile API for Rust
-
rewrite
Safely rewrite file contents from stdin, even when file is open as an input
-
wikidot-normalize
provide Wikidot-compatible string normalization
-
mdsh
Markdown shell pre-processor
-
codetypo-dict
Source Code Spelling Correction
-
libretranslate
A wrapper for the LibreTranslate web API
-
rust_file_encode_mode_convert
这是一个rust的库,用于检测文件的编码格式。 支持GBK,GBK2312 , UTF8, UTF16LE, UTF16BE, UTF8+BOM,UTF32 等多种编码格式。
-
esri_ascii_grid
reading ESRI Ascii Grid .asc files
-
strloin
copy on write slices of a string
-
conststr
Constant strings
-
mdbook-pagebreaks
A mdbook preprocessor to insert page breaks when rendering to HTML
-
ascii_help
help you quickly convert ASCII codes
-
text-editing
string with utilities for editing
-
hat-splitter
HAT splitter
-
fmtt
A diff-friendly text formatter that breaks lines on sensible punctuations and words to fit a line width
-
runi
a CLI tool to generate unicode fonts
-
huggingface/tokenizers-python
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
-
piet-cosmic-text
A text layout engine for piet based on cosmic-text
-
poppler-sys-rs
Low-level (FFI) bindings for poppler-glib
-
colonnade
format tabular data for display
-
unicode-canonical-combining-class
Fast lookup of the Canonical Combining Class property
-
keyphrases
Rapid Automatic Keyword Extraction (RAKE) implementation in Rust
-
bobo_html_parser
parser of html markdown
-
lodestone
A website wrapper for FFXIV's lodestone
-
string_wizard
manipulate string like a wizard
-
portmanteau
create portmanteaux
-
rustdoc-md
Convert Rust documentation JSON into clean, organized Markdown files
-
cesu8
Convert to and from CESU-8 encoding (similar to UTF-8)
-
asciidoctor-client
A kludge to improve the performance of static site generators that use asciidoc through its cli
-
broken-md-links
A command-line tool and library to detect broken links in Markdown files
-
fast_symspell
Spelling correction & Fuzzy search
-
aki-xcat
concatenate files that are plain, gzip, xz and zstd
-
razy-importer
lazy_importer
-
cedarwood
efficiently-updatable double-array trie in Rust (ported from cedar)
-
unidown
Convert Markdown to Unicode
-
duvet
A requirements traceability tool
-
colored_text
adding colors and styles to terminal text
-
hebrew_unicode_script
A low-level library designed to ascertain whether a character belongs to the Hebrew Unicode script. It supports checks for individual characters as well as for membership within collections
-
iregex
Intermediate representation for Regular Expressions
-
mdbook-curly-quotes
mdBook preprocessor that replaces straight quotes with curlyquotes, except within code blocks or code spans
-
deliminator
Universal code documentation generator
-
physis
Interact with XIV game data
-
text_utils_s
edit array. Example delete duplicate in array. Clear string
-
spc-core
A command-line tool for processing and analyzing data from SPC files
-
utf16_lit
macro_rules to make utf-16 literals
-
ADA_Standards
help you handle checks on your ADA projects, especially good to build scripts to check coding standards conformity
-
ean-rs
generating and validating EAN barcodes
-
overlap-chunk
splitting text into chunks of specified size with adjustable overlap percentage
-
lindera-unidic-builder
A Japanese morphological dictionary builder for UniDic
-
lexical-sort
Sort Unicode strings lexically
-
mdbook-langtabs
An mdbook preprocessor that adds language tabs for code blocks
-
erebus
A CLI message generation library
-
mdbook-pdf-headless_chrome
Control Chrome programatically
-
mdbook-ai-pocket-reference
mdbook preprocessor for the ai-pocket-reference project
-
moto
motivated automation
-
sixbit
Small packed strings
-
bpetok
CLI for tokenizing text input using Byte Pair Encoding (BPE)
-
inflections
High performance inflection transformation library for changing properties of words like the case
-
kashida
Insert Kashidas/Tatweel into Arabic text, e.g. for justification purposes.
-
obelisk-component-builder
Builder of Obelisk components
-
fix-name-case
CLI tool to convert variable and function names to snake_case
-
orly
Download O'Reilly books as EPUB
-
str_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
linkcheck2
extracting and validating links
-
loc
Count lines of code (cloc) fast
-
unicode-display-width
Unicode 15.1.0 compliant utility for determining the number of columns required to display an arbitrary string
-
tform
format plain text into well-structured Markdown or HTML
-
crowbook-text-processing
some utilities functions for escaping text (HTML/LaTeX) and formatting it according to typographic rules (smart quotes, ellipsis, french typograhic rules)
-
giff
Visualizes the differences between the current HEAD and a specified branch in a git repository using a formatted table output in your terminal. The differences are displayed with color-coded…
-
itex
Initialize a LaTex project inside a folder instantly
-
cbfr
A buffer that run on stack, focusing on performance and speed
-
gigagei
random quote fetching console utility
-
rust-regex-dsl-creator
Regular expression DSL derive macros
-
writings
The Bahá’í Sacred Writings for use in Rust projects and APIs
-
owned_str
Provide a stack allocated String for no-std or const environement
-
aho-corasick-unsafe
Fast multiple substring searching
-
rgon
A command-line tool written in Rust that searches for a query string within a file
-
pixt
Terminal Based Cross Platform Image Viewer
-
tergo-formatter
Formatter for tergo
-
mdlink
Auto-convert HTTP links for your favorite services into nice Markdown links
-
prompt-input
lightweight library for user input prompts in Rust, designed to make input handling straightforward
-
kitoken
Fast and versatile tokenizer for language models, supporting BPE, Unigram and WordPiece tokenization
-
iregex-syntax
Common syntax for regular expressions
-
readability
Port of arc90's readability project to rust
-
turn-uppercase
Small command to uppercase text in command line and copy to clipboard
-
gliclass-rs
Inference engine for GLiClass models
-
matcher_py
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
flx-rs
Rewrite emacs-flx in Rust for dynamic modules
-
linoleum
but ergonomic line editor
-
ultra-nlp
A NLP library
-
unclog
allows you to build your changelog from a collection of independent files. This helps prevent annoying and unnecessary merge conflicts when collaborating on shared codebases.
-
split-every
Split for every n occurrences of a pattern iteratively!
-
tree-sitter-stack-graphs-python
Stack graphs definition for Python using tree-sitter-python
-
org-rust-exporter
exporter for org mode documents parsed with
org-rust-parser
-
until_needle
An extension to the BufRead trait that allows reading until a specified pattern (needle) is found
-
mylibrary_
my personal library
-
maybe-regex
Wrapper for strings that may be either a regex or a plain-text string
-
pandoc_types
Rust port of pandoc-types
-
furze
finite state transducers (fst) writen in rust
-
paltoquet
rule-based general-purpose tokenizers
-
cloc
Count, or compute differences of, lines of source code and comments
-
dnd-character
A Dungeons and Dragons character generator
-
repvar
A tiny CLI tool that replaces variables of the style
${KEY}
in text with their respective value. It can also be used as a rust library -
strinject
Inject text from somewhere else into given text
-
libanubhav
management system written in Rust
-
ragit-korean
korean tokenizer for ragit
-
twars-url2md
A powerful CLI tool that fetches web pages and converts them to clean Markdown format using Monolith for content extraction and htmd for conversion
-
like
A SQL like style pattern matching
-
r-matrix
Rust port of cmatrix
-
notmecab
tokenizing text with mecab dictionaries. Not a mecab wrapper.
-
keep-a-changelog
generating and manipulating CHANGELOG.md files that use the Keep A Changelog format
-
utf58
High-tech encoding of the Unicode space in one quibble and up to 3 bytes
-
redpen-linter
Rust linter
-
casile
The command line interface to the CaSILE toolkit, a book publishing workflow employing SILE and other wizardry
-
raylib_interactive
An interactive library for Raylib
-
vectorscan-rs
Ergonomic bindings to the Vectorscan high-performance regex library
-
askalono-cli
detect the contents of license files
-
htmd
A turndown.js inspired HTML to Markdown converter
-
ponsic-winsafe
The dependency of the
ponsic
crate -
pink_accents
Replacement of patterns in string to simulate speech accents
-
escrit
learning languages by reading texts
-
filecheck
writing tests for utilities that read text files and produce text output
-
syllabize-es
Syllabize Spanish text, and much more
-
textwrap-macros
procedural macros to use textwrap utilities at compile time
-
parse-wiki-text-2
Parse wiki text from Mediawiki into a tree of elements
-
asimov-cli
ASIMOV Command-Line Interface (CLI)
-
array_tool
Helper methods for processing collections
-
chord3
Create pdf songbooks from chopro source
-
jaaj-rs
Blazingly 🔥 fast 🚀 and memory safe ✨ JaaJ implementation in Rust 🦀
-
cnv
Command-line tool to convert between units of measurement
-
lowcharts
draw low-resolution graphs in terminal
-
jawk
JSON AWK
-
pray
A tui tool for preparing a prompt to the llms
-
normalize-line-endings
Takes an iterator over chars and returns a new iterator with all line endings (\r, \n, or \r\n) as \n
-
fast_whitespace_collapse
Collapse consecutive spaces and tabs into a single space using SIMD
-
cli_app_capo
CLI application with Unix-like tools
-
pragmatic-segmenter
Rust port of pySBD v3.1.0
-
ripjson
A fast and lean way to grep in JSON files
-
mdopen
Preview markdown files in a browser
-
src2md
Turn source code into a Markdown document with syntax highlighting, or extract it back
-
fea-rs
Tools for working with Adobe OpenType Feature files
-
minigrep_jeck
minigrep
is a grep clone that takes a query and searches for the query in the file; with added support for regex -
windot
emoji picker
-
tag2upload-service-manager
Debian tag2upload service manager
-
string-replace-all
String replacement utility inspired by JavaScript, allowing pattern-based substitutions with support for both exact matches and regex patterns
-
sqdj
sqdj shortens delimited data
-
scatternotes
A cli application to manage unstructured notes
-
textmate-scope-selector-peg
Textmate scope selector implementation as a PEG (parser grammar) in Rust
-
uklatn
Ukrainian Cyrillic transliteration to Latin script
-
xenon-lexer
The Xenon compiler's lexer
-
santoka
Translations of 668 of Taneda Santoka's free-verse haiku
-
ethan-rs-wc
The ethan-rs-ws(erwc) is word, line, character, and byte count. Like wc command but not just wc command, more accurate and faster. Text can also be read from standard input for statistics.
-
slugify
Macro for flexible slug generation
-
docket
markdown to HTML documentation rendering
-
ndef-rs
NDEF (NFC Data Exchange Format) parser and generator in Rust
-
xml_magic
A reasonably fast XML formatter
-
chamkho
Khmer, Lao, Myanmar, and Thai word segmentation/breaking library and command line
-
huski-auxies
Auxiliaries for huski implementation
-
rusty-dawg
building and querying Directed Acyclic Word Graphs (DAWGs) and Compacted DAWGs (CDAWGs) for efficient string indexing and searching
-
adobe-cmap-parser
parse Adobe CMap files
-
uniquewords-rs
Count the frequencies of words in text file(s) or stdin
-
unicode-intervals
Search for Unicode code points intervals by including/excluding categories, ranges, and custom characters sets
-
lingua-english-language-model
The English language model for Lingua, an accurate natural language detection library
-
epcmanager
EPC text tool for RFID
-
analiticcl
approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation
-
alphabet_detector
Natural language alphabet detection library
-
webspeeddial
A dial system for websites
-
translation-api-cn
Some useful structs for calling Chinese translation api cloud services. A helper tool for
bilingual
cmdline tool. -
naming_utils
generating naming conventions, pluralizing words, and rest api paths in Rust
-
re_case
Case conversions, the way Rerun likes them
-
aws-smt-strings
manipulating SMT-LIB strings and regular expressions
-
mdbook-toc
mdbook preprocessor to add Table of Contents
-
readability-liveboat
Port of arc90's readability project to rust, updated for use with liveboat
-
lexi-matic
A Lexer Library
-
unicode-language
detect language coverage given a list of codepoints
-
bump-bin
Increments version with semver specification
-
daffy
small file comparision tool, uses Levenshtein distance to compare files
-
ainu-utils
A collection of utilities for the Ainu language
-
vyder_std
Standard library for vyder
-
gregex
Regex solver utilizing NFA
-
gh_page_tool
A github gh-pages tool for static blog site
-
tre-regex
Rust safe bindings to the TRE regex module
-
tantivy-stemmers
A collection of Tantivy stemmer tokenizers
-
context-notation
Featherweight semantic notation for text
-
trust_pdf
Verifies signed PDFs against the originals, checking for sneaky modifications
-
enma
serving anime and manga information 📦
-
iconv-native
A lightweight text encoding converter based on platform native API or libiconv
-
bbd
Binary Braille Dump
-
asciidork-backend
Asciidork backend
-
unicodeit
Converts LaTeX to Unicode (rust port)
-
analyst
A command line tool that supports quick browsing of csv data
-
streampager
pager for command output or large files
-
mdbook-plugin-utils
mdBook plugins
-
cheat_checker
Detects similarities between sets of files
-
mdbook-plantuml
A preprocessor for mdbook which will convert plantuml code blocks into inline SVG diagrams
-
mdbook-trunk
mdBook plugin which bundles packages using Trunk and includes them as iframes
-
yara-x-fmt
A code-formatting library for YARA rules
-
rst_parser
a reStructuredText parser
-
mdbook-kroki-preprocessor
render kroki diagrams from files or code blocks in mdbook
-
rs-tool
A command-line tool to perform reservoir sampling on a file or a stream
-
mj_minigrep
Welcome to mj minigrep project
-
ik-rs
chinese segment, ik-analyzer for rust
-
rust-beam
A LaTeX slide generator you can write in faster than beamer
-
cocomo
(Constructive Cost Model) CLI utility and library
-
chewing-cli
Tools of the Chewing (酷音) intelligent Zhuyin input method
-
runiq
An efficient way to filter duplicate lines from input, à la uniq
-
lemmeknow
Identify any mysterious text or analyze strings from a file
-
dirgrab-lib
Core library for dirgrab: concatenates file contents from directories, respecting Git context
-
text_trees
textual output for tree-like structures
-
searcher_txt
A copy of grep that i made to show that im bad at rust
-
regex_generate
Use regular expressions to generate text
-
named_entity_parsing
Named entity parser. Used in Rusev to parse a list of tokens into a list of entities.
-
adulting
A program to print one rule at a time from The 25 Principles for Adult Behavior: John Perry Barlow
-
egg-mode-text
Text parsing for Twitter: character counting, hashtag/mention extraction
-
speki-cli
cli version of speki
-
bwrap
A fast, lightweight, embedded systems-friendly library for wrapping text
-
mdi
markdown include
-
tremor-kv
A logstash inspured key value extractor
-
markdown_converter
html to markdown converter and flavored markdown to discord markdown converter
-
mdbook-linkcheck
A backend for
mdbook
which will check your links for you -
mathml-core
MathML type definitions
-
mdbook-fs-summary
Summary generator for mdbook
-
rust_readability
A package to assess the complexity of texts using a variety of readability formulas
-
tgrep
Toy grep that honors .gitignore
-
nucleo-ui
TUI wrapper around the nucleo fuzzy matching crate
-
manchu-converter
Converts transcripted Manchu text to Manchu script with Manchu alphabet
-
bt-echo
implemenation of the echo command-line utility
-
langsan
sanitizing language model input and output
-
mttf
working with TrueType fonts. Most parts are zero-allocation.
-
html-linter
An HTML linting library for checking HTML structure and semantics
-
sm-search
way of searching through text - for people who are too lazy to use Regex
-
forbidden-bands
8-bit string handling library
-
CLI_Project_Scott_Coakley
CLI Project in Rust
-
xpath-cli
Evaluate XPath selectors on XML or HTML documents
-
whichlicense_detection
detect licenses used by the WhichLicense project
-
fr_alebref_libbrefdata
BrefData library
-
csml_interpreter
The CSML Interpreter is the official interpreter for the CSML programming language, a DSL designed to make it extremely easy to create rich and powerful chatbots
-
regexy
lightweight Rust library for working with regular expressions. The regexy crate provides an easy-to-use interface for matching patterns in strings using regex
-
rst_renderer
a reStructuredText renderer
-
typope
Pedantic source code checker for orthotypography mistakes and other typographical errors
-
parse2csv
parse log-file and output to stdout as csv file by regex
-
minos-codex
Minos Codex is a tool for detecting and identifying secrets in a string
-
statisk
opinionated static site generator
-
message_segment_calculator
package to calculate SMS message segments
-
mdbook-github-authors
mdbook preprocessor to display Github profiles of authors of a page
-
cqtool
converting between CQ strings and message segment arrays
-
mdbook-merjong
A preprocessor for mdbook to add merjong support
-
spanned
string processing with file/line/col information and the regular rust
str
API -
hanconv
Convert between Chinese characters variants
-
mdbook-cat-prep
a preprocessor for mdbook which provides teacher, subject, material and tag functionality
-
wikipedia_prosesize
Count Wikipedia prose size
-
iregex-automata
Finite automata definitions for the
iregex
crate -
nlf
A CLI to append newline characters (LF) at the end of text file
-
bk-tree
A Rust BK-tree implementation
-
svgbob
Transform your ascii diagrams into happy little SVG
-
filenamify
Convert a string to a valid filename
-
mdbook-dtmo
Creates a book from markdown files with added plugins
-
unified-diff
GNU unified diff format
-
inflector-plus
Adds String based inflections for Rust. Snake, kebab, camel, word, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
sre-engine
A low-level implementation of Python's SRE regex engine
-
zipcodes
Query US zipcodes without SQLite
-
mdbook_fork4ls
Fork of mdBook for mdBook_LS
-
unicode-ellipsis
truncate Unicode strings to a certain width, automatically adding an ellipsis if the string is too long
-
text_lines
Information about lines of text in a string
-
prescript
parsing and executing Prescript scripts
-
auto-regex
Automagically finds a regex that best matches an example and a sample list
-
mazer-core
A minimal, simple math markup language that compiles to HTML, written in Rust
-
commit_crafter
AI powered tool for Git commit message generator
-
mdbook-llms-txt-tools
convert mdbook to llmstxt.org format
-
summary-rs
A summary library for lithium battery and sodium ion battery
-
tfidf-text-summarizer
extractive text summarization system which uses TF-IDF scores of words present in the text to rank sentences and generate a summary
-
bukvalno
A cli tool for converting images to ascii art
-
gspell
Rust bindings for gspell
-
clippy-to-md
cli tool to convert clippy json reports to markdown files
-
interslavic
in rust
-
mdbook-presentation-preprocessor
A preprocessor for utilizing an MDBook as slides for a presentation
-
lindera-cc-cedict-builder
A Chinese morphological dictionary builder for CC-CEDICT
-
kbnf-regex-automata
A forked version of regex-automata for kbnf
-
rust-tfidf
calculate TF-IDF (Term Frequency - Inverse Document Frequency) for generic documents
-
fontconfig
Safe, higher-level wrapper around the Fontconfig library
-
eddie
Fast and well-tested implementations of edit distance/string similarity metrics: Levenshtein, Damerau-Levenshtein, Hamming, Jaro, and Jaro-Winkler
-
unicode-matching
match Unicode open/close brackets
-
char-ranges
Iterate chars and their start and end byte positions
-
enum-ts
TypeScript Enum pattern matcher codegen
-
include-doc
Include examples in your Rustdocs
-
buf-min
Minimal utf-8 safe buffer traits
-
mdbook-ocirun
mdbook preprocessor to run arbitrary commands and code snippets inside containers
-
mdbook-quiz-schema
Schema for quizzes used in mdbook-quiz
-
srx
A mostly compliant Rust implementation of the Segmentation Rules eXchange (SRX) 2.0 standard for text segmentation
-
stringmatch
Allow the use of regular expressions or strings wherever you need string comparison
-
koto_regex
A Koto library for working with regular expressions
-
ironsmith-parser
Transforms Smithy 2.0 IDL files into an abstract syntax tree
-
clipboard-substitutor
CLI tool to monitor clipboard changes and perform operations based on the contents
-
pulldown-html-ext
Extended HTML rendering capabilities for pulldown-cmark
-
vocalolyrics
Lyrics scraper, primarily for Vocaloid content. By default, atwiki is used as the source. We plan to make other sources selectable, but that is not currently possible
-
case
A set of letter case string helpers
-
simple-logging
logger for the log facade
-
truncate_string_at_whitespace
Truncate a &str at the closest whitespace to a specified length with unicode safety
-
samvadsetu
LLM API for commonly used LLM services including Gemini, ChatGPT, and Ollama. The name implies a bridge for dialogue since the library facilitates communication and interaction between…
-
midstring
Create a string between two other strings, that is lexicographically halfway between them
-
stringutil
A collection of useful string utilities
-
linurgy
Manipulate the output of multiple newlines. Replace/Insert/Append newlines with text. Input and output from stdio/files/buffers
-
ps-str
String transcoding library
-
jpreprocess
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
asciidork-parser
Asciidork parser
-
rust-texas
generate latex documents
-
pulldown_typst
A pull parser for Typst markup
-
markov_str
Markov Chain implementation optimized for text generation
-
rig-dyn
A dynamic client-provider abstraction framework for Rust applications on top of rig-core
-
csv_to_table
pretty print CSV as a table
-
summavy-ownedbytes
Expose data as static slice
-
ngrammatic
Character-oriented ngram generator and fuzzy matching library
-
fyi_ansi
Compile-time ANSI formatting macros for FYI
-
rusty_regex
A minimalistic regex engine in Rust using the pipeline: Regex -> AST -> NFA -> DFA -> Match(String)
-
thesaurus
An offline thesaurus library for Rust
-
hxd
configurable hexdump library
-
owlz
"Owlz" ascii emojis, created randomly or by design
-
litime
A command line tool to display the current time ish with a literature quote
-
glk
Bindings for the Glk I/O interface for hosting interactive fiction interpreters
-
timeblok
A language for event scheduling in plain text
-
tectonic
A modernized, complete, embeddable TeX/LaTeX engine. Tectonic is forked from the XeTeX extension to the classic “Web2C” implementation of TeX and uses the TeXLive distribution of support files.
-
gaze
small utility library with the goal of making it easier to scan/lex text and collections
-
fast-str
A flexible, easy-to-use, immutable, efficient
String
replacement for Rust -
block-list
A minimalist hosts-based tool for managing block lists and ad-blocking
-
spezilinter
spezifisch's linter for different file formats, linting for weirdly specific stuff
-
vidyut-prakriya
A Sanskrit word generator
-
enc-check
inspect utf-8 and utf-16 character encodings
-
unicode_extension
Don't use this crate
-
tectonic_bridge_core
Exposing core backend APIs to the Tectonic C/C++ code
-
binatime
A binary clock in the terminal
-
abbreviation_extractor
extracting abbreviations from text
-
mecab
Safe Rust wrapper for mecab a japanese language part-of-speech and morphological analyzer library
-
textgridde-rs
dealing with Praat TextGrid files. MIT licensed.
-
vndb_tags_get
convert VNDB tag list (JSON to markdown)
-
archive-pdf-urls
Extract all links from a PDF and archive the URLs in the Internet Archive's Wayback Machine
-
glimpse
A blazingly fast tool for peeking at codebases. Perfect for loading your codebase into an LLM's context.
-
ipynb-to-md
Convert Jupyter Notebooks to Markdown files
-
caseless
Unicode caseless matching
-
metatron
core library
-
randem
Print a random emoji optionally with the given string as seed
-
modeling
tools to analysis different languages by Ctags
-
gchemol-parser
Text parsing made simple
-
coinflip_animation
coinflip animation in the terminal, as a screensaver or just simply to look at
-
substring-replace
developer-friendly methods to manipulate strings with character indices
-
stur
functions for working with strings
-
diffy-imara
Tools for finding and manipulating differences between files
-
uclanr
A random word picker that gives you actually useful words
-
litua
Read a text document, receive its tree in Lua and manipulate it before representing it as string
-
termdiff
Write a diff with color codes to a string
-
easy-regex
Make long regular expressions like pseudocodes
-
grammalecte_client
Grammalecte HTTP client
-
choco
markup language for dialogue systems
-
csmlinterpreter
The CSML (Conversational Standard Meta Language) is a Domain-Specific Language developed for creating conversational experiences easily
-
committer
git commit message generator
-
mdbook_header_footer
mdBook preprocessor to prepend header and append footer to certain chapters
-
rsrusl
A really simple useful library ported to Rust
-
aki-mline
match line, regex text filter like a grep of linux command
-
gte-rs
Text embedding and re-ranking pipelines
-
spongebob
convert text to spongebob case a.k.a tHe MoCkInG sPoNgEbOb MeMe
-
vibrato
viterbi-based accelerated tokenizer
-
clarifai_grpc
The official Clarifai gRPC Rust client
-
skyspell_kak
skyspell - kakoune integration
-
riimut
Transform latin letters to runes & vice versa
-
antex
Styled text and tree in terminal
-
markdown-extract
Extract sections of a markdown file
-
irg-kvariants
wrapper around kvariant from hfhchan/irg
-
IndicScriptSwap
help transliterate between various indic scripts. It is not ready yet and has many issues. If you encounter any issues, please contact me (https://github.com/mssrprad/transliterate-ferris/tree/cli or pradyumna…
-
mdbook-pagetoc
A mdbook plugin that provides a table of contents for each page
-
pulldown_mdbook
A pull parser for mdBook
-
midpad
Command line utility to pad texts
-
wiki-tui
easy to use Wikipedia Text User Interface
-
ohnomore
Transformations for TüBa-D/Z lemmas
-
minigrep_nc
An implemantation of grep in Rust
-
taboc
A table of contents generator for markdown documents
-
tracery
Text-expansion library
-
ing2ynab
cleans up ing.com.au transactions for YNAB
-
tokenizations
alignments library
-
freesia
some string operators
-
pulldown-cmark-mdcat
Render pulldown-cmark events to TTY
-
strs_tools
Tools to manipulate strings
-
split-identifier
Rust package that provides functions to split programmatic identifiers according to case conventions
-
markdown-it-footnotes
Creates footnotes and lists of footnotes in Markdown documents
-
phonet
A CLI tool and library to validate phonotactic patterns for constructed languages
-
sbert
Sentence Bert (SBert)
-
const_unit_poc
Proof of Concept: Physical units through const generics
-
aho-corasick
Fast multiple substring searching
-
cglue-bindgen
cleanup cbindgen headers for CGlue
-
typeline_ext_utils
operators for typeline
-
vortilo
Analizas la gramatikon de Esperantaj frazoj
-
pprint
Flexible and lightweight pretty printing library for Rust
-
lingua-spanish-language-model
The Spanish language model for Lingua, an accurate natural language detection library
-
notion2html
Convert Notion pages to HTML
-
merge-whitespace
Procedural macros for merging whitespace in const contexts
-
budoux
Rust port of BudouX (machine learning powered line break organizer tool)
-
bard
Creates PDF and HTML songbooks out of easy-to-write Markdown sources
-
mdbook-treesitter
mdBook preprocessor for html adding tree-sitter highlighting support
-
viterbi_pos_tagger
A part-of-speech (POS) tagger using the Viterbi algorithm
-
emoji
Every emoji, their metadata, and localized annotations
-
asciidork-eval
Asciidork eval
-
kurtbuilds_regex
Wraps the regex library to also provide macros
-
unaccent
remove accents from strings, inspired by PostgreSQL's unaccent extension
-
rosie
Interface for the Rosie Pattern Language, for efficient and maintainable text pattern matching and search
-
regex-chunker
Iterate over the data in a
Read
type in a regular-expression-delimited way -
mdbook-spec
An mdBook preprocessor to help with the Rust specification
-
man
Generate structured man pages
-
xee-ir
Xee intermediate representation and compilation to bytecode
-
gaoya
Locality Sensitive Hashing Data Structures
-
purlu
A full-text search engine
-
regex-filtered
Efficiently check an input against a large number of patterns
-
mdbook-yml-header
mdBook preprocessor for removing yml header
-
utf8streamreader
lookahead iterator on an utf8 byte stream
-
rascii_art
Advanced ASCII Art Generator
-
lookbook
Component preview framework for Dioxus
-
lindera-ipadic-neologd-builder
A Japanese morphological dictionary builder for IPADIC NEologd
-
advent-ocr
Converts ASCII-art representations of letters generated by Advent of Code puzzles into a String containing those letters
-
ob
A Blog and RSS system written in Rust
-
sample-std
Sampler definitions and implementations for st
-
knock-knock
CLI tool for obtaining and outputting domain name information in an easy-to-read format
-
etch
Not just a text formatter, don't mark it down, etch it
-
xi-rope
A generic rope data structure built on top of B-Trees
-
surt-rs
Sort-friendly URI Reordering Transform (SURT)
-
goofy-animals
Generate a name in adjective-adjective-animal form
-
bfom-lib
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
font-map
Macros and utilities for parsing font files
-
mdbook-svgbob
SvgBob mdbook preprocessor which swaps code-blocks with neat SVG
-
mmsearch
一个从文本文件中查找字符的命令行工具。只支持utf8编码的文件
-
plagiarismbasic_lib
Basic plagiarism checker written in Rust
-
markdown2pdf
Create PDF with Markdown files (a md to pdf transpiler)
-
unescape
Unescapes strings with escape sequences written out as literal characters
-
typos-cli
Source Code Spelling Correction
-
mini-openai
An OpenAI API client with minimal dependencies
-
mdbook-infisearch
InfiSearch plugin for Mdbook
-
boreal-cli
CLI utility to run boreal, a YARA rules engine
-
morse_code_parser
A Morse code parser and decoder implemented in Rust
-
rsrpp
project for research paper pdf
-
kl-hyphenate
Knuth-Liang hyphenation for a variety of languages
-
sanitizer
A collection of methods and macros to sanitize struct fields
-
diff-man
diff utility lib
-
marcus
An experimental Markdown parser written in Rust
-
abbreviator
abbreviating long words
-
dicexp
A Dice Expression Interpreter program and library for parsing (and rolling) role-playing game style dice notations (e.g. "2d8+5")
-
display_bytes
Human-readable display of byte sequences
-
google_translate_request
Google translate request to a spesific endpoint
-
rsnltk
Rust-based Natural Language Toolkit
-
hns
Human numeric sorting program — does what
sort -h
is supposed to do! -
natural
Pure rust library for natural language processing
-
palmdoc-compression
Fast & safe implementation of PalmDoc/MOBI/AZW/Kindle flavored LZ77
-
ipa-translate
translating between IPA and ASCII text
-
chromalog
A customizable logger with dynamic color coding and file logging
-
dingtalk
Robot Util, Send text/markdown/link messages using DingTalk robot, 钉钉机器人
-
leptos-markdown
A component which can render markdown as html element in leptos
-
nfa_regex
NFA regex engine for text processing
-
formatjson
Formats JSON files
-
hmd
Custom Markdown Engine for my personal blog
-
easy_io
Fast and dead-simple IO for competitive programming in Rust
-
realhydroper-utf16
Work with UTF-16 in Rust
-
fsays
flavored replacement for the classic cowsay
-
anslatortray
translate from English to Pig Latin!
-
r4d
Text oriented macro processor
-
sastrawi
stemming and stopword removal for Bahasa Indonesia based on PHP sastrawi project by Andy Librian
-
unicount
Alphabetic counter supporting unicode
-
easy_random
Generate random data easily with easy_random :)
-
cli_app_capo15
CLI application with Unix-like tools
-
linebreak
breaking a given text into lines within a specified width
-
lithe
A Slim template engine by using Pest
-
convert_string
A trait to convert Strings to safe non-keywords and/or convert a Strings case (snake_case, PascalCase, ...)
-
avatarsay
Beautiful quotes from Avatar: The Last Airbender
-
scraps_libs
Scraps is a static site generator based on Markdown files written with simple Wiki-link notation. It can be used primarily for personal or team knowledge management.
-
dedent
Procedural macro for stripping indentation from multi-line string literals
-
wtf8-rs
WTF-8 encoding
-
texcore
Create LaTeX documents using native Rust types
-
matchpick
Find and replace multi-lines using a match-case
-
ohos-ime
Bindings to the
inputmethod
API of OpenHarmony -
catsay-AK
A catsay cli
-
meep
pasting service
-
text-parsing
Hierarchical text processing preserving char position info
-
mdbook-auto-gen-summary
A preprocessor and cli tool for mdbook to auto generate summary
-
syllarust
quickly counting syllables
-
porigon
Lightweight FST-based autocompleter library, targeting WebAssembly and data stored in-memory
-
uwurs
UwUify your strings with uwurs!
-
csv-groupby
execute a sql-like group-by on arbitrary text or csv files
-
nu_plugin_clipboard
A nushell plugin to copy text into clipboard or get text from it
-
justify
plaintext while handling Unicode gracefully
-
epub2mdbook
convert EPUB files to MDBook format
-
graphannis-capi
C-API to the ANNIS linguistic search and visualization system
-
vectorscan
wrapper for Vectorscan
-
rust-regex-dsl
Regular expression DSL
-
analyse-json
CLI tool for inspecting (Newline Delimited) NDJSON or JSON to understand the contents
-
rusile
components for the SILE typesetter
-
mdzk
Plain text Zettelkasten based on mdBook
-
fancy-regex-fork-pb
A custom fork of the fancy-regex crate. You probably don't want to use this.
-
patiencediff
algorithm
-
treebender
An HDPSG inspired symbolic NLP library for Rust
-
annotated-string
String with ability to annotate (format) its individual fragments
-
ferret
A trigram-based tool for detecting similarity in groups of text documents or program code
-
parascope
Weggli ruleset scanner for source code and binaries
-
heart-strings
Quickly get random heart emojis to copy!
-
file-expert
Expert system for recognizing source code files, similar to GitHub/lingust
-
yozuk
Chatbot for Programmers
-
ruby_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
readable-regex
Regex made for humans. Wrapper to build regexes in a verbose style.
-
pulldown-cmark-escape
An escape library for HTML created in the pulldown-cmark project
-
rjoin
joining CSV data on command line
-
tengwar
Transliterate text into J.R.R. Tolkien's Tengwar.
-
textgrid
working with PRAAT .TextGrid files with parsing, riting, manipulation, and history tracking modulesfor TextGrid data
-
unicode-casing
Titlecase helper function on characters
-
pithy
Ultra-fast, spookily accurate text summarizer that works on any language
-
rk-utils
A collection of utility functions and data structures for rust
-
none-shall-pass
Artifact for GitHub Action to validate hyperlinks in all markdown files
-
html-auto-p
function like
wpautop
in Wordpress. It uses a group of regex replaces used to identify text formatted with newlines and replace double line-breaks with HTML paragraph tags. -
bstr
A string type that is not required to be valid UTF-8
-
recase
Changes the convention case of input text
-
google_translator
Custom Google Translator
-
tpt
Pure Rust implementation of the Unix concatenate (cat), word-count (wc) and echo command
-
crowbook
Render a Markdown book in HTML, PDF or Epub
-
supercat
A syntax highlighting alternative to cat
-
letter-sequence
A method to create sequence displayed as uppercase or lower letters, or digits
-
ewts-cli
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (cli)
-
unic-ucd-age
UNIC — Unicode Character Database — Age
-
acorns
Generate an AsciiDoc release notes document from tracking tickets
-
basalt-core
core functionality for Basalt TUI application
-
escape-bytes
Escapes bytes that are not printable ASCII characters
-
advanced_string_generator
A command-line tool for generating strings based on customizable regex patterns
-
match-pinyin-with-hanzi
Checks whether the sentence in Chinese characters (汉字) matches with the sentence in pinyin (拼音). Erhua is supported.
-
ttaw
talking to a wall, a piecemeal natural language processing library
-
codetypo-cli
Source Code Spelling Correction
-
fast_trie
A memory efficient trie library
-
poriborton
Interconversion between Unicode and various Bengali ANSI encodings
-
hyper-static-server
friendly library to build static servers with hyper HTTP server
-
quickner-core
A fast and simple NER tool
-
google-fonts
Download and cache TTF fonts from Google
-
unicodeit-cli
The command line interface to unicodeit
-
eaverdeja-minigrep
minigrep from chapter 12 of the Rust lang book
-
royal_road_archiver
An archival program and library for the webnovel site RoyalRoad
-
jfmt
command-line tool for formatting json files in both readable and compact formats. It supports stdin/stdout shell usage, as well as working on files directly.
-
next-pagefind
Pagefind for next.js non output export applications. Fully crawl and index your app in one command.
-
globber
Extended glob matching library
-
latex2mathml
Convert LaTeX equations to MathML
-
mask-text
mask text with multiple masking options
-
string-overlap
A helper crate for "layering" ASCII art
-
engish
A language utility for sampling letters and building words
-
rnltk
Natural Language Toolkit for Rust
-
odict
A blazingly-fast dictionary file format for human languages
-
uo_rst_parser
fork of rst_parser with fixes for upstream-ontologist
-
unveil-rs
Unveil Rs is a tool to create presentations from markdown files
-
flxy
Full-text searching and scoring of strings
-
jira-clean
clean up Jira task description that is an output of jira-cli tool
-
lorem-rustum
lib for generating lorem-ipsum with a rusty fleur
-
blitztext
fast keyword extraction and replacement in strings
-
gazetta-render-ext
A static site generator framework. Extra render code.
-
xmlwriter
streaming XML writer
-
bilingual
A cmdline tool used for markdown translation via calling Chinese translation api cloud services
-
byteutils
that provides a collection of frequently used utility functions for working with bytes, strings, and vectors. It includes common tasks such as converting between strings and byte arrays…
-
salign
Align and prettify comments in asm files
-
hydroper_source_text
Source text containing line locations
-
lindera-decompress
A morphological analysis library
-
salvation-cosmic-text
Pure Rust multi-line text handling
-
markdown-toc
Markdown Table of Contents generator
-
loe
Very fast and yet another line ending (CRLF <-> LF) converter written in Rust
-
mdbook-najan
Preprocessor for the Najan mdBook
-
regex-split
split_inclusive for the regex crate
-
untanglr
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies
-
encoding-next-index-singlebyte
Index tables for various single-byte character encodings
-
cesu8-str
CESU-8 and Java CESU-8 string validation and manipulation
-
csv2ndjson-lite
A little tool to convert a csv to a valid ndjson/json-stream
-
analogize
log analyzer
-
geoipsed
Inline decoration of IPv4 and IPv6 address geolocations
-
asimov-sdk
ASIMOV Software Development Kit (SDK) for Rust
-
mdplayscript
An extension of Markdown for play scripts
-
just-enough-emojis
text to emoji cli
-
text-tokenizer
Custom text tokenizer
-
typship
A cli for typst packages
-
panda-re-sys
The official *-sys library for interfacing with PANDA (Platform for Architecture-Neutral Dynamic Analysis)
-
opstr
‘Operate on strings’ command line utility
-
record-query
doing record analysis and transformation
-
slicestring
slicing Strings
-
tagalyzer
A CLI tool to gather statistics on collections of plaintext-adjacent files
-
harfbuzz
Rust bindings to the HarfBuzz text shaping engine
-
roan-engine
The core engine for the Roan project
-
punycode
Functions to decode and encode Punycode
-
lspt
Language Server Protocol (LSP) types made easy
-
datatroll
a robust and user-friendly Rust library for efficiently loading, manipulating, and exporting data stored in CSV files
-
english
language decliner
-
kakasi
Romanize hiragana, katakana and kanji (Japanese text)
-
khaiii-rs
Bindings to Kakao Hangul Analyzer III (khaiii) for parsing and analyzing Korean text
-
oxcomm
using Google Translate on the fly
-
hunspell-rs
Rust bindings to the Hunspell library
-
vidyut-lipi
A Sanskrit transliterator
-
deinflect
japanese deinflection
-
md-bakery
Markdown Bakery CLI app
-
str-utils
some traits to extend types which implement
AsRef<[u8]>
orAsRef<str>
-
allsorts-subset-browser
Temp fork of allsorts 0.15 - includes patch for subsetting fonts for browsers
-
reedy
A terminal-based RSS reader with a clean TUI interface
-
mdbook-metadata
mdBook preprocessor to parse markdown metadata
-
cargo-markdown
Local crates.io readme development server with ultra-fast hot reloading goodness
-
yara-x-cli
A command-line interface for YARA-X
-
shopping-parser
A Rust-based parser for parsing structured product information and shopping lists, supporting multiple currencies and units
-
afrim-translator
Manage the predication system of the afrim input method
-
zummi
fun lib that produces spoonerisms
-
enpsrlib
English Phrase Structure Rules library
-
baidu_trans
百度翻译API
-
computergeneration
compgen but all wrong
-
nesty
Generate code with with human readable indentation
-
derek-minigrep
grep clone
-
serbian-cyrillic-latin-conversion
Serbian Cyrillic to Latin and Latin to Cyrillic conversion library
-
roxy_markdown_parser
Roxy plugin for parsing Markdown
-
perm-text
curling straight/dumb quotation marks ("") and apostrophes (') into their curly/smart (“”’) equivalents
-
forestrie-builder
Build a trie and convert it TokenStream
-
schmfy
Schmfication library
-
marktask
A CLI tool for parsing and manipulating Markdown tasks
-
wildcard_ex
extended wildcards that allows VB-like specifications
-
onig_sys
onig_sys
crate contains raw rust bindings to the oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma… -
rizzer
Fuzzy matching tool to find string similarity
-
c6o-obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
roman_numerals_fn
A function to convert integers to their roman numeral representation as strings. Values from 1 to 3999 are possible, otherwise it returns an OutOfRangeError. Zero has no representation in roman numerals.
-
imperative
Check for imperative mood in text
-
idna-cli
Encode/decode Unicode domain names to/from IDNA ASCII
-
indented_text_writer
IndentedTextWriter
-
st7789_rs
A driver and graphics library for st7789 displays, primarily used on a Raspberry Pi
-
lingua-german-language-model
The German language model for Lingua, an accurate natural language detection library
-
markdown-viewer
Support preview of markdown files
-
ascii-hangman-backend
customizable Hangman game with ASCII-art rewarding for children (backend)
-
squidge
shortens delimited data
-
unicode-vo
Unicode vertical orientation detection
-
maudit
Framework for generating static websites
-
topfew
CLI to find high frequency occurrences in structured text files
-
string-simple
containing some simple string utilities that I use in my other projects
-
rfsee-tf-idf
TF-IDF implementation for rfsee
-
text-to-png
way to render text to a png image with basic options
-
subject-classifier
classifying a commit by it's subject
-
wool
Preview Github Markdown Offline
-
quewuigrep
grep-like tool written in Rust
-
rtss
A command-line tool to annotate stdout/stderr with elapsed times
-
nanoid-dictionary
Popular alphabets for use with nanoid
-
bbcode-tagger
BBCode tree parser and tagger
-
ry
yaml searching
-
aki-xtee
copy standard input to each files and standard output
-
translitrs
Transliteration utility for Serbian language
-
akiaki
A good old fashioned wiki engine with a flat-file database
-
path2regex
Express style path to RegExp utility
-
zoitei
alphabet conversions
-
fip
Field Parser, roughly emulating "awk '{print $<field-number>}'"
-
unicode-normalization-alignments
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
magic_string_rain
magic string
-
icemelter
minimize files that trigger internal compiler errors (ICEs)
-
inslice
A command-line utility for filtering text input by columns and rows
-
strcursor
string cursor type for seeking through a string whilst respecting grapheme cluster and code point boundaries
-
pi_ucd
unicode字符函数,获得字符的语言区间段;及根据文字排版的需要,判断字符是否为单字字符或字母字符
-
mdbook-keeper
An improved testing experience for mdbook
-
mastodon-async-entities
Types for (de)serializing entities from the Mastodon API; part of mastodon-async
-
simple_peg
A command-line peg parser implemented by Rust
-
sejong
Buffer is a buffer that can receive ASCII bytes different from keyboard and send out UTF-32 Hangul string. This buffer allows deletion by Jamo.
-
arabic-script
An expressive API for the characters of the Arabic script
-
runestr-pancjkv
rune
-based Pan-CJKV support -
fiberplane-markdown
convert Fiberplane Notebooks to and from Markdown
-
text_unit
Newtypes for text offsets
-
udp-logger-rs
Log macro for log's kv-unstable backend and a UDP socket logger
-
human_regex
A regex library for humans
-
java_string
Java strings, tolerant of invalid UTF-16 encoding
-
wyrcan-todo
A todo manager for managing todotxt based files
-
sms_splitter
An SMS message splitter and part calculator with support for GSM and Unicode
-
search-in-terminal
A terminal-based search tool
-
common-words-all
Most common words sorted by ngram frequency
-
spf
.spf (Simple Pixel Font) file parsing, and useful api's to go alongside
-
rusty-axml
A parser for Android AXML files
-
correct_word
A No brainer 'did you mean' library for Rust
-
somedoc
A very simple document model and markup generator
-
pomsky-bin
Compile pomsky expressions, a new regular expression language
-
blackboxmc_java
BlackboxMC bindings for java.util
-
quake
knowledge management tool for geek
-
hitori
Generic compile-time regular expressions
-
frontmatter
A Fairly Trivial Wrapper for yaml-rust to Extract Frontmatter from a String Slice
-
cow-rewrite
Rewrite copy-on-write types copying only when it's neccessary
-
pukram2html
converting Pukram-formatted text to HTML
-
peppergrep
grep utility written following the 12th chapter of the Rust book. Some little modifications were made.
-
enso-lazy-reader
An efficient buffered reader
-
apple-notes-exporter
CLI tool for exporting Apple Notes to Markdown
-
divvunspell-bin
Spellchecker for ZHFST/BHFST spellers, with case handling and tokenization support
-
pdfcr
render a codebase to a pdf
-
wordfreq
port of wordfreq for looking up the frequencies of words in many languages
-
tree-sitter-stack-graphs-java
Stack graphs for the Java programming language
-
bookgrep
Basic grep equivalent, minor mods to Chapter 12
-
ccase
Command line interface to convert strings into any case
-
bottomify
Fantastic (maybe) CLI for translating between bottom and human-readable text
-
mdbook-gitbook
mdBook preprocessor to properly render GitBook specific syntax
-
pillar
small tool to format lines into columns
-
markdown-extract-cli
Extract sections of a markdown file with a regular expression
-
story-dl
Story web scraping
-
boringascii
Strings that can only be constructed to contain non-whitespace, non-control ASCII characters
-
rsmorphy
Morphological analyzer / inflection engine for Russian and Ukrainian (soon) languages (WIP)
-
tet_rs
A third-party implementation of Text Entry Throughput (ref. https://doi.org/10.1145/3290605.3300866) for Rust
-
kana-converter
converter for half-width/full-width Japanese language characters (katakana, hiragana, and ASCII)
-
gstring
String with support for Unicode graphemes
-
ucd
Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…
-
asciit
A compact and visually appealing ASCII table for your terminal, featuring colored numbers and letters
-
toktkn
a minimal byte-pair encoding tokenizer implementation
-
crawdad
ChaRActer-Wise Double-Array Dictionary
-
mario_minigrep
first project for minigrep
-
soft-ascii-string
char/str/string wrappers which add a "is-ascii" soft constraint
-
fenx
parsing and handling FEN and algebraic chess notations
-
text-utils
Text utils for unescaping and align
-
libflagup
Display a country's flag as an emoji
-
bubble-bath
Small and quick HTML sanitizer
-
mitex-spec-gen
Guard to geneate specification files for dependent crates
-
detect-indent
Detect the indentation of code
-
nlprule
A fast, low-resource Natural Language Processing and Error Correction library
-
tzgrep
grep tar.gz
-
tantivy-object-store
A tantivy Directory implementation against object stores (S3, GCS, etc.)
-
little_boxes
Adds boxes around stdin. Optionally adds a title
-
unicode-width-16
Determine displayed width of
char
andstr
types according to Unicode Standard Annex #11 rules -
committed
Nitpicking commit history since beabf39
-
mdbook-bib
mdbook plugin allowing to load and present a bibliography in BibLaTex format in your books and cite its references
-
uapi-version
Compare versions according to the UAPI Version Format Specification
-
hebrew
alephbet primatives and parsing library for rust
-
scie
research about how to build simple code identify engine for different languages
-
fuzzywuzzy
A pure-Rust clone of the incredibly useful fuzzy string matching python package, FuzzyWuzzy
-
texting
string helpers
-
Grepulous
An attempt to make a grep like command
-
words-count
Count the words and characters, with or without whitespaces
-
mdbook-check-missing-md
A backend for
mdbook
which will find Markdowns you forgot on SUMMARY.md -
ptero-cli
A text steganography CLI tool for Social Media
-
regexgrep
ripgrep tool that suports regular expressions
-
profane-rs
Check Messages For Profanity/Swearing
-
just-run
Convenience crate for executing system commands with the expectation of successful termination and UTF-8 encoded output, for basic straightforward command execution scenarios
-
yazi-prebuild
Used to place the pre-built assets of yazi (https://github.com/sxyazi/yazi)
-
emojicon
Find Emoji by using Emoticons and GitHub's, Bengali emoji names
-
notegraf
Core library for building a graph-oriented notebook
-
markflowy
A Markdown Editor App
-
left-pad
left-padding for strings
-
align
aligning text
-
commentator
Source code comments extractor binary and SDK
-
hyphenation_commons
Proemial code for the
hyphenation
library -
mdbook-skill-tree
mdbook plugin to show roadmaps
-
recursive-file-loader
recursively load files via references in the files
-
crate-starter
starter
-
mapm
A set of tools, with command line and graphical interfaces, used to build exams typeset in LaTeX
-
zspell-cli
Command line interface for the ZSpell spellchecking library
-
techlead
CLI is a command-line interface that enables developers to chat with an AI assistant powered by the OpenAI GPT language model, designed specifically to help with your Rust project
-
text-colorizer
Transitionary package
-
findtext_sheet
Search text in SpreadSheet
-
reg_match
A match style regex tool
-
incredimo
just another font for your terminal
-
gripx
a cooler alt for grep built in rust
-
sortuniq
Find or count unique values in an input stream
-
stringutils
A collection of various and (hopefully) useful String utility functions
-
prism-js
rust bindings for prism.js syntax highlighting library
-
glyph-names
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
harfbuzz-sys
Rust bindings to the HarfBuzz text shaping engine
-
typst-ts-core
Core function of Typst.ts
-
ranting
Linguistic formatting placeholder extensions for rust
-
ltxcut
formats a table-like stream into a LaTeX-table
-
mdbook-summary-generate
A mdbook preprocessor to generate SUMMARY.md from a directory structure
-
strange
A static website generator
-
haseo
diff command line made simple
-
cklein
High-level safe bindings to the Klein scripting language
-
mdbook-embed
A preprocessor that simplifies embedded URL
-
seq2xypic
Turn a text sequence diagram into a LaTeX xypic diagram
-
markdown-gen
generating Markdown files
-
cnpj
Brazilian CNPJ parsing, validating and formatting library
-
rep-grep
wgrep/write-grep CLI
-
minigrep_dqy
A mini command for grep like linux
-
chunkr
A fast and quick chunking library for rust
-
revstr
Simply reverses strings
-
fontship
A font development toolkit and collaborative work flow
-
campfire
A tiny static site generator, greatly inspired by Zola
-
imagecli
A command line image processing tool
-
character_text_splitter
splitting text into chunks with overlap, designed for handling large amounts of text efficiently. Implementation is identical to langchain's CharacterTextSplitter
-
chardet
rust version of chardet
-
kvarn-chute
A Markdown converter designed to use the Kvarn templating engine
-
markdown-formatter
Flavored Markdown (ZH) content formatter
-
rupantor
A Bengali Phonetic Parser which is very flexible and supports Avro Phonetic
-
mdbook-davids_cooking
A preprocesor for whatever https://davidsotomarchena.gitlab.io/davids-cooking/ needs
-
spare
colorful format iterable
-
guy
Take your terminal to Flavortown
-
mdbook-rustviz
An mdbook preprocessor that allows users to embed RustViz visualizations into mdbook projects
-
polars_arrow_rvsry99dx
Apache Arrow
-
uwu-rs
uwuifying library
-
kansuji
漢数字と数字の相互変換のためのライブラリ
-
is-vowel
Heuristically test whether a character is a vowel letter
-
markx
markdown parser
-
mdbook-scientific
Enables inline equations for mdbook to set by $..$ signs and $$..$$
-
slack_update
app to set Slack status, emoji and photo
-
trevordmiller
Personal CLI
-
rusticsearch
A lightweight, Elasticsearch-compatible search server (early WIP)
-
esl01-renderdag
Render a graph into ASCII or Unicode text
-
yeslogic-unicode-script
Fast lookup of the Unicode Script property
-
cha-rs
Extract specific characters from an input
-
trigram
Trigram-based string similarity for fuzzy matching
-
bitflip
functions to generate bitflips of binary and UTF-8 strings
-
fmty
Composable
core::fmt
utilities -
rustrings
Strings manipulation for Rust
-
meaningsearch
package that helps you find meaningful lines of any given input. Especially useful in CTFs.
-
qpprint
console printing/formatting
-
mdbook-typstpdf
An mdBook backend that generates PDF output using Typst
-
unic-ucd-normal
UNIC — Unicode Character Database — Normalization Properties
-
minigrep_xiaoai
一个简单的命令行工具,用于在文件中搜索字符串。
-
stringslice
A collection of methods to slice strings based on character indices rather than bytes
-
quilltex
open-source Rust library designed to convert LaTeX documents into a Delta format that can be used with Quill.js and vice versa
-
tagsearch
Filter plaintext files based on @keyword tags
-
porter-stemmer
Flexible and unicode friendly, Porter stemmer implementation
-
ufofmt
A fast, flexible UFO source file formatter based on the Norad library
-
tnipv-lint
lints for tnipv, the Telcoin Network Improvement Proposal validator
-
yinzhe9
喵喵隐者9
-
scan-lib
A directory searcher library for rust
-
outerspace
Methods for prefixing and suffixing the non-whitespace characters in a string
-
mepple
English dictionary as a library
-
cur
that will hunt for your regular expression
-
twitter-text
in Rust
-
findtext_doc
Search text in Document
-
base16-rs
in Rust offers capabilities for encoding and decoding data in Base16 format. By utilizing the hex library, you can transform data into its hexadecimal representation and also decode…
-
remake
writing maintainable regex and managing symbol soup
-
kanyey
cli tool for generating quotes in your terminal from Kanye West
-
aprilasr
High-level wrapper for the april-asr C api (libaprilasr) using aprilasr-sys
-
mindmap
Search your notes at the speed of thought
-
mdlc
Markdown Link Checker. Find broken web and local links.
-
minigrepns
A mini version of famous grep application that searches texts on files
-
basic-text-internals
Basic Text string literal implementation details
-
normalize-hebrew-rs
package that normalizes special symbols within Hebrew string used in the Qumran-Digital project
-
rustsay
CLI tool in Rust that mimics the classic cowsay program, allowing a cow to speak your text in the terminal
-
minigrepwebdot
Minigrep is a command-line utility tool that helps to search for occurences of words on a file
-
dokkoo
Mokk (Macro Output Key Kit) implementation written in Rust
-
carnation
some string operators
-
whitespace-conf
Key-value configuration file delimited with whitespaces
-
slicedisplay
Simplistic Display implementation for Vecs and slices
-
clis
a simpl search/fuzy finder
-
japhonex
Japanese phone number checker for Rust
-
ftd
ftd: FifthTry Document Format
-
rustfmt_emitter
Rustfmt emitter library
-
help_crafter
help message generator without hussle
-
korean
hangul manipulation
-
spigot
parser for valve's keyvalue file format (gameinfo.txt, vmt, etc.)
-
halfcaps
tRaNslAtE aNy TeXt To ThIs
-
bpmf_py
A Bopomofo and Pinyin library
-
monkey-printer
infinite nr of monkeys you could write Shakespeare
-
mdbook-quiz-validate
Input validation for quizzes used in mdbook-quiz
-
stam-python
STAM is a library for dealing with standoff annotations on text, this is the python binding
-
markdown-it-autolink
A markdown-it plugin for parsing GFM autolinks
-
pangu
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
mystem
Wrapper around Yandex Mystem for Rust
-
delay_writer
Wraps a writer and delays its output after each newline
-
extract-strings
Extract ascii strings from files
-
html_to_epub
A command line converts .html file to .epub file
-
encoding-index-singlebyte
Index tables for various single-byte character encodings
-
fast_aug
Fast data augmentation for text
-
adib-say-hello
say hello and say goodbye library
-
szovegertesimutato-score
Calculate szovegertesimutato score for a given text and language
-
ucd-raw
Uninterpreted access to the unicode UCD
-
comment-strip
Remove comments out of text files
-
rust-crate-grrs-jesse
search files
-
pst
publish posts to Micro.blog
-
xconv
A high-performance batch file encoding conversion tool
-
hello_lib
Demonstrate Generics Function
-
rmemo
Tools for taking notes fast on the CLI
-
concatenator
Add two pieces of text together
-
text_layout
Text layout algorithms
-
ellipse
Truncate and ellipse strings in a human-friendly way
-
fifthtry-mdbook
fork of mdbook, only for ft-cli
-
wcounter
Give the word and count the appearance
-
mdbook-open-git-repo
mdbook preprocessor to add a open-on-git-repo link on every page
-
jcalendar
Japanese Calendar for Rust
-
pulldown-html-ext-cli
CLI tool for extended HTML rendering of Markdown with pulldown-cmark
-
segtok
Sentence segmentation and word tokenization tools
-
jp-location-relation
隣接する市区町村の一覧を取得
-
wasmer-wit-parser
wit-bindgen-gen-c
-
invisible_unicode
finding invisible unicode characters
-
allwords
Generate all the words over a given alphabet
-
char_reader
Safely read wild streams as chars or lines
-
logisheets_parser
the parser of excel formula
-
clafrica
This application allows you to type most of the characters in the african alphabet in any text field
-
libphonenumber-sys
rust ffi bindings to libphonenumber
-
bigsi_rs
A in-memory implementation of a BIGSI-like data structure
-
nxfetch
A minimal, fast and batteries included fetcher!
-
mdrss
generating RSS feeds from markdown files
-
chinese
language nlp tools
-
cli-colors
A CLI tool for outputting text in ANSI format with features like colors, underlining, boldening, and italicizing
-
mdbook-bibfile-referencing
An mdBook preprocessor to add bibfile referencing to each page
-
ferrissay
cowsay
-
trunc8
Truncate text to a specific line length, based on a number of parameters
-
flashtext2
The FlashText algorithm implemented in Rust
-
chunk_norris
splitting large text into smaller batches for LLM input
-
amongify
A very ඞ sus ඞ program
-
string_macros
Small proc macro library for handling string literals
-
daumdic
Daum Dictionary API wrapper
-
codetypo
Source Code Spelling Correction
-
ctf-brute
Brute-force utilities for Rust
-
markdown-composer
composing markdown documents
-
zindex-scanner
A CLI tool to scan and analyze z-index definitions in JavaScript/TypeScript files
-
pangu2
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
wtf8
WTF-8 encoding. https://simonsapin.github.io/wtf-8/
-
meme_generator_utils
Meme generator utils
-
esc
Escape characters in strings
-
fuzzy-string-distance
Fuzzy string distance comparisons
-
termwrap
Wrap Unicode text with ANSI color codes
-
notoize
that tells you what Noto font stack you need
-
tectonic_engine_bibtex
The
bibtex
program as a reusable crate -
pandoc-ac
pandoc filter for converting acronym codes to LaTeX
-
text_magic
string manipulation, including reversing strings and checking if strings are palindromes
-
forgiving-htmlescape
HTML entity encoding and decoding, with support for leaving malformed entities intact
-
dialogue-rs
parsing dialogue scripts
-
butterkups-minigrep
Mini grep utility; very weak application, use grep instead
-
kspconfigtool
KSP1 ConfigNode parser and block removal tool
-
mdbook-twiki
twiki backend for mdbook
-
runiq-lib
An efficient way to filter duplicate lines from input, à la uniq
-
rustdoc-include
importing external Markdown files into
*.rs
file as doc comments -
chinese-ner
A CRF based Chinese Named-entity Recognition Library written in Rust
-
yeslogic-ucd-parse
parsing data files in the Unicode character database
-
leetcode-picker
Command line app for picking leetcode quiz
-
slidedeck
Create an HTML slide deck from Markdown
-
angr
analyse ngrams in text files
-
ripgrep
line-oriented search tool that recursively searches the current directory for a regex pattern while respecting gitignore rules. ripgrep has first class support on Windows, macOS and Linux.
-
grader
Stream-based CLI for binary sorting text files via a given shell command
-
hunspell-sys
Bindings to the hunspell C API
-
extract-words
Extracts words from text without allocation
-
thousand_birds_deno
deno executable
-
esre
alt regex library
-
is-digit
Detect decimal digit in char or first char of the str and String
-
strip_markdown
remove markdown syntax from markdown files
-
bigstr
A command-line tool to make string BIG
-
koelner-phonetik
koelner_phonetik or cologne phonetics is a phonetic algorithm like soundex, but specialized for german words
-
AsgoreCore
A small rust library to manipulate arabic text to fit in non-supporting arabic games or programes
-
unicode_types
A mapping of all the unicode characters into convenience types (one enum per block of characters with one variant per character)
-
book_lib
that provides an API for managing PDFs on your mac device in one place
-
textspan
Text span utility
-
branchout
Quick and easy ASCII tree of a directory
-
levenshtein_lite
No-frills implementation of a Levenshtein Automata and the Levenshtein Distance function
-
indentsort
Structure-preserving sorting of arbitrary indented text
-
hydroperfox-sourcetext
Source text containing line locations
-
esperanto-text
Convert Esperanto text between UTF-8, x-system and h-system transliterations
-
octor
rmd combines all readmes into one
-
symspell
Spelling correction & Fuzzy search
-
rust_baht_text
Convert number to Thai Baht text
-
bge
Rust interface for BGE Small English Embedding Library
-
cologne_phonetics
generate phonetic cologne codes for utf8 strings
-
mdbook-hide
A preprocessor for mdbook that adds support for hidden chapters
-
cofe
tiny string similarity crate
-
termcolors
Format text and display colors in the terminal
-
cabocha
Safe Rust wrapper for cabocha a japanese language dependency structure analyzer library
-
vape
full width aesthetics
-
case-conv
Faster case conversion crate
-
yzb64
Ytrizja base-64 specialization
-
typeline
Efficient, Type-Safe Pipeline Processor
-
libxdiff
Rust bindings for the libxdiff C library
-
mime_4
Strongly Typed Mimes
-
unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
strmatch
Conditionally match strings in Rust using regex without much boilerplate
-
uwubot
discord bot for uwuifying text
-
bibutils-sys
Rust bindings for bibutils, a program for bibliography format interconversion
-
lorgn_lang
a general purpose scripting language optimized for graphical programming
-
minigrep_joshua
tutorials
-
str_overlap
Methods for finding the overlap between two string slices
-
glob-match
An extremely fast glob matcher
-
grepox
Minimalist's grep written in Rust
-
encoding-index-tradchinese
Index tables for traditional Chinese character encodings
-
sayit
String replacements using regex
-
string_py
aims to make the String type as easy to use as the str type in python
-
guarding
guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.
-
wantora
wantora工具
-
rmbs
Remove any fluff, corporate speak, or other bullshit from input text and print the TL;DR essence of what's being said, using the www.bullshitremover.com public LLM API
-
timfmt
A small utility for formatting code as Tim likes it
-
lindera-ipadic-builder
A Japanese morphological dictionary builder for IPADIC
-
product-os-content
Product OS : Content provides a complete solution for content management for the purpose of serving content via Product OS : Server
-
runanum
Существительные с правильными окончаниями после чисел
-
axum-toml
Axum extractor for TOML
-
ragtime
Easy Retrieval Augmented Generation
-
todo-to-issue
CLI tool that converts forgotten TODO comments into actionable GitHub issues
-
beediff
LCS algorithm in various applications
-
hex-utilities
working with hexadecimal numbers
-
h4x_re
Hacky Regex's
-
numbers_into_words
Command-line utility and library for writing a positive integer as English words
-
mdtoepson
Filter to change from markdown format to Epson ESC codes for my Panasonic KX-2123
-
baselinker
BaseLinker.com API client
-
print-positions
providing string segmentation on grapheme clusters and ANSI escape sequences for accurate length arithmetic based on visible print positions
-
okh-tool
A CLI tool to deal with Open Know-How (OKH) data files. Its main functionalities are: validation of and conversion between the different formats
-
lazy-string-replace
A lazy version of
String::replace
, so that it can be formatted or recursively replaced without intermediate allocations -
mdbook-chapter-zero
A mdBook preprocessor that allows 0th (sub-)chapter
-
ctrl-z
A composable reader to treat
0x1A
as an end-of-file marker -
less
pager utility for displaying file contents or piped input, with dynamic scrolling and search functionality
-
punkt
sentence tokenizer
-
onepage
static site generator
-
contractions
expand contractions in English
-
ipsae-core
markdown parser for DIY lover
-
spider_transformations
Transformation utils to use for Spider Web Crawler
-
logseq
Handle Logseq Markdown files in Rust
-
fast2s
A fast Traditional Chinese to Simplified Chinese conversion library. Built with FST, faster than most of other libraries.
-
mdbook-fix-cjk-spacing
mdbook preprocess that fixes CJK line breaks
-
ranpha
Generate QR code of your Wi-FI network
-
ut1_blocklist
UT1 blocklist URL/domain filters
-
textcat
detect text categories. It can be used to detect the language of a given text
-
cursed_strings
Annoyed that Rust has two string types? Well it doesn't any more
-
spacemod
A easy to understand and powerful text search-and-replace tool
-
llmvm-codeassist
A LLM-powered code assistant that automatically retrieves context (i.e. type definitions) from a Language Server Protocol server.
-
base16384
Encode binary file to printable utf16be, and vice versa
-
weirdgrep
Weird grepping tool for huge pages of code
-
extstd
intended as an extension of the standard library
-
tokengrams
Compute n-gram statistics and model language over pre-tokenized text corpora used to train large language models
-
khmercut
A blazingly fast Khmer word segmentation tool written in Rust
-
korean_regex
Regex extension for Hangeul analysis
-
mdbook-force-relative-links
An
mdbook
pre-processor to transform all local links to relative ones -
bloodhound
Fuzzy file finder
-
markovish
Markov chain implementation for text generation
-
aki-mcycle
mark up text with cycling color
-
gazetta-cli
A static site generator framework. Shared CLI code.
-
mdbook-multicode
Allows you to give multilanguage code examples, toggled by a spinner
-
render_as_tree
visualizing tree data structures via text
-
notedown_ast
Notedown Abstract Syntax Tree
-
azusa
String index transformer for Rust utf8 to JavaScript utf16
-
ngram
Iterator adaptors for n-grams and k-skip-n-grams
-
mdbook-indexing
mdbook preprocessor for index generation
-
awabi
A morphological analyzer using mecab dictionary
-
anthropic-text-editor
A micro-CLI to apply tool calls from Anthropic for their text_editor_20250124 built-in computer use tool
-
minigrep5
grep implementation in Rust
-
ruSTLa
A reStructuredText → LarST ⊂ LaTeX transpiler
-
semchunk-rs
A fast and lightweight Rust library for splitting text into semantically meaningful chunks
-
smoltoken
A fast library for Byte Pair Encoding (BPE) tokenization
-
ragzilla
providing tools for RAG (Retrieval-Augmented Generation) pipelines
-
xee-xpath
XPath 3.1 library API
-
mdbook-tools
A collection of tools for mdbook
-
group-similar
Group similar values based on Jaro-Winkler distance
-
genkit
A common generator kit for static site generator
-
http
A set of types for representing HTTP requests and responses
-
codegenrs
Moving code-gen our of build.rs
-
rahat3062_minigrep
A light-weight & minimal implementation of the grep cli app
-
mdbook-snips
Markers for hidden lines in rust blocks within an mdbook
-
mdx
in Rust
-
morph-rs
Dictionary Morphologizer for Russian language
-
encoding8
various 8-bit encodings
-
lorem-ipsum
Quickly generate placeholder test
-
lingua-french-language-model
The French language model for Lingua, an accurate natural language detection library
-
md2gemtext
for converting Markdown into gemtext
-
auk_markdown
Markdown support for Auk
-
clipcat
A command line tool for copying the contents to clipboard of multiple files in one go
-
regex-map
Associative container where the keys are regular expressions
-
beautify
your terminal
-
unindenter
unindent text
-
prune
struct
-
node-emoji
Convert
:emoji:
to Unicode using GitHub’s and EmojiDB’s emoji names -
uwl
A management stream for bytes and characters
-
codex
Human-friendly notation for Unicode symbols
-
mail-internals-ng
[mail-api] _internal_ parts for the mail-api crates
-
spongebobizer
Command-line utility that outputs its stdin, converted to 'sPonGeBoB cAsE', and a library to support it
-
darkdown
A darkdown(our own markup language) parser written in rust
-
rust_lemmatizer
A lemmatizing package for use with a .csv dictionary of lemmas and their corresponding words
-
mdbook-numthm
An mdbook preprocessor for automatically numbering theorems, lemmas, etc
-
quill-delta-rs
Quill editor Delta format in Rust
-
glifnames
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
text-template
Small template engine for use with plain text (e.g. creating text email), not intended for HTML.
-
mdbook-image-size
A mdbook preprocessor which support image size syntax
-
rreplace
designed to streamline string replacements. It can handle multiple unique replacements and iterates the string only once.
-
kvu
The simplest command line tool to manage key-value pair lines
-
twas
A text substitution application for using random look-up tables to generate text in a manner similar to the Mad Libs game
-
newslookout
A web scraping platform built for news scanning, using LLMs for text processing, powered by Rust
-
rut
A small UTF-8 parsing library for applications that need to parse individual
char
s -
wcc
my own version of wc for personal use
-
redpatterns
a list of patterns for scanners 📟
-
dumbfuzz
dumb library for fuzzy search
-
rand-hira
CLI tool to generate random hiragana characters
-
mdtohtml
markdown to html renderer (with a couple of missing features)
-
alpha-counter
Alphabetic counter
-
blazingly_fast_rust_donut
Generates a rotating donut in the terminal using ASCII art
-
cumaea
handle prompts for user input
-
markdown-linkify
Markdown preprocessor for substiting link shorthands to valid links according to configurable regexes and custom substitution implementations
-
sims
Simplistic string search
-
igpay-atinlay
Translate text to Pig Latin
-
catdream
Sleeping cat dreams your text
-
ultron-ssg
A syntax highlighting library ideal for usage in a static site generator
-
rls-vfs
Virtual File System for the RLS
-
milkbox
A collections of daily utils
-
zalgo-codec
Convert an ASCII text string into a single unicode grapheme cluster and back. Provides a macro for embedding Rust source code that has been encoded in this way.
-
thesauromatic
command-line thesaurus that returns related words when given a word. The output words are one per line, making it easy to process in shell pipelines.
-
local_strtools
Collection of string related utilities
-
toresy
term rewriting system based on tokenization
-
kpathsea
Rust interface to the kpathsea TeX file management library
-
unicode_reader
Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters
-
caser
Change text between PascalCase, camelCase, and snake_case
-
iconv
bindings for Rust
-
codes-iana-charset
This package contains an implementation of the IANA Character Set registry
-
shear
trimming excess contents from things
-
gpl-memo
Gemachain Program Library Memo
-
deno_tauri
deno executable
-
summary
Extract the sentences which best summarize a document
-
elden-ring-saver
ansi2
-
uiua-doc-gen
Documentation generator for Uiua libraries
-
encoding_rs_transcode
Transcode text within writers using encoding_rs
-
parser-web
Web API for extracting text from various file formats
-
leetcode
solutions in Rust
-
ucd-util
A small utility library for working with the Unicode character database
-
code-tour
Enhanced example-based learning, i.e. awesome examples user experience
-
texoder
A text stream which can encode/decode text in several encoding formats
-
sarcasm
tExT creation and validation library
-
chemstring
A parser that converts strings to their representation using chemical element notations
-
todo_r
command line utility that keeps track of your todo comments in code
-
ruby-string
A string type that tracks Ruby glosses attached to parts of it
-
munemo-rs
Turn an integer into a more rememberable word, or vice-versa
-
rustplexity
bigram-based perplexity calculator, useful for filtering out boilerplate or other abnormal text
-
mdbook-to-example
Turns an mdbook book into a Rust example
-
cvicenie_2
Cvicenie 2
-
text_alignment
Align your text in Rust in the CLI
-
tashkil
A lightweight library for removing Arabic diacritics
-
findtext_textfile
Search text in text file
-
naming_clt
Extract and convert the naming format(case|notation) of identifiers from files or stdin. Use this tool to prepare identifier name strings for further operations (matching,replacing...) on relative files
-
shapdf
Create Shapes into PDF
-
infisearch_common
Internal library for other InfiSearch packages
-
hearthstone
simulator written in Rust
-
mdbook-bash-tutorial
A mdbook preprocessor that allows embedding Bash scripts as tutorials
-
static_table
creates pretty tables at compiler time
-
agldt
Tools for handling data conforming the standards of the Ancient Greek and Latin Dependency Treebank
-
korrektor
work with Uzbek language text processing
-
liwe
IWE core library
-
saneput
Sane input reading library
-
mdast2minimad
converting markdown AST to minimad texts
-
wcrs
GNU wc in Rust
-
cindex
CSV indexing library
-
ohos-ime-sys
Bindings to the
inputmethod
API of OpenHarmony -
ucf
A universal code formatter
-
ftrace
trace files and paths
-
minigrep_elijahkx
MiniGrep is a Rust-based command-line tool, with a (current) size of 588KB that lets users search files for a given query string and shows matching lines with their line numbers
-
lindera-cli
A morphological analysis command line interface
-
mdbook-docslab
mdBook preprocessor for interactive code with docslab
-
rss4mdbook
a generator for mdBook as CLI tool, export RSS.xml into u want path
-
ansi-width
Calculate the width of a string when printed to the terminal
-
sqlify
CLI tool for formatting SQL queries
-
spandex-hyphenation
Knuth-Liang hyphenation for a variety of languages
-
xim-ctext
compound text en/decoder
-
rmw-utf8
Short text compression algorithm for utf-8 (optimized for Chinese , developed based on rust programming language). 面向utf-8的短文本压缩算法(为中文压缩优化,基于rust编程语言开发)。
-
parser-cli
Command-line interface for extracting text from various file formats
-
uiuifree-normalize
uiuifree text normalize
-
perlin
A lazy, zero-allocation and data-agnostic Information Retrieval library
-
my_mini_grep
A mini application that aims to replicate the behavior of the grep shell command
-
rvim
A text editor in rust
-
repa
Peak Performance Pattern Seeker
-
litegrep
A basic tool for searching in files for lines of text, based on a query
-
omgwtf8
Optimized-Matching-Generalized Wobbly Transformation Format — 8-bit
-
trie-match
Fast match macro
-
simple_bencode
bencode encoder and decoder, that uses neither rustc-serialize or Serde. Instead, it serializes from / deserializes to a tree using a 4-branch enum.
-
sentencepiece
Binding for the sentencepiece tokenizer
-
moon-phases
Fast command-line application to show the moon phase
-
vec-string-to-static-str
providing utilities for converting vectors of
String
s into vectors of&'static str
-
matcher_c
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
encoding-utils
A utils to help with encoding and decoding os strings and more
-
svgrep
A grep-like utility for separated-values files written in Rust
-
lingua-danish-language-model
The Danish language model for Lingua, an accurate natural language detection library
-
toml_to_table
pretty print TOML as a table
-
mdbook-svgbob2
Alternative mdbook preprocessor for svgbob
-
dr
Command-line data file processing in Rust
-
write16
A UTF-16 analog of the Write trait
-
merge-whitespace-utils
Procedural macros for merging whitespace in const contexts
-
aoutils
A tiny utilities package to test publishing to crates.io
-
ascii-hangman-webapp
customizable Hangman game with ASCII-art rewarding for children (webapp version)
-
hex_d_hex
HexDHex is a Rust Crate that encodes and decodes byte data to and from its hexidecimal representation. For instance, one may wish, on ocasion that is, to translate a utf8 or ASCII string…
-
utils_rust
这是一个用于各种实用功能的 Rust 库
-
wordninja
port of the Word Ninja English word splitting library
-
notedown-error
Notedown Error Handlers
-
humnum
Human numeric sorting program — does what
sort -h
is supposed to do! -
jp_utils
Utils for working with Japanese text
-
re_view_text_document
view that shows a single text box
-
indenter
A formatter wrapper that indents the text, designed for error display impls
-
xlsxwriter
Write xlsx file with number, formula, string, formatting, autofilter, merged cells, data validation and more
-
moenster
mønster (n) - pattern. simple glob-style pattern matching for strings
-
fuzzy_mime
A Mime-Type parsing library for rust
-
unicode_escape
decoding escape sequences in strings
-
crop
A pretty fast text rope
-
stringedits
Edit trait and associated iterators for small edits to strings
-
tinytoken
tokenizing text into words, numbers, symbols, and more, with customizable parsing options
-
cyrla
two-way conversion between latin and cyrillic script
-
aki-unbody
output first or last n lines, like a head and tail of linux command
-
regex-automata
Automata construction and matching using regular expressions
-
rep-cli
Replace text file in bulk
-
txt_to_md
Command converting from a txt file to a markdown file
-
table_to_html
interface to convert a
tabled::Table
into a HTML table (<table>
) -
betacode
conversion
-
regex-intersect
Find out if two regexes have a non-empty intersection
-
quake_text
Utils for Quake strings and characters
-
sttx
belt for transforming speech-to-text data
-
github-slugger
A slugger for GitHub headings
-
mdbook-latex
An mdbook backend for generating LaTeX and PDF documents
-
regex_quote_fixer
Rewrites grep regexpressions for the use in the regex crate
-
mdbook-footnote
mdbook preprocessor for footnotes
-
jpreprocess-njd
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
mdbook-preprocessor-boilerplate
Boilerplate code for mdbook preprocessors
-
byte-num
converting numbers to bytes, and bytes to numbers in base 10!
-
sourcepawn_lsp
Language Server implemention for the SourcePawn programming language
-
worcher
full-text search for static websites
-
api_key
Generate api key in rust supports base32, base62, string, uuid4, uuid5
-
wdg-base64
The Base64 Data Encoding
-
kindle2cbz
extracting images from kindle books in MOBI format to CBZ archives
-
mdbook-typst-pdf
mdbook typst pdf backend
-
rex-regextract
extracts key value pairs out of text
-
strings
String utilities, including an unbalanced Rope
-
p4d-mdproof
Markdown to PDF converter
-
markitdown
designed to facilitate the conversion of various document formats into markdown text
-
libgrep-rs
searching through text
-
prettify-markdown
Format Markdown at the speed of Rust
-
tdrip
command-line tool to easily remove headers and metadata from text
-
fwuffgrep
Basic implementation of a grep command written in rust
-
olagem
Typing speed test in the terminal
-
redact-engine
Protect confidentiality with dynamic redaction by replacing sensitive data from string or JSON format
-
swc_plugin_import
babel-plugin-import rewritten in Rust
-
whitespace_text_steganography
A steganography strategy that uses whitespace to hide text in other text
-
adbook
Creates a book from AsciiDoc files
-
uecho
The unicode of the echo command
-
repub
convert markdown documents to epub
-
veloci_levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
text_manipulation_rs
generating random placeholder text in different languages
-
mdbook-svgdx
mdbook preprocessor to convert svgdx fenced code blocks into inline SVG images
-
image-to-ascii
Converts images and gifs to ascii art
-
webreg
A CLI tool for testing regexes against web pages
-
marko
Programmtically format text with Markdown syntax
-
teddy
A SIMD-accelerated multistring searcher
-
read_chars
An iterator over characters read from some I/O source
-
markdown_to_html_parser
parses Markdown syntax into HTML
-
admerge
Merge multiply sources into one, with advanced options
-
okkhor
English to Bangla phonetic conversion following the 'Avro' rules
-
lindera-ko-dic
A Japanese morphological dictionary for ko-dic
-
markdown-it-latex
Allows for the insertion of math in Markdown documents using LaTeX
-
markdown-table
Creating markdown tables with Rust!
-
ucfirst
Uppercase the first letter of a string
-
genex
Text-expansion library
-
mdxbook
Fork of mdBook, with more customizations and flexibility for programmers
-
carlo_grep
A fun game where you guess what number the computer has chosen
-
irssi-sys
Automatically generated bindings to irssi
-
md-inc
Include files in Markdown docs
-
mdbook-nix-eval
mdbook preprocessor for evaluating nix expressions
-
kudubot-bindings
Rust Bindings for the kudubot framework
-
fuzzysearchrs
Fuzzy search for finding strings in string with levenshtein distance
-
sesdiff
Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings in column A to the strings in column B. Also provides the edit distance (levenshtein).
-
text-diff
text diffing and assertion library
-
kasedenv
Read environment variables by lower, upper case or case-insensitive keys
-
sparklet
small flashcards library
-
string_morph
string case transformations with an emphasis on accuracy and performance. The case conversions are available as functions as well as traits on String types.
-
nutrimatic
Tools for reading Nutrimatic (https://nutrimatic.org) index files
-
gen3-charset
Pokemon Generation 3 Character Set Support (GBA)
-
neardup
near-duplicate matching
-
termbook-cli
termbook
is a command-line tool to buildmdbook
’s while executingbash
codeblocks and collecting their output to become part of themdbook
-
framework
detector for different frameworks in one projects
-
md-dir-builder
Webserver for serving all markdown files in a directory
-
wn-parser
parser for WordNet database files
-
mle
The markup link extractor (mle) extracts links from markup files (Markdown and HTML)
-
emojicons-2021
Parse :emoji: notation to unicode representation
-
caribon
A repetition detector program and library
-
wit-bindgen-gen-markdown
Markdown generator for WIT and the component model, typically used through the
wit-bindgen-cli
crate -
clippy_lints
A bunch of helpful lints to avoid common pitfalls in Rust
-
terraphim-markdown-parser
Terraphim Markdown Parser
-
romulus
a stream editor like sed
-
chars_counter
The trait that implements character counting for the &str type
-
minigrep_santunioni
A lightweight version of grep
-
ogrep
searching in indentation-structured texts
-
chanoma
Characters Normalization library. 文字列正規化処理用のライブラリです。
-
uniwhat
Display the unicode characters text
-
doc-chunks
Clusters of doc comments and dev comments as coherent view
-
split_ext
Extension traits for splitting
-
afrim-memory
handle of sequential codes easier for an input method
-
skribo
low-level text layout
-
yeslogic-fontconfig-sys
Raw bindings to Fontconfig without a vendored C library
-
uiuifree-text-data
csv and excel convert
-
haoxue-dict
Chinese dictionary and word segmenter
-
kanjidic_types
A collection of types encompassing the variety of data about kanji available from Kanjidic
-
morse-nostd
A nostd version of the morse crate
-
aaa
CLI tool for work with 3a files
-
asimov-core
ASIMOV Software Development Kit (SDK) for Rust
-
kth-lines
Command line tool for filtering stdin lines that just work
-
macro_colors
colorful printing macros
-
zbuf
“Zero-copy” string and bytes buffers
-
equt-md-ext
Extend event iterator
-
spel
A fast spell checker for everyone
-
genpdf
User-friendly PDF generator written in pure Rust
-
chinese2digits
The Best Tool of Chinese Number to Digits. A useful tool in NLP and robot project.
-
markov-text
creating a small markov model for text generation
-
noodler
A port of the python-ngram project that provides fuzzy search using N-gram
-
cringify
Annoy your friends with the cringified text
-
kincaid
A word statistics library in Rust
-
prettythanks
frontend to dtolnay/prettyplease library
-
wz
Count words, fast
-
com-croftsoft-lib-string
CroftSoft String Library
-
uniart
A CLI tool to convert images and gifs to terminal characters
-
yagenerator
Application that uses tinytemplate engine to generate text files. If you have a set of structured data, and need to generated a bunch of arbitrary types of files from it, this tool can help you to save some time.
-
trans-case
Transform case
-
retest
Command-line regular expression tester
-
anon-csv-cli
anonymise CSV files, providing various options to substitute real data with plausable fake data
-
spellabet
Convert characters into spelling alphabet code words
-
syllable
counter for use with reading level calculations
-
smoldown
Native Rust library for parsing Markdown
-
mdbook-checklist
An mdBook preprocessor for generating checklists and indexes
-
transition-table
transition table utilities for keyword parser
-
literumilo
A spell checker and morphological analyzer for Esperanto
-
serbzip
A quasi-lossless Balkanoidal meta-lingual compressor
-
perspicuity_formula
Calculate Flesh Reading Ease for a given text and language
-
kg-diag
Error/diagnostic management. I/O routines for reading UTF-8 textual data with position tracking.
-
transcript
A transcriber for European scripts
-
toktrie
LLM Token Trie library
-
djot
Djot parser written in pure Rust
-
icu-data
International Components for Unicode (ICU) data in Rust structures
-
goodname
assist you with cool naming of your methods and software
-
render_readme
Render Markdown or reStructuredText with syntax highlighting and image filtering similar to GitHub's
-
custom-rust-stemmers
Experimental fork of: A rust implementation of some popular snowball stemming algorithms
-
ryaspeller
lib for searching typos in text, files and websites
-
byte_string
Wrapper types for outputting byte strings (b"Hello") using the Debug ({:?}) format
-
yozuk-core-skillset
Set of default Yozuk skills
-
readput
Fast and easy stdin input parsing for competitive programming in rust
-
unicode-range
UnicodeRange is a Rust library for parsing and stringifying Unicode ranges. It provides functionality to convert a string representation of Unicode ranges into a vector of code points and vice versa.
-
mdbook-last-changed
mdbook preprocessor to add the last modification date per page
-
agentscript
A programming language for AI agents
-
twitter_text_parser
Parser for twitter-text in Rust
-
pinot
Fast, high-fidelity OpenType parser
-
arg_input
ARGF-style input handling for Rust
-
stone-mason
simplify using the Amazon Bedrock Rust SDK aws-sdk-bedrockruntime
-
trim_lines
An extremely simple and tiny library which provides an iterator over the lines of a string, trimmed of whitespace. It is a simple wrapper around the Lines iterator in std::str which trims the whitespace from each line.
-
doccy
brace based markup language
-
csv-sanity
Sanitize and transform large CSVs with millions of records quickly and efficiently
-
gdnative-doc
Documentation tool for gdnative
-
xavier
lightweight and versatile XML parsing library designed to streamline the process of handling XML data with ease and efficiency
-
gestalt_ratio
Calculate the gestalt pattern matching ratio between two strings
-
lunir
A universal intermediate representation oriented towards Lua
-
minigrep_baolhq
Just getting started with Rust, enjoying it so far 😇
-
indent_tokenizer
Generate tokens based on indentation
-
rustinsight
The launcher app for the interacive book
-
m2h
Convert Markdown to HTML with syntax highlighting
-
jput
puts and putc on unicode-width align for Rust
-
l
my personal library
-
gen-epub-book
Generate an ePub book from a simple plaintext descriptor
-
corollary
Cross-compiles Haskell into Rust
-
fontconfig-rs
Safe, higher-level wrapper around the fontconfig library
-
shoebill
A Wadler/Leijen style pretty-printer
-
ewin-com
editor for Window(GUI) users.No need to remember commands
-
rcut
replacement for GNU cut that supports UTF-8
-
moenarchbook
Creates a book from markdown files
-
vidyut-chandas
A Sanskrit metrical classifier
-
encoding-next-index-tradchinese
Index tables for traditional Chinese character encodings
-
hebrew_unicode_utils
Some functions for processing Hebrew unicode characters
-
mdtranslation
prepare multi-lingual Markdown documents
-
is_printable
Determine whether a given text-based value is printable
-
markdown-includes
Include other documents, table of content, or rust-doc in Markdown using a simple template system
-
masker
Mask patterns in data
-
mdbook-iced
An mdBook preprocessor to turn iced code blocks into interactive examples
-
charmap
one-to-(none/one/many) character mapping
-
swot
community-driven or crowdsourced library for verifying that domain names and email addresses are tied to a legitimate university of college
-
forming
lightweight architecture as code language. 架构描述语言
-
diffy-fork-filenames
Fork of https://docs.rs/diffy that allows specifiying filenames
-
tradukisto
Kinda useful natural language translation library and utility
-
ssexp
A powerful parser for s-expressions
-
binyl
A bitwise UTF-8 string inspection tool
-
lingua-chinese-language-model
The Chinese language model for Lingua, an accurate natural language detection library
-
mdbook-tectonic
An mdbook backend for generating LaTeX and PDF documents
-
docfmt
A document formatter using Handlebars templates
-
lindera-filter
Character and token filters for Lindera
-
deck
A command line tool to generate HTML presentations from Markdown documents
-
pdf_composer_definitions
PDF Composer definitions crate
-
pocky
A framework for building your own static site generator
-
mdbook-open-gh-issue
mdbook preprocessor to add a open-on-github link on every page
-
parattice
Recursive paraphrase lattice generator
-
code-splitter
Split code into semantic chunks using tree-sitter
-
jellybean
Syntax highlighting with tree-sitter. Sweet colors.
-
mdbook-unlink
A mdBook backend that validates local links
-
xhtmlchardet
Character set detection for XML and HTML
-
pattern-3
Needle API (née Pattern API 3.0), generalization of
std::str::pattern
-
mapm-cli
The command-line implementation of mapm
-
winparsingtools
collection of structs and utilities for parsing windows binary formats
-
igneous-md-viewer
The viewer component of igneous-md
-
emdb_lib
Orthographic token compression
-
tweak
when/then clauses to run
-
rust_nickname_generater
that generates user/nick names based on the rust language
-
character_frequency
counting character frequencies in a string concurrently
-
mdbook-morsels
Morsels plugin for Mdbook
-
texc-latex
Contains LaTeX templates for TeXCreate
-
encoding_c
C API for encoding_rs
-
ligotab
Format delimited data with lightweight markup
-
minigreper
Small grep style cli from the book
-
character-stream
Helper data structures for reading UTF-8 characters from a stream
-
gecliht
A disparate collection of text manipulation and formatting algorithms
-
eloran
Comics and Ebook web library written in rust, with reading, search, reading status, bookmarks
-
rew
A text processing CLI tool that rewrites FS paths according to a pattern
-
argot
Parse documentation from codebases into Markdown for easy doc creation
-
rosie-sys
build or link to librosie to access the Rosie Pattern Language
-
marker
finding issues in CommonMark documents
-
ascii_converter
converting between different ascii representations
-
ukiyoe
rendering images to the terminal
-
emoji-printer
Replace emoji shortcodes in string with emoji unicode (":sushi:" -> 🍣)
-
pomsky-macro
Macro for converting pomsky expressions to regexes
-
linetime
command line utility to add timestamps at the start of lines. The tool can either process lines from stdin or execute a command and process lines from the command's stdout and stderr.
-
hulk
An ultra simple no hassle static site generator
-
dequote
Remove nested quotes around text
-
squ
command-line utility for converting quotation marks in plaintext files to "smart quotes"
-
hunter_mygrep
A learning project to search query in files
-
hyphenator
segmenting words into syllables
-
vroom
Vim macros from the shell
-
rust_io_test
basic program for searching content in files
-
minigre_base
text file search tool
-
md-designer
A CLI tool for creating design docs in Markdown
-
paperoni
A web article downloader
-
futf
Handling fragments of UTF-8
-
minisearch
A mini search which can handle both - case sensitive or in-sensitive both
-
moscato
Outline scaler for OpenType glyphs
-
p101_enc
convert Olivetti P101 program to and from different encodings
-
rustex
auto-generated LaTeX files in Rust
-
mmseg
Chinese word segmenation algorithm MMSEG in Rust
-
rust-cheatsheet
a quick cheatsheet for rust
-
llm-tui
A Terminal User Interface (TUI) for interacting with Language Learning Models (LLM) using llm-cli
-
find_unicode
Find Unicode characters, the easy way!
-
mul
Bengali stemmer
-
unidok
A powerful, readable, easy-to-learn markup language
-
my_minigrep321
A command line tool to retrieve all lines from a file containing a given string
-
toml-test-data
TOML test cases
-
jposta
A fast and intuitive Terminal User Interface (TUI) tool for searching Japanese postal codes and addresses
-
monogrep
custom version of grep
-
deface
Lightweight markup to HTML converter
-
emojito
Find all the Emoji in a string. Supports composed emoji.
-
iwes
IWE LSP server
-
single_source
Generate code files from snippets in md tutorial files
-
file-search
File indexing and search
-
yitizi
異體字查詢 Get variant Chinese characters
-
grace-cli
CLI tool for processing files and strings
-
aprilasr-sys
Low-level FFI bindings for the april-asr C api (libaprilasr)
-
b64
Base64 encoding/decoding support. Originally from rustc-serialize.
-
kanpyo
Japanese Morphological Analyzer
-
code-to-pdf
Generates a syntax-highlighted PDF of your source code
-
auto_correct
provide auto correct suggestions. Currently supporting EN-US.
-
wcount
CLI word counting tool
-
scripter
A screenplay compiler
-
tgo
Heterogeneous data type transtion, it's safe, lightweight and fast
-
validations
arbitrary types
-
mdbook-fishextract
A mdbook preprocessor which handles mermaid graphs, offline, requires
mmdc
-
character-set
High performance
set.contains(char)
-
crustword
Crusty Crosswords
-
epubparse
Parse epub and convert to text-only Book structure
-
tectonic_xetex_format
Tectonic/XeTeX engine data structures and their expression in TeX "format" files
-
dekor
styling and character repository in Rust
-
highlight-pulldown
Process pulldown-cmark events to apply syntax highlighting to code blocks
-
encoded-words
Encoded Words for usage in MIME headers
-
ngram-search
Ngram-based indexing of strings into a binary file
-
lindera-dictionary
A morphological analysis library
-
old_icelandic_zoega
Old Icelandic dictionary for Rust. From "A Concise Dictionary of Old Icelandic" by Geir Zoëga
-
mykebab
convert snake_case strings to kebab-case
-
merge_pdf
Merge PDF files in a directory
-
find-simdoc
Time- and memory-efficient all pairs similarity searches in documents
-
lindera-py
Python binding for Lindera
-
jpreprocess-jpcommon
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
string_manip_rust
Demo of managing projects
-
beemovie-cli
Bee Movie CLI Application
-
split_exact
splitting strings into arrays of slices
-
remove-markdown-links
Turns
[example](https://example.com)
intoexample
. That’s it -
asciimath-text-renderer
Render asciimath in terminal
-
struckdown
A structured markdown / commonmark library for Rust
-
gregex-logic
Logic for the gregex crate
-
count-md
configurable command-line tool and Rust library for Unicode-aware, Markdown-aware, HTML-aware word counting in Markdown documents
-
regex_parser
This project provides a parser for standard regular expressions based on a defined grammar
-
noted2xero_cli
The commandline version of the noted to web converter
-
pygmentize
wrapper for syntax highlighting
-
goose-eggs
in writing Goose load tests
-
heckmv
A basic case-conversion renaming CLI tool
-
lingua-portuguese-language-model
The Portuguese language model for Lingua, an accurate natural language detection library
-
stylish-stringlike
API for string-like objects that have styles applied
-
nlprule-build
Build tools for a fast, low-resource Natural Language Processing and Error Correction library
-
faster-chars-count
counting length of chars faster than Chars::count()
-
minigrep_necimye
Functions required to find the lines in file that contains the query. query and file path should be entered through command line preceded by two dashes. Ex: cargo run -- body filename.txt
-
text-to-json
Convert text to json in rust
-
epub_metadata
Produce pdf and epub books from markdown source structures
-
ezemoji
Catigoryized Emoji's
-
typeline_ext_csv
csv parsing and serialization for typeline
-
darts
A double array trie, A Forward Maximum Matching Searcher
-
unicode_names
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
linkcheck
extracting and validating links
-
bqrs
apply boolean query to text
-
simplearrayhash
-
aki-gsub
substitude text command, replace via regex
-
csvsc
Build processing chains for CSV files
-
ron_to_table
pretty print RON as a table
-
bidi
Unicode Bidirectional Algorithm (UBA)
-
shelldon
your new Rust-powered buddy with GPT features!
-
yozuk-helper-english
English NLP utilities for Yozuk
-
ab-radix-trie
A compressed radix trie implementation supporting matching rules
-
dtex
Better TeX
-
br-pdf
PDF Invoice Processing
-
chars_data
Build-dependency for chars, the unicode character information CLI
-
rulet
figlet implementation
-
indoc
Indented document literals
-
cw
Count Words, a fast wc clone
-
chisel-lexers
Chisel backend lexers/scanners
-
utf8_reader
A UTF-8 reader that read UTF-8 characters from object that implement Read trait
-
mdbook-shiftinclude
mdbook preprocessor for file inclusion with shift
-
aki-stats
output the statistics of text, like a wc of linux command
-
minify-html-common
Common code and data for minify-html*
-
mdbook-asciidoc
mdBook backend for AsciiDoc generation
-
markov_strings
A simplistic Markov chain text generator
-
synterm
making beautiful REPLs and Shells with fish like as you type syntax highlighting
-
seven_seg
Seven-segment digital display for terminal
-
cozo-ce
A general-purpose, transactional, relational database that uses Datalog and focuses on graph data and algorithms
-
minigrep_david20019
Command line utility that searches for a string in files
-
rdg
Random data generator for the command line
-
tiny_pretty
Tiny implementation of Wadler-style pretty printer
-
pcre2
High level wrapper library for PCRE2
-
glyphana
Quickly find, inspect & collect unicode glyps
-
combos
Print all permutations of a word list
-
minigrep-cli
implement minimum grep cli program
-
soup
Inspired by the python library BeautifulSoup, this is a layer on top of html5ever that adds a different API for querying and manipulating HTML
-
validate_npm_package_name
validate npm package name
-
rpdf
PDF command-line utils written in Rust
-
difference
text diffing and assertion library
-
ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched
-
soundchange
implementing sound change algorithms in Rust
-
intname
Full English name for any integer of any primitive integer type
-
v_latexescape
The simd optimized LaTeX escaping code
-
naromat
Convert text to narou novel format
-
nib
static site generator
-
num2en
For converting integer and decimal numbers into English cardinal or ordinal number words
-
wxf-converter
Transform yaml, json, pkl files to wolfram
-
unic-common
UNIC — Common Utilities
-
dd
a clone of the unix coreutil dd
-
unicode-canvas
creating text base drawing
-
sauron-markdown
parsing markdown into sauron node
-
unicode_converter
CLI tool to convert data between various Unicode encodings
-
newline-converter
Newline byte converter library
-
igo-rs
Pure Rust port of the Igo, a POS(Part-Of-Speech) tagger for Japanese (日本語 形態素解析)
-
mdbook-typst-math
An mdbook preprocessor to use typst to render math
-
ldd_md_parse
markdow to html simple tool
-
git-busy
A wrapper around "git commit" that generates the commit messages for you
-
mdbook-translation
prepare multi-lingual mdBook books
-
text_distance
A collection of approximate string matching algorithms
-
arbitrator
Format text based on a set of rules and regexes
-
scrambler
command line tool to scramble letters
-
beemovie
Bee Movie crate
-
autoruby-cli
CLI to easily generate furigana for various document formats
-
mdbook-webinclude
Preprocessor for mdBook that includes content from URLs
-
twitch2csv
stream the chats of Twitch channels as a CSV
-
grammateus
facilitate working with Ancient Greek words
-
unicode-box-drawing
Unicode box-drawing characters
-
swappy
An anagram generator
-
tabwriter
Elastic tabstops
-
mojibake
Encode/Decode bytes as emoji base2048
-
rigrep
grep from Rust Book
-
tuilet
A textual user interface for Toilet, the ANSI-art text generator
-
ende
encoding/decoding unicode/utf-8/utf-16(ucs-2) code points
-
aklat
create books from markdown files (like Gitbook)
-
pdf-min
Very minimal crate for writing PDFs
-
basic-text
Basic Text strings and I/O streams
-
anagrambot
find anagrams of words
-
pdf-rename
This script reads a list of PDF files from a specified directory and renames each file based on its content. The renaming logic uses the content of the PDF to generate a more descriptive and meaningful filename.
-
lindera-unidic
A Japanese morphological dictionary for UniDic
-
assert-text
the testing macro tools
-
hashlogs
Command-line utility that hashes the part before a space on each line from stdin with blake2b keyed with an ephemeral randomly-generated key and writes to stdout
-
zuk
Yozuk command-line interface
-
minigrep_philip
A simplified version of the well-known grep command
-
regex-cli
A command line tool for debugging, ad hoc benchmarking and generating regular expressions
-
bookrafter
This repository contains code related to bookrafter rendering
-
tectonic_io_base
Basic types for Tectonic's pluggable I/O backend system
-
bos_books_codes
that handles 3-character Bible Books Codes
-
psa
PSA(Project structure analysis) is a analyzer for analysis project struct
-
bocu1
BOCU-1 compressed unicode encoding
-
hashtag-regex
regex matching hashtags accoding to the unicode spec: http://unicode.org/reports/tr31/#hashtag_identifiers
-
ru-html-extractor
A universal web page main content extractor based on line block density distribution
-
utf
UTF-8
-
wordshk_tools
A combination of parsers and other tools for words.hk (粵典)
-
webgrep
grep the web: a full-browser-spec search-focused ultra-simple way to read the web without having to leave the terminal
-
leven-distance
Compute operational differences between two sequences using the Levenshtein algorithm
-
opencc
binding for Rust
-
llmvm-core-lib
llmvm core application
-
august
& program for converting HTML to plain text
-
minigrep-extremq
Example crate from the rustbook
-
markdown-table-formatter
Markdown table formatter fully compliant with Unicode 15.1.0
-
mdbook-playscript
Preprocessor for mdBook, which styles stage play scripts
-
indexrs
inefficient multi-language search index
-
ayda
Ask your Documents Anything. A tool for querying your documents with a large language model.
-
bbx
A robust, performant BBCode pull parser
-
genere
randomization of text respecting grammatical gender of sentences
-
kradical_static
Ready-to-use EDRDG radical decompositions
-
flw
Process text via configurable tasks
-
readability-rs
Port of arc90's readability project to rust
-
ucd-parse
parsing data files in the Unicode character database
-
lingua-dutch-language-model
The Dutch language model for Lingua, an accurate natural language detection library
-
ascii_tree
generates ascii trees
-
cutters
Rule based sentence segmentation library
-
mdbook-all-the-markdowns
Render all markdown files in a given folder structure
-
md-include
include any file in markdown files
-
umlauts
text transformation of german umlauts
-
makogrep
mako 的 minigrep 示例 cli
-
bcdown
Bilibili漫画下载器,written in Rust,支持epub pdf zip格式
-
tnil
Parsing, glossing, and generating utilites for New Ithkuil
-
wfst4str
Python library based on rustfst for manipulatig strings with wFSTs
-
mdbookshelf
Create epubs from a list of mdbook repositories
-
tantivy-czech-stemmer
Czech stemmer as Tantivy tokenizer
-
ucd-trie
A trie for storing Unicode codepoint sets and maps
-
mdlynx
Small, fast utility to find broken file links in Markdown documents
-
e_book_sync_library
Synchonize e-book with your local e-library
-
const_format_proc_macros
detail of the
const_format
crate -
is_utf8
functions to determine if a sequence of bytes is valid utf-8
-
grep-reader
short text for crates.io
-
subscript-compiler
A modern LaTeX rendition
-
simplecc
Chinese Convert library (partially) compatible with OpenCC's dictionaries
-
czv
performing CSV-related operations for data engineering and analysis
-
ed_join
Implemtation of Ed-Join Algorithm for string similarity join
-
spellcheck_toy
a basic spellchecking library based on edit distance
-
khat
A cat clone, nothing more nothing less
-
bionic-ebooks
Takes an EPUB file and generate a copy with bionic like font applied
-
utf8-command
UTF-8 encoded
std::process::Command
output -
wordpieces
Split tokens into word pieces
-
minigrep_kashi754
lightweight implementation of the popular
grep
command line tool. Built as my first project, it is not meant to be used in production. -
word_filter
A Word Filter for filtering text
-
skyspell_core
skyspell core library
-
lines
Utililities for iterating readers efficiently line-by-line
-
gret
command line tool to search for patterns and show matches in a tree structure
-
utf8_slice
Lightweight UTF8 Slice Utilities
-
color-convert
Support RGB,RGBA,HEX,HSL,HSLA,HSV,CMYK to convert each other, write by rust
-
csvre
replacing data in CSV columns with regular expressions
-
ascii-rs
Process image into colored-ascii image
-
dictcc
Rust API for reading and querying the dict.cc offline translation database
-
dhoni
converting Bengali text into their phonetic counterpart
-
demoji
Remove all emojis from a string
-
base100
Encode your data into emoji
-
bytescolor
A versatile Rust library for colorizing strings and byte data in terminal applications using ANSI escape codes
-
harfbuzz-traits
Rust Traits for the HarfBuzz text shaping engine
-
wikitext-parser
Partial parser for wikitext
-
gqlog
👾 filter your json logs with graphql 👾
-
lindera-compress
A morphological analysis library
-
vl-convert-pdf
convert SVG to PDF with embedded text
-
nano_parser_gen
A parser generator inspired by yacc (types and functions)
-
spider_scraper
A css scraper using html5ever
-
replace-all
Cli to quickly replace occurences of a word in a file
-
cattocol
Combine two text into one text as columns
-
anystr
An abstraction over string encoding that supports ASCII, UTF-8, UTF-16 and UTF-32
-
pdfutil
PDF document manipulation
-
flesh-reading-ease
Calculate Flesh Reading Ease for a given text and language
-
economic_indicator_finder
A finder for extracting economic indicators from paragraphs
-
indentation
Formatter
-
fum
fum finds fuzzy matches to a literal search pattern, searching recursively through all the files in the current directory and respecting gitignore rules
-
wz-utf16
UTF-16 counters for wz
-
sudachiclone
sudachiclone-rs is a Rust version of Sudachi, a Japanese morphological analyzer
-
sprinkles
Randomly colors input text and outputs it to the terminal
-
lexmatch
lexicon matching tool that, given a lexicon of words or phrases, identifies all matches in a given target text. Uses suffix arrays.
-
chinese_segmenter
Tokenize Chinese sentences using a dictionary-driven largest first matching approach
-
in_rainbows_printer
Prints some In Rainbows-style (the Radiohead album) text to your termial
-
html_to_markdown
Convert HTML to Markdown
-
stringsort
Pathological sorting of string characters
-
mdbook-obsidian
mdBook preprocessor to render Obsidian specific syntax
-
gesha-core
Core functionality for Gesha project
-
unicount-lib
Alphabetic counter supporting unicode
-
any2utf8
Convert any enncoding to utf-8
-
iasthk
Harvard-Kyoto to IAST conversion
-
carlo-latex
A LaTeX emitter for the simple interpreted programming language Carlo
-
nib-cli
A cli for a yet another static site generator Nib
-
shallow
long text
-
rcut-lib
rcut is a Rust replacement for GNU cut that supports UTF-8
-
jpreprocess-window
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
blockcounter
Counts the blocks in a stream
-
markdown-heading-id
Filter for
pulldown-cmark
which converts headings with custom ID -
case_convert
Converts the first letter of a Rust String to uppercase
-
whitespace
Encode arbitrary data whitespaces and vice versa
-
fmtm_ytmimi_markdown_fmt
Fork of @ytmimi's Markdown formatter; powers FMTM
-
minigrep_dungtl2003
very small project for rust that can find lines you want in a file
-
tokengeex
efficient tokenizer for code based on UnigramLM and TokenMonster
-
top-english-words
retrieve top words from the English language
-
bibliofile
A TUI epub reader inspired by DOS-era programs
-
jg
Jeff Goldblum (jg) is a command-line JSON processor. jg searches for structural patterns in json input and prints each json object that matches the pattern.
-
hline
a grep-like tool that highlights lines in files
-
veryfi
Module for communicating with the Veryfi OCR API
-
charjpoet
Charj Poet is a API for write to
.cj
language -
milligrep
Custom simplified implementation of grep
-
txttyp
Formatted string typewriter
-
mupdf-sys
Rust FFI binding to MuPDF
-
mdbook-compress
Compress an mdBook project into a single PDF file
-
mdoc
Modern PDF creation through Markdown and LaTeX
-
iconv-compat-win-sys
iconv bindings for Rust
-
pest_ascii_tree
Helper crates converting the parsing result of any pest grammar into an ascii tree
-
pra
Print Random ASCII
-
minigrep_ao
Learning Rust
-
spongedown
Converts markdown to html with svgbob support
-
zw
encoding and decoding text using zero-width characters
-
ngrams
Generate n-grams from sequences
-
textos
Texts, strings, formatting, unicode…
-
goya
morphological analyzer for Rust and WebAssembly
-
org-rust-parser
parser for org mode documents
-
braille_pics
producing text-art pictures using Braille characters
-
wkhtmltopdf
High-level bindings to wkhtmltopdf
-
latex-to-html
Latex to html converter
-
kanjidic_converter
A program to convert from the Kanjidic XML format to a JSON format
-
encoding_c_mem
C API for encoding_rs::mem
-
bbd-lib
Binary Braille Dump
-
jellybean-pack-2
Sweet syntax highlighting with tree-sitter
-
stamd
Webservice for working with stand-off annotations on text (STAM)
-
cang-jie
A Chinese tokenizer for tantivy
-
textocx
Tex code to Office MathML
-
yozuk-sdk
Types used in the Yozuk ecosystem
-
ankiding
Creating Anki-Flashcards within Markdown!
-
mdbook-numeq
An mdbook preprocessor for automatically numbering centered equations
-
allsorts_no_std
Font parser, shaping engine, and subsetter for OpenType, WOFF, and WOFF2
-
lindera-cc-cedict
A Japanese morphological dictionary for CC-CEDICT
-
base_u256
base-u256 is to utf-8 as base-64 is to ascii
-
wordbreaker
A Unicode-aware no_std crate (requires alloc) that rapidly finds all sequences of dictionary words that concatenate to a given string
-
convert_encoding
Convert encoding of text files in batch
-
mdtable-cli
that makes creating tables in markdown much easier!
-
varcon-core
Varcon-relevant data structures
-
fst-subseq-ascii-caseless
An automaton that matches if the input contains a specific subsequence ignoring ASCII case to be used with fst
-
tb_normalization
normalization utf8 string, loc dau vietnamese and some language
-
typeline_ext_sqlite
sqlite integration for typeline
-
pulldown-cmark-fork
A pull parser for CommonMark
-
asciir
Print ASCII table/values
-
const-utf16
Utf8 to utf16 conversion functions for use in const contexts
-
lindera-sqlite
Lindera tokenizer for SQLite FTS5 extention
-
yhy-email-encoding
Low level email encoding RFCs implementations
-
llmvm-outsource-lib
outsource backend for llmvm
-
quill_delta_pdf
Convert Quill Delta to PDF
-
lines_lossy
extension to
BufRead
with a functionlines_lossy
that works likeBufRead::lines
but with lossy UTF-8 decoding -
lithe-cli
A cli of lithe
-
tadm
A collection of algorithms and data structures wrote out while reading The Algorithm Design Manual book
-
unic-idna-mapping
UNIC — IDNA — IDNA Mapping Table
-
latex
An ergonomic library for programatically generating LaTeX documents and reports
-
aki-txpr-macro
the more easy to use libaki-*
-
xgrepx
xgrep is a rust implementation of grep. This is a follow up from the rust book
-
uwu_cli
uwuifying the terminal
-
asciicast
file format used by Asciinema
-
unicode-character-database
Unicode character database tables (Unicode Standard Annex #44) generated using
ucd-generate
-
fountain-parser-rs
parse Fountain-formatted plain text files
-
untex
Understand and manipulate TeX files with ease
-
wordnet
Read a wordnet dictionary in Rust
-
rusty_code_code_for_book
my book_rusty code
-
snake_case_converter
convert strings to snake case
-
cjieba-sys
unsafe ffi to cppjieba
-
mdbook-chapter-number
A mdBook preprocessor that adds chapter numbers to the each page header
-
rust-cedar
efficiently-updatable double-array trie in Rust (ported from cedar)
-
yeslogic-unicode-blocks
Functions to access and search Unicode blocks
-
strip-tags
Strip HTML and PHP tags from strings
-
ced
Dead easy csv editor
-
rustyword
An anagram finder
-
lindera-ipadic-neologd
A Japanese morphological dictionary for IPADIC NEologd
-
sauron-md
parsing markdown into sauron node
-
fbihtax
CLI tool to help manage tax payments in FBiH (Bosnia and Herzegovina Federation)
-
char_trie
Text segmentation based on trie tree, High performance, support for custom dictionary
-
crypto-invert
Unicode Upside-Down Mapping
-
volt_parse
The advanced, slightly different take on the parser combinator concept
-
hoedown
bindings for the Hoedown markdown processor
-
anagram
A collection of anagram utility functions
-
sitdown
Static site generator
-
math-text-transform
Transform greek letters, latin letters, or decimal digits into certain variants from the mathematical alphanumeric symbols Unicode block (U+1D400–U+1D7FF). For example to bold, italic, script or double-struck.
-
kytea-tokenizer
Wrapper of tokenization by KyTea
-
simple-word-count
word count function, try to get same result with Microsoft Office Word application
-
kanpyo-dict
Dictionary Library for Kanpyo
-
minigrep_mxcln
command line tool to search for a string in a file
-
strizer
minimal and fast library for text tokenization
-
ttf_word_wrap
Wraps text based on character width
-
tectonic_bridge_harfbuzz
Expose the Harfbuzz C/C++ APIs to Rust/Cargo
-
emojicons
Parse :emoji: notation to unicode representation
-
cautious-octo-funicular
Test: shipping an mdbook with API docs
-
lingua-italian-language-model
The Italian language model for Lingua, an accurate natural language detection library
-
word_iter
Iterator over all words in a string
-
textr
TeX-inspired plug-n-play interface for converting JSON documents into PDFs
-
lindera-ipadic
A Japanese morphological dictionary for IPADIC
-
charwise
This lightweight, dependency-free rust library provides a convenient way to read characters from different resources
-
md_parser_wasm
A markdown parser written in Rust and compiled to WebAssembly
-
markdown2unicode
Converter from markdown notation to unicode characters
-
unicode_skeleton
detects unicode strings that look nearly identical once rendered, but do not compare as equal. It defines "confusable" and "skeleton" based on Unicode Standard Annex #39
-
cmark2tex
A small utility to convert markdown files to pdf exploiting tectonic
-
CorrosionMark
markdown parser libary
-
token-read
reading whitespace delimited files intended for competitive programming
-
quoted-string-parser
Quoted string parser for grammar defined in RFC3261
-
mdbook-files
Preprocessor for mdbook which renders files from a directory as an interactive widget
-
books_description_parser
A Rust-based parser to extract book details from structured markdown-like text and output them in formats like JSON or Rust structs for further processing
-
rustextile
Textile markup language parser for Rust
-
quick_io
facilitate input and output within programs, with a set of macros
-
df_cp437
Decoder for CP437 to UTF-8
-
makudaun
Markdown renderer tool made on Rust
-
unidecode
pure ASCII transliterations of Unicode strings
-
kilo
small, fast utility crate/library for manipulating strings and generating sourcemaps with all in Magic 🪄
-
lix-score
Calculate LIX score for a given text and language
-
backslash
parsing escape characters
-
minigrep_sopesto
minigrip es una aplicación hecha siguiendo la guía del libro *The Rust Programming Lenguage*. La misma busca recrear de forma minimalista la aplicación grep.
-
decline-word
Choose word form based on given number
-
html2runes
An HTML to Text converter
-
mime-rs
A text processing framework, inspired by Emacs lisp and keyboard macros
-
txtframe
Creates a frame for text
-
iterlower
Final-sigma-correct lowercasing iterator adapter with option for Turkish/Azeri I behavior
-
yozuk-model
NLP model generator for Yozuk
-
mdbook-reference-table
mdBook preprocessor to create reference tables
-
csv_coincidence
Tool designed to efficiently search for and identify specific patterns within CSV files
-
text-tables
A terminal/text table prettifier with no dependencies
-
mdbook-extended-markdown-table
Preprocessor for mdBook that generates tables with merged cells from ASCII text
-
stfu
Shut The Ferris Up - profanity filtering for Rust
-
twemoji-rs
A word-cloud image generation crate
-
morc
Dead simple, minimal markdown generator library written in Rust
-
docstring
manipulating and parsing documentation strings
-
aki-json-pick
The json pick out command
-
poetry-book
Create a poetry book in latex, starting from plain text
-
chisel-parsers
Chisel parser front ends
-
blingfire
Wrapper for the BlingFire tokenization library
-
terminal-emoji
safely displaying emoji inside of terminals
-
presciidoc
Preprocessing AsciiDoc for other tools
-
unicode-utf8
that converts utf-8 bytes to a unicode scalar value, and vice versa
-
text_to_emoji
Convert text to emoji
-
pdf_encoding
Font related encodings
-
xsystem
Conversion between the Esperanto x-system and Unicode circumflexes
-
re2
Wrapper for the re2 C++ regex library
-
openlibrary-rs
A wrapper around openlibrary's Web API
-
norm-email
strip email provider defined behaviour from email addresses
-
rough
A very simple and opinionated static site generator
-
quartz_commands
Generates a parser at compile-time for handling commands similar in structure to those of Minecraft
-
alpino-tokenize
Wrapper around the Alpino tokenizer for Dutch
-
cmdcjones_minigrep
A minimal grep clone from the Rust Book
-
paxcii
Transform images and videos to ascii
-
mdbook-trace
A traceable document preprocessor for mdbook
-
chapter-8-exercises
Exercises from the 8th chapter of the book
-
rckive-genpdf
User-friendly PDF generator written in pure Rust
-
jpreprocess-dictionary
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
johalun/module
FreeBSD kernel module in Rust
-
jiang_mini_grep
minigrep 查询文件的某个字符
-
readable-readability
Really fast readability
-
catmark
Console printer for CommonMark
-
ssml-parser
parsing speech sythnesis markup language
-
suffix
arrays
-
pdf_composer_base
PDF Composer base functionality crate
-
xsv
A high performance CSV command line toolkit
-
encoding-next-types
Traits and types for the encoding package
-
md-to-html
CLI tool to convert Markdown files to HTML
-
hsk
Return HSK Level for Simplified Chinese Characters
-
tiniestsegmenter
Compact Japanese segmenter
-
mdbook-mathpunc
An mdbook preprocessor that prevents line breaks between inline math blocks and punctuation marks when using katex
-
asciifolding
ascii folding library
-
vaporetto_tantivy
Vaporetto Tokenizer for Tantivy
-
pdf_form
programatically filling out pdf forms
-
with-str-bytes
Safely manipulate the bytes of a UTF-8 string
-
morsels_lang_ascii
Basic ascii tokenizer for morsels
-
bytepiece_rs
The Bytepiece Tokenizer Implemented in Rust
-
uniaxe
replace Unicode letters with Ascii equivalents
-
text_converter
A trait that helps with manipulating text
-
basic_lexer
Basic lexical analyzer for parsing and compiling
-
tfidf-summarizer
Basic tf-idf compute for documents
-
bgrp
A very simple minigrep in terminal
-
overlap
shows overlap text in files
-
jellybean-pack-1
Sweet syntax highlighting with tree-sitter
-
minigrep-bahadir
A fun project to learn the great language Rust
-
maybe_utf8
Byte container optionally encoded as UTF-8
-
termbook
behind the
termbook-cli
-
mdbook-post
A CLI for add post to mdbook
-
ewts-c
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (c lib)
-
ergrep
grep strings within a line from a text file
-
fnew
A Unicode-aware line-oriented drop-in replacement for coreutils' fold
-
bgrep
grep tailored to handle binary patterns and files
-
minigrep-yogie
A demo Rust to grep some word from rust-lang.com
-
password-characters
help with the "enter the 12th, 35th, and 63rd characters from your password" situations
-
static_format
Format strings with no runtime overhead
-
rusty_word_builder
Syllable and Word generation library written fully in Rust
-
unicode_font
Convert unicode characters between fonts
-
lingua-slovak-language-model
The Slovak language model for Lingua, an accurate natural language detection library
-
spyglass
Search engine for documents, inspired by bioinformatics
-
mdtranslation-cli
Command-line tools for using mdTranslation, which can be used to prepare multi-lingual Markdown documents
-
caseformat
Power flow case data format
-
text-sanitizer
convert text to plain ASCII text
-
bookbinder_latex
Produce latex and pdf books
-
pix-brcode
A ready to use compliant PIX specification, featuring fast de/serialization
-
rustrawi
Rust port of the original PHP Sastrawi
-
vaporetto_rules
Rule-base filters for Vaporetto
-
grep-clone
A mini grep clone from the Rust-lang official tutorial
-
utf8reader
wrapper around Reader that returns a stream of UTF-8 characters
-
gulpeaseindex
Calculate Gulpease index for a given text and language
-
STKLR
STKLR: pronounced 'stickler'. Is a cli tool to automatically link functions, enums, structs, traits etc in rust-doc docstrings. I couldn't find a tool like this when I needed it so... here we are.
-
rsonpath-test-codegen
Blazing fast JSONPath query engine powered by SIMD. TOML-based test codegen for rsonpath-lib.
-
codepage
Mapping between Windows code page numbers and encoding_rs character encodings
-
rs_handstrength
relative to board omaha hand strength calculator and equity on flop
-
minigrep_flict
Simplest text-in-file search engine from rust book
-
minigrep_iaziz786
grep
-
llmvm-chat
An llmvm frontend that acts as a CLI chat interface
-
japanese-ruby-filter
Japanese ruby notation parser
-
emoji_converter
Converts text to emojis
-
font-map-core
Core font-parsing capabilities for font-map
-
ascii-alphabetic-char
Traits for ASCII alphabetic characters
-
corpus-preproc
A preprocessor for text and HTML corpora
-
xmldecl
Extracts an encoding from an ASCII-based bogo-XML declaration in text/html in a Web-compatible way
-
lindera-assets
A helper crate to fetch assets and build dictionary for lindera
-
mnumonic
A tiny library to convert opaque binary data to and from a human-memorable phrase
-
align_text
Aligns lines in a block of text within a number of columns
-
encoding
Character encoding support for Rust
-
mathml-latex
Convert between MathML and LaTeX
-
yeslogic-fontconfig
RENAMED: use the fontconfig crate instead
-
jieba-macros
jieba-rs proc-macro
-
latin1str
Windows-1252 string types
-
tex
The νTeX typesetting engine
-
trexter
Text progression tracking library
-
askama-filters
Extra template filters for Askama
-
ascii-read
BufRead
-like methods for reading into anAsciiString
-
html-to-pulldown-cmark-events
Parse HTML to pulldown-cmark's events
-
json-peek
Amature JSON parser library designed for my specific need
-
tectonic_engine_xetex
The XeTeX engine as a reusable crate
-
token-counter
wc
for tokens: count tokens in files with HF Tokenizers -
conveyance
A stop-gap CLI for conveyancing
-
rustascii
Display Rust in ASCII
-
dom-content-extraction
Content extraction via text density paper
-
gbx
GBX (Grundbuch-Exchange) Dateiformat
-
typeline_ext_http
http(s) tooling for typeline
-
pdftotext
High-level library that binds to Poppler to extract text from a PDF
-
lindera-dictionary-builder
Shared code for building Lindera dictionary files
-
naveengrep
command line tool similar to the grep
-
brack-tokenizer
The tokenizer for the Brack programming language
-
h_hangul
Korean Characters
-
mdify
A CLI tool that translates md files to html while keeping project structure
-
rs_html_parser_tokenizer
Rs Html Parser Tokenizer
-
minigrep_desonglll
grep implementation from The Rust Programing Book
-
pdf_forms
programatically filling out pdf forms
-
borderrs
Add stylish borders around your text and datastructures
-
ascii-to-hex
A small, simple library to converting an ASCII text string into its hexadecimal equivalent
-
slicer
that slices string slices into smaller string slices
-
jellybean-pack-0
Sweet syntax highlighting with tree-sitter
-
tpng
A small tool that prints truecolor png renderings to the terminal using unicode block characters
-
genpdfi
User-friendly PDF generator written in pure Rust
-
my_project_parser_super_puper
A brief description
-
folia
High-performance library for handling the FoLiA XML format (Format for Linguistic Annotation)
-
grep-searcher
Fast line oriented regex searching as a library
-
ruby-parser
A parser for the Ruby language
-
fontfor
find fonts which can show a specified character and preview them in terminal or browser
-
my_parser_kma_test_group_3_1
A brief description
-
grep-table-converter
A cli utility to convert grep result to table (csv, markdown, textile)
-
tabled
An easy to use library for pretty print tables of Rust
struct
s andenum
s -
jpreprocess-dictionary-builder
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
findtext_pdf
Search text in PDF
-
lindera-wasm
A morphological analysis library for WebAssembly
-
wz-conf
Configuration options for wz
-
dvi2html
converter
-
ascii_utils
handle ASCII characters
-
unic-ucd-hangul
UNIC — Unicode Character Database — Hangul Syllable Composition & Decomposition
-
alphabet-encoder
A quick and dirty way to deal with escape characters
-
unicode_clusters
variable width unicode characters as single items, allowing for array like indexing etc
-
shift_or_euc
Detects among the Japanese legacy encodings
-
ascii-engine
A fast ascii engine written in rust!
-
domrs
Document builder and serializer
-
escaped-delimiter
Iterator of delimited slices with escape characters
-
code-span
Add additional infomation to code character
-
names-changer
Convert a names of sql schemes from camelcase to snake case
-
my_parser_kma_test_group_3_khrystyna
A brief description