-
regex
regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
unicode-width
Determine displayed width of
charandstrtypes according to Unicode Standard Annex #11 rules -
textwrap
word wrapping, indenting, and dedenting strings. Has optional support for Unicode and emojis as well as machine hyphenation.
-
encoding_rs
A Gecko-oriented implementation of the Encoding Standard
-
tokenizers
today's most used tokenizers, with a focus on performances and versatility
-
fancy-regex
regexes, supporting a relatively rich set of features, including backreferences and look-around
-
similar
A diff library for Rust
-
ropey
A fast and robust text rope for Rust
-
const_format
Compile-time string formatting
-
convert_case
Convert strings into any case
-
tabled
An easy to use library for pretty print tables of Rust
structs andenums -
heck
case conversion library
-
unicode-normalization
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
pulldown-cmark
A pull parser for CommonMark
-
deunicode
Convert Unicode strings to pure ASCII by intelligently transliterating them. Suppors Emoji and Chinese.
-
lopdf
PDF document manipulation
-
unicode-segmentation
Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules
-
rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
mime_guess
detection of a file's MIME type by its extension
-
regress
A regular expression engine targeting EcmaScript syntax
-
widestring
wide string Rust library for converting to and from wide strings, such as those often used in Windows API or other FFI libaries. Both
u16andu32string types are provided, including support for UTF-16 and UTF-32… -
mdbook
Creates a book from markdown files
-
prometheus-client
Open Metrics client library allowing users to natively instrument applications
-
lazy-regex
lazy static regular expressions checked at compile time
-
fuzzy-matcher
Fuzzy Matching Library
-
unicase
A case-insensitive wrapper around strings
-
indenter
A formatter wrapper that indents the text, designed for error display impls
-
onig
Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.
-
nucleo-matcher
plug and play high performance fuzzy matcher
-
unicode-general-category
Fast lookup of the Unicode General Category property for char
-
strip-ansi-escapes
Strip ANSI escape sequences from byte streams
-
html2text
Render HTML as plain text
-
const-str
compile-time string operations
-
lindera
A morphological analysis library
-
lngcnv
linguistics: display pronunciation, translate between dialects, convert between orthographies; support for multiple languages: English, Latin, Polish, Quechua, Spanish, Tikuna
-
grok
popular Java & Ruby grok library which allows easy text and log file processing with composable patterns
-
bat
A cat(1) clone with wings
-
termimad
Markdown Renderer for the Terminal
-
text-splitter
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
-
emojis
✨ Lookup emoji in *O(1)* time, access metadata and GitHub shortcodes, iterate over all emoji, and more!
-
printpdf
reading and writing PDF files
-
titlecase
Capitalize text according to a style defined by John Gruber for Daring Fireball
-
selectors
CSS Selectors matching for Rust
-
stringprep
algorithm
-
diff
An LCS based slice and string diffing implementation
-
unicode-bidi
Unicode Bidirectional Algorithm
-
finl_unicode
handling Unicode functionality for finl (categories and grapheme segmentation)
-
unicode-xid
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
charabia
detect the language, tokenize the text and normalize the tokens
-
pulldown-cmark-to-cmark
Convert pulldown-cmark Events back to the string they were parsed from
-
Inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
html-to-markdown-cli
Command-line interface for html-to-markdown - high-performance HTML to Markdown converter
-
chardetng
A character encoding detector for legacy Web content
-
synoptic
low-level, syntax highlighting library with unicode support
-
unescaper
Unescape strings with escape sequences written out as literal characters
-
typstyle
The CLI for Typstyle
-
rphonetic
Rust port of phonetic Apache commons-codec algorithms
-
autocorrect
A linter and formatter for help you improve copywriting, to correct spaces, words, punctuations between CJK (Chinese, Japanese, Korean)
-
unicode-script
exposes the Unicode
ScriptandScript_Extensionproperties from UAX #24 -
line-index
Maps flat
TextSizeoffsets to/from(line, column)representation -
rxing
port of the zxing barcode library
-
os_display
Display strings in a safe platform-appropriate way
-
lipsum
lorem ipsum text generation library. It generates pseudo-random Latin text. Use this if you need filler or dummy text for your application. The text is generated using a simple Markov chain…
-
stringzilla
Search, hash, sort, fingerprint, and fuzzy-match strings faster via SWAR, SIMD, and GPGPU
-
unicode_names2
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
cow-utils
Copy-on-write string utilities for Rust
-
htmd
A turndown.js inspired HTML to Markdown converter
-
diffy
Tools for finding and manipulating differences between files
-
heatseeker
A fast, robust, and portable fuzzy finder
-
regex-syntax
A regular expression parser
-
jieba-rs
The Jieba Chinese Word Segmentation Implemented in Rust
-
pdf-extract
extract content from pdfs
-
regex-cursor
regex fork that can search discontiguous haystacks
-
zawk
An efficient Awk-like language implementation by Rust with stdlib
-
harfrust
A complete HarfBuzz shaping algorithm port to Rust
-
any_ascii
Unicode to ASCII transliteration
-
roff
ROFF (man page format) generation library
-
edit-distance
Levenshtein edit distance between strings, a measure for similarity
-
stfu8
Sorta Text Format in UTF-8
-
omekasy
Decorate alphanumeric characters in your input with various font; special characters in Unicode
-
spellbook
A spellchecking library compatible with Hunspell dictionaries
-
unicode-joining-type
Fast lookup of the Unicode Joining Type and Joining Group properties
-
igrepper
The interactive grepper
-
rumdl
A fast Markdown linter written in Rust (Ru(st) MarkDown Linter)
-
decancer
that removes common unicode confusables/homoglyphs from strings
-
cruet
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
textsurf
Webservice for efficiently serving multiple plain text documents or excerpts thereof (by unicode character offset), without everything into memory
-
prettydiff
Side-by-side diff for two files
-
crop
A pretty fast text rope
-
uncased
Case-preserving, ASCII case-insensitive, no_std string types
-
html2md
binary to convert simple html documents into markdown
-
mdbook-katex
mdBook preprocessor rendering LaTeX equations to HTML
-
icu_pattern
ICU pattern utilities
-
mdbook-admonish
A preprocessor for mdbook to add Material Design admonishments
-
mdbook-pdf
A backend for mdBook written in Rust for generating PDF based on headless chrome and Chrome DevTools Protocol
-
ascii
ASCII-only equivalents to
char,strandString -
str_indices
Count and convert between indexing schemes on string slices
-
textdistance
Lots of algorithms to compare how similar two sequences are
-
stringcase
Converts string cases between camelCase, COBOL-CASE, kebab-case, and so on
-
mad-useful
A multi-tool utility for file operations and text processing
-
font-types
Scalar types used in fonts
-
entities
raw data needed to convert to and from HTML entities
-
mdbook-pandoc
A pandoc-powered mdbook backend
-
hypher
separates words into syllables
-
treegrep
regex pattern matcher that displays results in a tree structure with an interface to jump to matched text
-
diff-match-patch-rs
The fastest implementation of Myer's diff algorithm to perform the operations required for synchronizing plain text
-
hgrep
grep tool with human-friendly search output. This is similar to
-Coption ofgrepcommand, but its output is enhanced with syntax highlighting focusing on human readable outputs. -
stop-words
Common stop words in many languages
-
htop
HTML to PDF converter
-
ferris-says
flavored replacement for the classic cowsay
-
sdml-cli
Rust CLI for Simple Domain Modeling Language (SDML)
-
readable-name-generator
Generate a readable name for throwaway infrastructure
-
emojic
Emoji constants
-
bashtestmd
Compiles shell commands in .md files into Bash scripts for testing
-
sliceslice
A fast implementation of single-pattern substring search using SIMD acceleration
-
wana_kana
checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji
-
unicode-blocks
contains a list of all unicode blocks and provides some functions to search across them
-
smartcat
Putting a brain behind
cat. CLI interface to bring language models in the Unix ecosystem 🐈⬛ -
unicode-reverse
Unicode-aware in-place string reversal
-
chewing
(酷音) intelligent Zhuyin input method
-
sentencepiece
Binding for the sentencepiece tokenizer
-
languagetool-rust
LanguageTool API bindings in Rust
-
markon
A lightweight Markdown renderer with GitHub styling and Medium-like annotations
-
askalono
detect the contents of license files
-
iepub
epub、mobi电子书读写
-
lsp-textdocument
A LSP text documents manager that map of text document
-
dwrote
Lightweight binding to DirectWrite
-
uwc
Counts things in unicode text files
-
bulletty
a pretty TUI feed reader (RSS+ATOM) that stores articles locally as Markdown files
-
text_io
really simple to use panicking input functions
-
prema
convert markdown to html
-
tiefdownconverter
A CLI tool to manage and convert Markdown-based projects
-
lindera-tantivy
Lindera Tokenizer for Tantivy
-
mdbook-yapp
mdBook preprocessor for simple text replacements
-
cargo-spellcheck
Checks all doc comments for spelling mistakes
-
norad
Read and write Unified Font Object files
-
blocklet
A cross-platform CLI tool that generates ASCII art using Unicode block characters, similar to figlet but with beautiful solid Unicode blocks instead of outlines or hash symbols
-
focaccia
no_std implementation of Unicode case folding comparisons
-
in_definite
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
-
markdown_timesheet
processing markdown files to extract and format timesheet data
-
xan
The CSV magician
-
matchers
Regex matching on character and byte streams
-
mdbook-preprocessor-graphql-playground
A preprocessor for mdbook to add GraphQL playgrounds
-
awful_news_vibes
Daily news meta-analysis pipeline with AI-powered clustering and D3 visualizations
-
termdiff
Write a diff with color codes to a string
-
mime-infer
detection of a file's MIME type by its extension
-
unindent
Remove a column of leading whitespace from a string
-
wildcard
matching
-
lgtmeow
🐾 —— 「本喵觉得很不错~」
-
mlc
The markup link checker (mlc) checks for broken links in markup files
-
savvy
R extension interface
-
ncount
A word count tool intended to derive useful stats from markdown
-
epub-builder
generating EPUB files
-
autumnus
Syntax highlighter powered by Tree-sitter and Neovim themes
-
line-ending
Detect, normalize, and convert line endings across platforms, including support for character streams. Ensures consistent handling of LF, CRLF, and CR line endings in text processing.
-
hyperlink
Very fast link checker for CI
-
uncomment
A CLI tool to remove comments from code using tree-sitter for accurate parsing
-
commitbot
A CLI assistant that generates commit and PR messages from your diffs using LLMs
-
wit_owo
interacting with the Wit.ai API
-
hyphenation
Knuth-Liang hyphenation for a variety of languages
-
unicode-id
Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31
-
nu_plugin_regex
nu plugin to search text with regex
-
arrow-string
String kernels for arrow arrays
-
qpdf
Rust bindings to QPDF C++ library
-
chat-gpt-lib-rs
interacting with OpenAI's ChatGPT API, providing a simple interface to make API requests and handle responses
-
markdown-tool
A CLI utility for converting Markdown into AST and vice versa
-
awabi
A morphological analyzer using mecab dictionary
-
usage-lib
working with usage specs
-
srgn
A grep-like tool which understands source code syntax and allows for manipulation in addition to search
-
pdflens-mcp
An MCP server for reading PDFs, coded by human, designed for AI
-
hck
A sharp cut(1) clone
-
codepage
Mapping between Windows code page numbers and encoding_rs character encodings
-
llmvm-core
The core application for llmvm
-
difflib
Port of Python's difflib library to Rust
-
dom-content-extraction
Content extraction via text density paper
-
mdbook-svgbob
SvgBob mdbook preprocessor which swaps code-blocks with neat SVG
-
todo_lib
Collection of utilities for todo.txt format
-
unicode-truncate
Unicode-aware algorithm to pad or truncate
strin terms of displayed width -
vaporetto
pointwise prediction based tokenizer
-
near-facsimile
Find similar or identical text files in a directory
-
tossicat
입력된 단어에 맞게 같이 입력된 토시(조사)를 적절하게 변환하는 라이브러리
-
stylin
Convert markdown to pandoc markdown with custom styles
-
awful_dataset_builder
Build LLM-ready Q/A datasets from reference text-to-question mappings produced by Awful Knowledge Synthesizer
-
decasify
A CLI utility and library to cast strings to title-case according to locale specific style guides including Turkish support
-
orgflow-tui
A terminal user interface for orgflow - manage notes and tasks with a smooth workflow
-
sanitizer
A collection of methods and macros to sanitize struct fields
-
unicode-casing
Titlecase helper function on characters
-
mintyml
A minialist alternative syntax to HTML
-
gpui-component
UI components for building fantastic desktop application by using GPUI
-
general-sam
A general suffix automaton implementation in Rust
-
rschess
chess library with the aim to be as feature-rich as possible
-
quixote
Quizzes and tests in Markdown
-
md-tui
A terminal markdown viewer
-
presenterm
A terminal slideshow presentation tool
-
ascii-canvas
canvas for drawing lines and styled text and emitting to the terminal
-
deno_media_type
Media type used in Deno
-
bfom
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
unicode-case-mapping
Fast lowercase, uppercase, and titlecase mapping for characters
-
sentencex
Sentence segmentation library with wide language support optimized for speed and utility
-
matcher_rs
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
mdbook-kroki-preprocessor
render kroki diagrams from files or code blocks in mdbook
-
mime_guess2
detection of a file's MIME type by its extension
-
wordcut-engine
Word segmentation/breaking library
-
patchkit
parsing and manipulating patch files
-
reword
some utility functions for human-readable formatting of words
-
shaperglot
Test font files for OpenType language support
-
seeyou-cub
reading and writing the SeeYou CUB binary file format, which stores airspace data for flight navigation software
-
mdbook-plantuml
A preprocessor for mdbook which will convert plantuml code blocks into inline SVG diagrams
-
graphannis
new backend implementation of the ANNIS linguistic search and visualization system
-
mdcat
cat for markdown: Show markdown documents in terminals
-
type-safe-id
A type-safe, K-sortable, globally unique identifier
-
mdbook-epub
An EPUB renderer for mdbook
-
arf-strings
Encoding and decoding for ARF strings
-
fuzzt
Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.
-
tauri-plugin-clipboard
A clipboard plugin for Tauri that supports text, html, rtf, files and image, as well as clipboard update listening
-
rdfless
A colorful pretty printer for RDF (Turtle/TriG/N-Triples/N-Quads) with ANSI colors
-
txt_to_md
Command converting from a txt file to a markdown file
-
simsearch
lightweight fuzzy search engine that works in memory, searching for similar strings (a pun here)
-
dptran
run DeepL translations on command line written by Rust
-
regex-literal
delimited regular expression literals
-
blockwatch
Language agnostic linter that keeps your code and documentation in sync and valid
-
fontconfig
Safe, higher-level wrapper around the Fontconfig library
-
rapidfuzz
rapid fuzzy string matching library
-
indefinite
Prefix a noun with an indefinite article - a or an - based on whether it begins with a vowel
-
unicode_categories
Query Unicode category membership for chars
-
linkify
Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.
-
ssg08
static site generator
-
icy_sauce
handling SAUCE – Standard Architecture for Universal Comment Extensions
-
bloodtree
A hierarchical note-taking system with focus on relationships between nodes
-
mdbook-combiner
combine mdbook summaries from multiple source into one mdbook
-
bear-query
A read-only Rust library for querying the Bear note-taking app's SQLite database with minimal interference
-
typst-kit
Common utilities for Typst tooling
-
inlyne
Introducing Inlyne, a GPU powered yet browserless tool to help you quickly view markdown files in the blink of an eye
-
zettel-cli
cli app for Luhmann-style Zettelkasten management
-
boreal
evaluate YARA rules, used to scan bytes for textual and binary pattern
-
json-escape
A no_std, zero-copy, allocation-free library for streaming JSON string escaping and unescaping. Ergonomic, fast, RFC 8259 compliant, with layered APIs for iterators, I/O streaming, and low-level tokens.
-
lindera-ko-dic-builder
A Korean morphological dictionary builder for ko-dic
-
svgdx-pandoc
pandoc filter for svgdx codeblocks in Markdown
-
inflection-rs
Inflection is a string transformation library. It singularizes and pluralizes English words, and transforms strings from CamelCase to underscored string.
-
blurhash
A pure Rust implementation of blurhash
-
treemd
A markdown navigator with tree-based structural navigation and syntax highlighting
-
makepad-widgets
Makepad widgets
-
buup
Core transformation library with zero dependencies
-
textra
type less, write more
-
levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
harper-core
The language checker for developers
-
vibequest
A vibe-coded scripting language focused on developer happiness with a REPL implementation in Rust
-
etradeTaxReturnHelper
Parses etrade and revolut financial documents for transaction details (income, tax paid, cost basis) and compute total income and total tax paid according to chosen tax residency (currency)
-
apisnip
A terminal user interface (TUI) tool for trimming OpenAPI specifications down to size ✂️
-
word-tally
Output a tally of the number of times unique words appear in source input
-
obsidian-logging
A journaling/logging CLI that stores logs in Obsidian markdown files
-
slice-command
slice is a command-line tool that allows you to slice the contents of a file using syntax similar to Python's slice notation
-
unicode-ccc
Unicode Canonical Combining Class detection
-
ttypr
terminal typing practice
-
mdbook-typst
An mdBook backend to output Typst markup, pdf, png, or svg
-
unreal-doc
generating documentation from Unreal C++ sources
-
emoji-remover
A fast command-line tool to remove emojis from source code files
-
str-utils
some traits to extend
[u8],strandCow<str> -
mdless
A terminal-based markdown file viewer
-
tantivy-common
common traits and utility functions used by multiple tantivy subcrates
-
moonwave
generating documentation from comments in Lua source code
-
esed
Easy sed
-
idna
IDNA (Internationalizing Domain Names in Applications) and Punycode
-
mdbook-mermaid
mdbook preprocessor to add mermaid support
-
sd
An intuitive find & replace CLI
-
simple-string-patterns
Makes it easier to match, split and extract strings in Rust without regular expressions. The parallel string-patterns crate provides extensions to work with regular expressions via the Regex library
-
slugomatic
🐌 A simple CLI tool to slugify and unslugify text, perfect for branch names and URLs
-
at-commands
AT Commands builder and parser for Rust #![no_std]
-
ristretto_classfile
reading, writing and verifying Java classfiles
-
rhai-autodocs
Custom documentation generator for the Rhai scripting language
-
tabprinter
creating and printing formatted tables in the terminal. It supports various table styles and offers both color and non-color output options.
-
ctj
A command-line tool to convert CSV to JSON written in Rust
-
skyspell
Fast and handy spell checker for the command line
-
cmark-writer
A CommonMark writer implementation in Rust for serializing AST nodes to CommonMark format
-
aki-gsub
substitude text command, replace via regex
-
uroman
A blazingly fast, self-contained Rust reimplementation of the uroman universal romanizer
-
kiru
Fast text chunking for Rust
-
pomsky
A new regular expression language
-
oranda
🎁 generate beautiful landing pages for your projects
-
yangon
A high-performance, stack-allocated string type for Rust with fixed capacity and zero heap allocations
-
zspell
Native Rust library for spellchecking
-
asimov-cli
ASIMOV Command-Line Interface (CLI)
-
anyxml
A fully spec-conformant XML library
-
ripgrep_all
rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc
-
no-crlf
A CLI tool to convert CRLF line endings to LF in text files
-
okh-tool
A CLI tool to deal with Open Know-How (OKH) data files. Its main functionalities are: validation of and conversion between the different formats
-
collclean
Clean up collaboration commands in LaTeX files
-
document_tree
reStructuredText’s DocumentTree representation
-
ADA_Standards
help you handle checks on your ADA projects, especially good to build scripts to check coding standards conformity
-
mdv
Terminal Markdown Viewer
-
unicode-security
Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules
-
phs
Runtime for Phlow Script, PHS
-
zaz
cross-platform textual UI toolkit with bindings for Rust, C++, Zig and etc
-
inflections
High performance inflection transformation library for changing properties of words like the case
-
mdbook-quiz
Interactive quizzes for your mdBook
-
mdbook-catppuccin
🎊 Soothing pastel theme for mdBook
-
dicexp
A Dice Expression Interpreter program and library for parsing (and rolling) role-playing game style dice notations (e.g. "2d8+5")
-
galm
pattern matching library
-
tendril
Compact buffer/string type for zero-copy parsing
-
rake
Rapid Automatic Keyword Extraction (RAKE) algorithm
-
rustdoc-md
Convert Rust documentation JSON into clean, organized Markdown files
-
xrusty
Parse documents and transform using χrust
-
bmfont_rs
Load/ save/ manipulate BMFont files
-
quagga
CLI tool that combines multiple text files into a single prompt suitable for Large Language Models
-
codex
Human-friendly notation for Unicode symbols
-
mintyml-cli
Creates HTML from MinTyML, a minialist alternative syntax to HTML
-
gazenot
Gaze Not Into The Abyss, Lest You Become A Release Engineer
-
claude-kb-cli
CLI for generating, validating, and searching Markdown knowledge bases under a .claude/ hierarchy
-
topiary-queries
tree-sitter query files compatible with Topiary
-
fiat-lux
Offline terminal-accessible Bible
-
kas-text
Text layout and font management
-
derivre
A derivative-based regular expression engine
-
stringdex
A suffixtree search system for static sites
-
advreader
Split text file into text sequences, strings and (line) comments
-
annatto
Converts linguistic data formats based on the graphANNIS data model as intermediate representation and can apply consistency tests
-
crowbook
Render a Markdown book in HTML, PDF or Epub
-
ogam
A markup language for story writers
-
Inscribe
A markdown preprocessor that executes code fences and embeds their output
-
quickmd
Quickly preview a markdown file
-
quillmark-cli
Minimal CLI for testing Quillmark packages locally
-
kmdbook
Creates a book from markdown files
-
quickctx
A bidirectional file content aggregator and extractor that converts between files and markdown-formatted representations
-
asimov-imap-module
ASIMOV module for IMAP email import
-
dmos
Djot HTML renderer with advanced features
-
codeprism-core
Core incremental polyglot parser and graph builder for code analysis
-
anda_db_hnsw
A high-performance vector search library in Rust
-
markdown2pdf
Create PDF with Markdown files (a md to pdf transpiler)
-
mdbook-theme
A preprocessor and a backend to config theme for mdbook, especially creating a pagetoc on the right and setting full color themes from the offical ace editor
-
pad
padding strings at runtime
-
whatwg_streams
whatwg_streams for rust
-
shell2batch
Coverts simple basic shell scripts to windows batch scripts
-
trpl
A support crate for The Rust Programming Language book
-
cskk
C ABIから使う事を目的とした SKK(Simple Kana Kanji henkan)方式のかな漢字変換ライブラリ
-
latexmk-diff-head
LaTeX compilation tool that generates diff PDFs against Git commits
-
semantic-edit-mcp
MCP server for semantic code editing with tree-sitter
-
raylib_interactive
An interactive library for Raylib
-
koicore
core KoiLang module
-
csvpretty
A command-line tool that formats CSV input into tables with Unicode box-drawing characters
-
sbnf
A BNF-style language for writing sublime-syntax files
-
ib-matcher
A multilingual, flexible and fast string, glob and regex matcher. Support 拼音匹配 (Chinese pinyin match) and ローマ字検索 (Japanese romaji match).
-
deeprl
DeepL client library with all the things (blocking)
-
fuzzy-aho-corasick
Aho–Corasick automaton with fuzzy matching
-
rustdoc-stripper
manipulate rustdoc comments
-
panfix
parsing: linear time parsing of multifix operators
-
hanja_hangul
that converts Chinese characters to Korean characters. That is, convert hanja to hangul
-
mdbook-wordcount
Word count for mdbook, inspired by the mdbook tutorial
-
cicero-sophia
High-performance NLU (natural language understanding) engine built in Rust for speed, accuracy, and privacy
-
quickmark-cli
Lightning-fast Markdown/CommonMark linter CLI tool with tree-sitter based parsing
-
obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
mdbook-alerts
mdBook preprocessor to add GitHub Flavored Markdown's Alerts to your book
-
zalgo-codec
Convert an ASCII text string into a single unicode grapheme cluster and back. Provides a macro for embedding Rust source code that has been encoded in this way.
-
newdoc
Generate pre-populated module files formatted with AsciiDoc that are used in Red Hat and Fedora documentation
-
zhconv
Traditional/Simplified and regional Chinese variants converter based on MediaWiki & OpenCC rulesets and powered by AC automata 轉換简体、繁體及兩岸、新馬中文地區詞,基於MediaWiki和OpenCC之字詞轉…
-
search-text
A fast and flexible command-line tool to recursively search for text or regex patterns in files under a directory
-
mandown
Markdown to groff (man page) converter
-
lychee-lib
A fast, async link checker
-
syllabize-es
Syllabize Spanish text, and much more
-
olpc-cjson
serde_json Formatter to serialize as OLPC-style canonical JSON
-
repoyank
Interactively traverse your repository, select files/directories, and quickly prepare structured snippets for LLM interactions
-
string_pipeline
A flexible, template-driven string transformation pipeline for Rust
-
gen-mdbook-summary
generate SUMMARY.md for mdbook project
-
vi
An input method library for vietnamese IME
-
ast-grep-language
Search and Rewrite code at large scale using precise AST pattern
-
cmx
Rust Spectral Color Management Library
-
simdnbt
an unnecessarily fast nbt decoder
-
jsongrep
A JSONPath-inspired query language for JSON documents
-
mle
The markup link extractor (mle) extracts links from markup files (Markdown and HTML)
-
esri_ascii_grid
reading ESRI Ascii Grid .asc files
-
roe
Unicode case conversion
-
substring
method for string types
-
precis-tools
Tools and parsers to generate PRECIS tables from the Unicode Character Database (UCD)
-
ansiterm
ANSI terminal colours and styles (bold, underline)
-
regect
A cli tool to quickly test regular expressions
-
go22dos
go to todos
-
lindera-python
Python binding for Lindera
-
svgbob
Transform your ascii diagrams into happy little SVG
-
vibrato-rkyv
Vibrato: viterbi-based accelerated tokenizer with rkyv support for fast dictionary loading
-
gulagcleaner_rs
Ad removal tool for PDFs
-
ultra-nlp
A NLP library
-
hexout
A compact and dependency-free, flexible and customizable hex dump library for Rust that provides beautiful, configurable binary data visualization
-
two_percent
Fuzzy Finder in rust!
-
whitespace-sifter
Sift duplicate whitespaces away!
-
mdbook-git
Insert git commit files and diffs into mdbook
-
fonts
High-performance font parsing and analysis library for Grida Canvas
-
serpl
terminal UI for search and replace, ala VS Code
-
mdbook-cmdrun
mdbook preprocessor to run arbitrary commands
-
string_wizard
manipulate string like a wizard
-
marqant
Quantum-compressed markdown format for AI consumption with 90% token reduction
-
spargebra
SPARQL parser
-
four-char-code
A string of 4 ascii chars represented by an u32
-
searchfox-cli
CLI for https://searchfox.org
-
howmany
A blazingly fast, intelligent code analysis tool with parallel processing, caching, and beautiful visualizations
-
yake-rust
Yake (Yet Another Keyword Extractor) in Rust
-
minspan
a package for determining the minimum span of one vector within another
-
pretty-console
A fluent, zero-cost API for styling terminal text with colors and attributes
-
repgrep
An interactive command line replacer for
ripgrep -
roman-numerals-rs
Manipulate well-formed Roman numerals
-
rfgrep
Advanced recursive file grep utility with comprehensive file type classification - search, list, and analyze 153+ file formats with intelligent filtering and safety policies
-
stam-tools
Command-line tools for working with stand-off annotations on text (STAM)
-
mdbook-pagetoc
A mdbook plugin that provides a table of contents for each page
-
rst
a reStructuredText parser and renderer for the command line
-
bangumi-api
An api implementation for Bangumi website
-
tessera-embeddings
Multi-paradigm embedding library: ColBERT, dense, sparse, vision-language, and time series models
-
shibuichi
zsh prompt preprocessor to add git integration
-
nib-cli
A cli for a yet another static site generator Nib
-
none-shall-pass
Artifact for GitHub Action to validate hyperlinks in all markdown files
-
mdfried
A markdown viewer for the terminal that renders images and big headers
-
latkerlo-jvotci
Tools for creating and decomposing Lojban lujvo
-
opentalk-types-common-identifiers
Common identifier types for OpenTalk crates
-
unicode-bidi-mirroring
Unicode Bidi Mirroring property detection
-
vlazba
Lojban words generator and analyzer
-
ferrous-opencc
A pure Rust implementation of Open Chinese Convert (OpenCC), for fast and reliable conversion between Traditional and Simplified Chinese
-
minimo
terminal ui library combining alot of things from here and there and making it slightly easier to play with
-
gibberish-or-not
Figure out if text is gibberish or not
-
comically
fast manga & comic optimizer for e-readers
-
bibleref
Structures and functions for managing Bible references
-
filename-refactor
Command to refactor file names
-
twas
A text substitution application for using random look-up tables to generate text in a manner similar to the Mad Libs game
-
claudius
SDK for the Anthropic API
-
bbd
Binary Braille Dump
-
uast
Unicode Aware Saṃskṛta Transliteration in Rust 🦀
-
picodiff
Tiny GUI app to compare text easily
-
mdvalidate
Markdown schema validation engine
-
erg_common
A common components library of Erg
-
datafusion-functions
Function packages for the DataFusion query engine
-
sile
Simon’s Improved Layout Engine
-
asposepdf
Aspose.PDF for Rust via C++ is a powerful toolkit that allows developers to manipulate PDF files directly and helps do various tasks for PDF. Contains unique features for converting PDF to other formats.
-
tu
CLI tool to convert a natural language date/time string to UTC
-
rlibphonenumber
A high-performance Rust port of Google's libphonenumber for parsing, formatting, and validating international phone numbers
-
sanitize-filename-reader-friendly
A filename sanitizer aiming to produce reader friendly filenames
-
ankinase
A parser which generates Anki cards from CommonMark
-
bbqr
Implementaion of the bbqr spec in rust
-
ragit
git-like rag pipeline
-
archive-pdf-urls
Extract all links from a PDF and archive the URLs in the Internet Archive's Wayback Machine
-
unidoc
Unite all Markdown
-
tars-bin
A small, fast, static site generator
-
strs_tools
Tools to manipulate strings
-
luciferous-case-converter
A CLI tool to convert text between different cases
-
command-error
Detailed error messages and status checking for
std::process::Command -
bogrep
Full-text search for bookmarks from multiple browsers
-
cleanpy
Python 3.12+ 最佳实践知识库 - MCP 服务器
-
vmks-exam-generator
CLI program for pseudo-randomly generating different variants of an embedded programming exam
-
model2vec-rs
Official Rust Implementation of Model2Vec
-
mdbook-environment
A preprocessor for MdBook for working with environment variables
-
plsfix
Text cleaner upper
-
capitalize
Change first character to upper case and the rest to lower case, and other common alternatives
-
fast-slice-utils
Highly optimized slice utilities using SIMD instructions when available
-
dspy-rs
A DSPy rewrite(not port) to Rust
-
mdka
HTML to Markdown converter
-
latex-thebib
Clean and sort legacy TeX bibliographies written using ‘thebibliography’ via the
refactorsub-command. Compile BibTeX files to legacythebibliographyTeX code using thecompilesub-command… -
srt2txt
Convert SRT subtitle files into clean plain text (strip timestamps, tags, merge lines)
-
blame-rs
Line-by-line authorship tracking for revisioned text
-
colornames
An enum of color names, with a catchall RGB variant
-
mdbook-graphviz
mdbook preprocessor to add graphviz support
-
dash-em
Enterprise-Grade Em-Dash Removal Library — SIMD-Accelerated String Processing
-
css_lexer
A spec-compliant CSS tokenizer with zero-copy cursors and optional feature gates
-
pdf_tables
Scrape text from tables in PDF files
-
vesti
A preprocessor that compiles into LaTeX
-
transmutation
High-performance document conversion engine for AI/LLM embeddings - 27 formats supported
-
textwrap-cli
Command line interface for textwrap
-
nanohtml2text
A zero-dependency library to convert HTML to plain text
-
wdl-doc
Documentation generator for Workflow Description Language (WDL) documents
-
cedarwood
efficiently-updatable double-array trie in Rust (ported from cedar)
-
mdbook-aquascope
Interactive Aquascope editor for your mdBook
-
hebrew_unicode_script
A low-level library designed to ascertain whether a character belongs to the Hebrew Unicode script. It supports checks for individual characters as well as for membership within collections
-
lazy-grep
A high-performance, line-oriented command-line tool for searching text with regular expressions
-
pager
pipe your output through an external pager
-
wchar
Procedural macros for compile time UTF-16 and UTF-32 wide strings
-
tphrase
A translatable phrase generator
-
xi-unicode
Unicode utilities useful for text editing, including a line breaking iterator
-
fasttext
binding
-
mdbook-angular
mdbook renderer to run angular code samples
-
acroform
High-level PDF form manipulation library using lopdf
-
yosina
Japanese text transliteration library
-
markdown-harvest
designed to extract, clean, and convert web content from URLs found in text messages into clean Markdown format. Originally created as an auxiliary component for Retrieval-Augmented Generation (RAG)…
-
mdbook-embedify
based mdbook preprocessor plugin that allows you to embed apps to your book, like youtube, codepen, giscus and many other apps
-
blogr-cli
A CLI static site generator for blogs
-
kreuzberg-tesseract
Rust bindings for Tesseract OCR with cross-compilation, C++17, and caching improvements
-
simstring_rust
A native Rust implementation of the SimString algorithm
-
airs-memspec
Streamline AI-assisted development with Multi-Project Memory Bank management and GitHub Copilot integration
-
mdbook-d2
D2 diagram generator plugin for MdBook
-
rsxiv
Tools for working with arXiv and the arXiv API
-
string-offsets
Converts string offsets between UTF-8 bytes, UTF-16 code units, Unicode code points, and lines
-
rustkorean
processing Korean characters. It provides functionalities to check if a character is Korean, classify Korean characters, verify if a character is a leading consonant (choseong), a medial vowel (jungseong)…
-
unidown
Convert Markdown to Unicode
-
zp
Copy the contents of the source file or the standard output buffer to the clipboard, with support for maintaining a history of copied content, allowing users to easily paste into another file or program
-
chinese_dictionary
A searchable Chinese / English dictionary with helpful utilities
-
fontheight
Find out the vertical extents your font reaches on shaped words
-
steelkilt
Draft 0.4 RPG rule set
-
hawkeye-fmt
The formatter library for hawkeye cli
-
nanokit
A collection of tiny, reusable utility methods that reduce code size and improve performance
-
clima
A minimal Markdown reader in the terminal
-
easymark
Lightweight Markdown rendering utility that just works
-
text2num
Parse and convert numbers written in English, Dutch, Spanish, Portuguese, German, Italian or French into their digit representation
-
seams
High-throughput sentence extractor for Project Gutenberg texts with dialog-aware detection
-
boxen
creating styled terminal boxes around text
-
sff
SemanticFileFinder (sff): Fast semantic file finder using sentence embeddings. Searches .txt, .md, .mdx files.
-
pdf_oxide
High-performance PDF parsing and conversion library with ML capabilities
-
mdbook-private
An mdbook preprocessor that controls visibility of private chapters and sections within them
-
text-to-ascii-art
program to convert text to ASCII art
-
lo_
A modern Rust utility library delivering modularity, performance & extras ported from JavaScript Lodash
-
fast_symspell
Spelling correction & Fuzzy search
-
noil
file explorer using text buffers
-
mdbook-linkcheck2
A backend for
mdbookwhich will check your links for you -
quant1x
Cross-language standard library for quantitative trading
-
madoru
markdown task runner
-
create_broken_files
Create broken files from other ones
-
dvd-term
A bouncing ASCII art DVD logo (or custom text) for the terminal
-
folderwalk
Folder walking tool
-
jx
An interactive JSON explorer for the command line
-
ruma-events
Serializable types for the events in the Matrix specification
-
wcount
CLI word counting tool
-
COXave
Instruments for codings
-
affinidi-messaging-text-client
Affinidi Messaging SDK
-
difference-rs
text diffing and assertion library
-
numeronym-cli
A CLI tool that generates numeronym abbreviations from text
-
nobom
remove UTF-8 BOM (Byte Order Mark) from stdin and write to stdout
-
neofile
linux file command for windows. PATCH 4.
-
GemFetch
neofetch-style inspired in a Gem
-
semtools
Semantic search and document parsing tools for the command line
-
rlvgl-ui
High-level UI components and theming for rlvgl
-
xhtml_parser
Non-validating XHTML Tree-based parser
-
malachi
A domain specific pattern matching language made for defining bot commands
-
wikipedia_prosesize
Count Wikipedia prose size
-
skimtoken
Fast token count estimation library
-
aki-mcolor
mark up text with color
-
forbidden-bands
8-bit string handling library
-
indent
Functions for indenting multiline strings
-
spanned
string processing with file/line/col information and the regular rust
strAPI -
syntaxfmt
A derive macro-based library for flexible syntax tree formatting with pretty printing support
-
escaping
configurable string escaping and unescaping
-
tesseract-rs
Rust bindings for Tesseract OCR with optional built-in compilation
-
lexicmp
comparing and sorting strings lexicographically and naturally
-
sk-skimmer
Fuzzy Finder in rust!
-
retrofont
Retro terminal font toolkit: TDF/FIGlet parsing, rendering, conversion
-
braillify
Rust 기반 크로스플랫폼 한국어 점역 라이브러리
-
tiny_pretty
Tiny implementation of Wadler-style pretty printer
-
ansic
does ansi parsing in a dynamic DSL and at compile time for efficient and zero cost ansi styling
-
mdlynx
Small, fast utility to find broken file links in Markdown documents
-
spel-right
A fast and lightweight spell checker and suggester
-
rblog
Blog app
-
string-auto-indent
Normalizes multi-line string indentation while preserving platform-specific line endings
-
datafusion-spark
DataFusion expressions that emulate Apache Spark's behavior
-
fm
Non-backtracking fuzzy text matcher
-
mktoc
Generate Table of Contents from Markdown files
-
jpreprocess
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
twars-url2md
A powerful CLI tool that fetches web pages and converts them to clean Markdown format using Monolith for content extraction and htmd for conversion
-
textpod
Local, web-based notetaking app inspired by 'One Big Text File' idea
-
streplace
A tiny library for matching and replacing in strings and slices with user-defined functions
-
xim-ctext
compound text en/decoder
-
shifter
A CLI tool to rotate lines from stdin by n positions
-
asciisavers
A small collection of ascii screensavers
-
markdown-translator
A translation library with DeepLX API integration, rate limiting, and smart text chunking
-
jetscii
A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes
-
autosurgeon
working with data in automerge documents
-
quillmark
engine API
-
zipcodes
Query US zipcodes without SQLite
-
ggemtext
Glib-oriented Gemtext API
-
zet
zet finds the union, intersection, set difference, etc of files considered as sets of lines
-
distrs
PDF, CDF, and percent-point/quantile functions for the normal and Student’s t distributions
-
anthropic-text-editor
A micro-CLI to apply tool calls from Anthropic for their text_editor_20250124 built-in computer use tool
-
mantra-miner
your software recite mantras while it runs
-
rob_test_sagebox_integration_001
Internal test crate for validating Sagebox packaging and README rendering. Not intended for public use.
-
jaarg
It can parse your arguments you should use it it's called jaarg
-
mdbook-numthm
An mdbook preprocessor for automatically numbering theorems, lemmas, etc
-
adc-lang
Array-oriented reimagining of dc, a terse RPN esolang
-
objectstore-client
Client SDK for Objectstore, the Sentry object storage platform
-
flatpage
file system based markdown flat page
-
kfst-rs
Fast and portable HFST-compatible finite-state transducers
-
mdtrans
Markdown parser and transformer using
pest.rs, focused on flexibility to a project’s needs -
datadriven
Rewritable table-driven testing
-
ipset_lookup
ipset is a command-line tool that takes networks or IPs and searches through a lot of different threat feeds quickly. It can also download the feed data necessary to perform the queries…
-
libdav
CalDAV and CardDAV client implementations
-
unicode_titlecase
add Unicode titlecase and Turkish and Azeri locale upper/lowercase utilities to chars and strings
-
mdbook-pikchr
A mdbook preprocessor to render pikchr code blocks as images in your book
-
good-pc
Just a silly little tool to pet your pc and display some ASCII art
-
goose-eggs
in writing Goose load tests
-
devek
CLI for copying HTML to clipboard
-
oxford_join
Join string slices with Oxford Commas!
-
tre-regex
Rust safe bindings to the TRE regex module
-
lethe-core-rust
High-performance hybrid retrieval engine combining BM25 lexical search with vector similarity using z-score fusion. Features hero configuration for optimal parity with splade baseline…
-
mailidator
A lightweight Rust library for checking email address misspellings
-
deencode
Reverse engineer encoding errors
-
krafna
terminal-based alternative to Obsidian's Dataview plugin, allowing you to query your Markdown files using standard SQL syntax
-
wordcutw
A C-interface wrapper for Wordcut - a Lao/Thai word segmentation/breaking library
-
inkjet
A batteries-included syntax highlighting library for Rust, based on tree-sitter
-
booky
analyze English text
-
vidyut-prakriya
A Sanskrit word generator
-
pinyin2ch
converting Chinese Pinyin to Chinese characters with various levels of detail
-
textcode
Text encoding/decoding library. Supports: UTF-8, ISO6937, ISO8859, GB2312
-
shvar
POSIX-compliant shell variable substitution routine
-
izihawa-tantivy-stacker
term hashmap used for indexing
-
fax
Decoder and Encoder for CCITT Group 3 and 4 bi-level image encodings used by fax machines TIFF and PDF
-
r2md
Entire codebase to single markdown or pdf file
-
natural
Pure rust library for natural language processing
-
cosmic-text-tessera-fork
Pure Rust multi-line text handling
-
tree-sitter-stack-graphs-typescript
Stack graphs definition for TypeScript & TSX using tree-sitter-typescript
-
mdbook-last-changed
mdbook preprocessor to add the last modification date per page
-
mdbook-codeblocks
A mdbook preprocessor to prepend customizable vignette to code blocks
-
mdbook-qr
An mdBook preprocessor that generates a QR code using fast_qr
-
casile
The command line interface to the CaSILE toolkit, a book publishing workflow employing SILE and other wizardry
-
rust-persian-tools
Official Rust implementation of Persian Tools
-
cardz
contact book
-
doxygen-bindgen
Converts Doxygen comments into Rustdoc markdown
-
md-ulb-pwrap
Markdown paragraph wrapper using Unicode Line Breaking Algorithm
-
string-patterns
Makes it easier to work with common string patterns and regular expressions in Rust, adding convenient regex match and replace methods (pattern_match and pattern_replace) to the standard…
-
nu-utils
Nushell utility functions
-
constr
Constant string generics
-
text-similarity-metrics
A high-performance Rust library for computing text similarity using multiple algorithms
-
rsrpp
project for research paper pdf
-
CompactPrefixRadix
a minimalistic but efficient radix tree implementation with extra prefix support
-
yara-x-parser
A parsing library for YARA rules
-
recursive-file-loader
recursively load files via references in the files
-
easy_reader
easily navigating forward, backward or randomly through the lines of huge files
-
bullet_stream
Bulletproof printing for bullet point text
-
creature_feature
Composable n-gram combinators that are ergonomic and bare-metal fast
-
veltox
A fast, themeable static site generator written in Rust — ideal for documentation and blogs
-
indent_write
Write adapters to add line indentation
-
exml
Pure Rust XML library based on libxml2
-
case
A set of letter case string helpers
-
slack-blocks-render
Slack blocks render is a Rust library to render Slack blocks as Markdown
-
jsrmx
command-line tool to manipulate JSON files. It can split large single-object JSON files into many files; merge multiple JSON files into one large JSON file; bundle multiple JSON files into one NDJSON file…
-
smol_strc
smol_str forks, replace Arc with Rc
-
compression-prompt
Fast statistical compression for LLM prompts - 50% token reduction with 91% quality retention
-
xid
Globally unique sortable id generator. A Rust port of https://github.com/rs/xid.
-
tbll
tbll outputs data in tabular format
-
mdbook-chess
An mdbook preprocessing plugin to generate chess boards
-
unicode-writing-script
The package provides detection of writing scripts from Unicode codepoints
-
lister-cli
Lister: Navigate Markdown Lists
-
phd
an esoteric gopher server
-
tortilla
Somewhat syntax-aware text wrapping for source code and plain text documents
-
url_encor
A lightweight library to encode and decode special characters in urls
-
dnd-character
A Dungeons and Dragons character generator
-
tibco_ems
A high level API for the Tibco EMS
-
pangu
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
mdbook-variables
mdBook proprocessor for risolve variables configured from book.toml
-
mdbook_fork4ls
Fork of mdBook for mdBook_LS
-
multipart-write
Sink-like interface for writing an object in parts
-
secular
No Diacr!
-
grapheme_machine
Grapheme cluster text segmentation (UAX #29) state machine for streaming input
-
utf8_iter
Iterator by char over potentially-invalid UTF-8 in &[u8]
-
tinyvec_string
tinyvec based string types
-
fkys-rs
F*cking Kill Yourself lang interpreter written in Rust
-
rust-censure
profanity filter
-
lumin
searching and displaying local files
-
mdbook-glossary
mdBook preprocessor to generate glossary tables
-
utf16_lit
macro_rules to make utf-16 literals
-
unicode-properties
Query character Unicode properties according to UAX #44 and UTR #51
-
mdbook-docfx
mdbool-docfx
-
teip
Masking tape to help commands "do one thing well"
-
genpdf
User-friendly PDF generator written in pure Rust
-
rust-texas
generate latex documents
-
asimov-prompt
ASIMOV Software Development Kit (SDK) for Rust
-
pinyin-parser
Parses a string of pinyin syllables. Covers marginal cases such as
ẑ,ŋandê. -
twincat
ADS tools
-
rustpress
增量编译倒分页无后端 Rust 纯静态博客程序
-
vectradb-chunkers
Chunking utilities for VectraDB in Rust
-
json_to_table
pretty print JSON as a table
-
tcod
bindings for the Doryen library (a.k.a. libtcod).
-
rtranslate
dependency-free Rust wrapper for Google Translate public web API
-
newline_normalizer
Zero-copy newline normalization to \n or \r\n with SIMD acceleration
-
likewise
A diff library for Rust (fork of similar)
-
regexml
XPath compatible regex engine
-
prompt-input
lightweight library for user input prompts in Rust, designed to make input handling straightforward
-
simplematch
Fast wildcard pattern matching for strings and bytes with a simple api
-
kashida
Insert Kashidas/Tatweel into Arabic text, e.g. for justification purposes.
-
zh_num
Convert ASCII numbers and zh words
-
pgf2json
Application Programming Interface to load and interpret grammars compiled in Portable Grammar Format (PGF). The PGF format is produced as a final output from the GF compiler. The library…
-
alphabet_detector
Natural language alphabet detection library
-
genedex
A small and fast FM-Index implementation
-
trailfix
Trim trailing whitespace and ensure single newline at EOF
-
dug
An exhaustive name resolution aggregator
-
codebase-to-markdown
convert codebase to markdown format
-
rascii_art
Advanced ASCII Art Generator
-
cai
The fastest CLI tool for prompting LLMs
-
patiencediff
algorithm
-
utf8proc
Rust bindings to the utf8proc library
-
seshat-unicode
A Unicode Library for Rust. Unicode 16.0.0 ready. XID_Start and XID_Continue are also available.
-
langextract-rust
extracting structured and grounded information from text using LLMs
-
himmelblau_red_asn1
A little library to encode/decode ASN1 DER
-
lindera-unidic-builder
A Japanese morphological dictionary builder for UniDic
-
ripsecrets
A command-line tool to prevent committing secret keys into your source code
-
emojicon
Find Emoji by using Emoticons and GitHub's, Bengali emoji names
-
trustformers-tokenizers
Tokenizers for TrustformeRS
-
furigana
Map furigana to a word given its reading
-
glu
Unpacks a Markdown document full of code snippets into a temp directory
-
mdbook-org
A preprocessor plugin for mdBook that enables org-mode file support
-
smt-str
working with SMT-LIB strings in Rust
-
xgrammar-rs
Efficient, Flexible and Portable Structured Generation for Rust - Rust bindings for XGrammar
-
threeway_merge
Git-style 3-way string merging using proven algorithms from libgit2/xdiff. Statically links xdiff (LGPL-2.1+).
-
token_processor
fast, streaming‑first Rust library for processing LLM outputs by attaching callbacks to XML‑style tags—supporting both streaming and buffered handlers—and using aho‑corasick for ultra‑efficient…
-
kaff_sso
Small-buffer-optimized generic buffer and UTF-8 string type
-
rascii_art_img
Advanced ASCII Art Generator. Fork for imgii.
-
perky
An application to permute and score keyboard layouts
-
sakurs-core
High-performance sentence boundary detection using Delta-Stack Monoid algorithm
-
zmodem2
ZMODEM file transfer protocol
-
typed-oid
Typed Object IDs
-
isbn
handling ISBNs
-
mdtablefix
mdtablefixunb0rks and reflows Markdown tables so that each column has a uniform width. When the--wrapoption is used, it also wraps paragraphs and list items to 80 columns. -
did-toolkit
spec-compliant implementation of did-core, W3C's 'Decentralized Identity Documents'
-
opusmeta
reading and writing metadata to opus files
-
mdref
Markdown Reference finding and migration tool
-
mdbook-callouts
mdBook preprocessor to add Obsidian Flavored Markdown's Callouts to your book
-
sapling-streampager
streampager is a pager for command output or large files
-
fast-cat
An ergonomic macro for efficient string concatenation with a single memory allocation
-
encoding-next
Character encoding support for Rust
-
rsticle
Treat source files as articles / narrative documentation
-
rspack_error
rspack error
-
lil-tabby
A macro-based library for creating visually appealing tables with automatic column spanning
-
rewrite
Safely rewrite file contents from stdin, even when file is open as an input
-
utf64
encode utf-8 strings into utf-64, and decode them back
-
zen-rs
generating non-interactive content like cards or files
-
quot
A fast and flexible command-line tool that converts text input into escaped string literals
-
e9571_str_lib
string processing, sorting, counting, and hashing
-
topiary-cli
CLI app for Topiary, the universal code formatter
-
nlpo3
Thai natural language processing library, with Python and Node bindings
-
mdsh
Markdown shell pre-processor
-
diagnostic
Pretty diagnostic report
-
custard
A frontmatter-querying server
-
anycase
a case conversion library for Rust
-
langram
Natural language detection library
-
omnix-common
Common functionality for omnix frontends
-
ticker-sniffer
extracting multiple stock ticker symbols from a text document
-
gazetta-render-ext
A static site generator framework. Extra render code.
-
basic-text
Basic Text strings and I/O streams
-
csvpp
Compile csv++ source code to a target spreadsheet format
-
dodo-zh
operate on pinyin and cedict.u8 file
-
itext
Safe rust bindings to the iText 9 PDF generation library written in Java
-
colourss
parsing CSS color strings into RGB values
-
armnod
random string generator
-
mdbook-curly-quotes
mdBook preprocessor that replaces straight quotes with curlyquotes, except within code blocks or code spans
-
slugrs
A fast, locale-aware slugify library for Rust
-
float-pretty-print
Format f64 for showing to user, not for serialisation
-
asciidork-eval
Asciidork eval
-
dedoc
Terminal-based viewer for DevDocs documentation
-
okh-scraper
A scraper of Open Source Hardware (OSH) projects. based on the Open Know-How (OKH) standard
-
n_gram
training n-gram language models
-
mdbook-tocjs
A mdbook preprocessor which adds extra js and css file for ToC hydration
-
wcc
my own version of wc for personal use
-
hlight
dedicated to delivering exceptional syntax highlighting capabilities
-
spellchk
A blazingly fast spellchecker CLI for any text file
-
univiz
A command-line tool for analyzing Unicode strings, providing detailed information about graphemes, code points, and UTF-8 byte sequences
-
repvar
A tiny CLI tool that replaces variables of the style
${KEY}in text with their respective value. It can also be used as a rust library -
text-scatters
A cut-up technique generator from text and ebook files in the terminal
-
unicode-matching
match Unicode open/close brackets
-
crate2bib
Create BibLaTeX entries for crates hosted on crates.io
-
codebook-lsp
A code-aware spell checker with language server implementation, installable via cargo install
-
unicode-display-width
Unicode 15.1.0 compliant utility for determining the number of columns required to display an arbitrary string
-
text_trees
textual output for tree-like structures
-
ucf
A universal code formatter
-
pkgcraft-c
C library for pkgcraft
-
iotext_rs
IoText data protocol
-
bpe-match
A pattern matching library for BPE tokenization, intended to replace regex-based approaches
-
unbom
Remove UTF-8 BOM from files
-
dprint-plugin-markdown
Markdown formatter for dprint
-
lcat
lolcat in rust! Full unicode support, escapes for ANSI escape sequences, hue shift in Cubehelix
-
deno_node
Node compatibility for Deno
-
polished_scancodes
handling and mapping keyboard scancodes in Rust
-
slugify-core
Fast, Unicode-aware slug generation library with multi-language bindings
-
gitbook2text
A CLI tool to download GitBook pages and convert them to markdown and text
-
loc
Count lines of code (cloc) fast
-
jayce
tokenizer 🌌
-
harfbuzz_rs
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
mut-str
A toolkit for working with mutable string slices (&mut str)
-
cin
that simplifies command-line input in Rust, especially mimicking C++-style input
-
intuicio-frontend-simpleton
Simpleton frontend module for Intuicio scripting platform
-
basalt-tui
Basalt TUI application for Obsidian notes
-
mdbook-selfpath
A preprocessor for mdbook that allows self-referencing paths
-
matchr
A fast fuzzy matcher library written in Rust for use in CLI tools and TUI apps
-
ere
A compile-time alternative for POSIX extended regular expressions
-
pdf2pwg
Single purpose A4 page renderer rendering PDF using pdfium to PWG/URF
-
mdbook-bib
mdbook plugin allowing to load and present a bibliography in BibLaTex format in your books and cite its references
-
strf
Format strings with arbitrary expressions using highlighting friendly syntax
-
partial-json-fixer
Partial JSON fixer fixes partial JSON
-
autotex
Continuously compile TeX and LaTeX
-
wrap-ansi
A high-performance, Unicode-aware Rust library for intelligently wrapping text while preserving ANSI escape sequences, colors, styles, and hyperlinks
-
uresamp
HIFI uresamp delivers ultrasonic-fidelity text resampling via adaptive 64-bit floating-point spectral mapping, preserving Unicode 32-bit codepoint integrity with zero-phase distortion
-
dwg-core
Deterministic Writing Guard core analysis engine for spotting AI-styled prose
-
rust-regex-dsl
Regular expression DSL
-
emoji-search
Fast fuzzy emoji searcher and picker for the terminal
-
autofoam
related tools
-
flowquad
that helps you build UI stuff with Macroquad
-
iati-types
Core data types for IATI Activity v2.03: Activity, Transaction, Money, TxType, etc
-
turbovault-parser
Obsidian Flavored Markdown (OFM) parser
-
treebender
An HDPSG inspired symbolic NLP library for Rust
-
unicode-rs
A comprehensive Unicode character library for Rust applications with theme support
-
agentai
designed to simplify the creation of AI agents
-
gemini-map
A command-line tool to run files in parallel through Google Gemini
-
puroro
Google Protocol Buffer
-
byteforge
A next-generation byte-level transformer with multi-signal patching and SIMD optimization
-
illuminate-string
A comprehensive Rust library for advanced string manipulation and processing
-
brk_string_wizard
manipulate string like a wizard
-
text_utils_s
edit array. Example delete duplicate in array. Clear string
-
typst-cffi
FFI to Typst
-
rust-ai
A collection of 3rd-party AI APIs for Rust
-
ranked-searcher
Search inside text files using tf-idf formula, showing the most relevant search at the top
-
trump
Spawn a background thread that prints a Donald Trump quote every 10 seconds
-
seam
Symbolic Expressions As Markup
-
human_regex
A regex library for humans
-
rrag
High-performance Rust framework for Retrieval-Augmented Generation with pluggable components, async-first design, and comprehensive observability
-
async-utf8-decoder
Convert AsyncRead to incremental UTF8 string stream
-
metatron
core library
-
aki-xtee
copy standard input to each files and standard output
-
emoji
Every emoji, their metadata, and localized annotations
-
lingua-english-language-model
The English language model for Lingua, an accurate natural language detection library
-
waken_snowball
Snowball stemming algorithms for 33 languages
-
adobe-cmap-parser
parse Adobe CMap files
-
typing_engine
A typing game engine for Japanese and English
-
jawk
JSON AWK
-
vibrato
viterbi-based accelerated tokenizer
-
caseify
A CLI tool to convert strings between different cases
-
scrunch
full-text-searching compression
-
pyo3-filelike
Rust access to Python file-like objects
-
sedregex
Sed-like regex library
-
ngram_rs
Facilitate creating ngrams in Rust to be used in the polars plugin
-
broken-md-links
A command-line tool and library to detect broken links in Markdown files
-
mdbook-tiny
Use mdbook to generate tiny and fast static sites
-
monochora
gif to ascii art converter written in rust
-
tiktokenx
A high-performance Rust implementation of OpenAI's tiktoken library
-
rlex
A cursor-based, utf-8 Vec<char> lexer
-
sortuniq
Find or count unique values in an input stream
-
hyphertool
Hypertool is a command-line tool for syllabification and hyphenisation
-
sigrs
Interactive grep (for streaming)
-
japanese-codepoints
A high-performance Rust library for Japanese character validation and code point handling based on JIS standards
-
rust_string_utils
String utilities for rust based on org.apache.commons.lang3
-
img2epub
Convert images to EPUB
-
diffy-imara
Tools for finding and manipulating differences between files
-
vader-sentimental
A faster Rust version from the original Python VaderSentiment analysis tool
-
instant-segment
Fast English word segmentation
-
litsea
extreamely compact word segmentation and model training tool implemented in Rust
-
hanconv
Convert between Chinese characters variants
-
rustpython-parser-vendored
RustPython parser vendored third-party crates
-
google-book-scraper
downloading the contents of books hosted on books.google.com for offline viewing
-
runefix-core
Unicode character display width engine supporting CJK, emoji, and grapheme clusters
-
egui_text_animation
text animation using egui
-
dmos-cli
Djot HTML renderer with advanced features - CLI
-
mdbook-godbolt
A preprocessor for mdbook to add runnable code snippets via Godbolt
-
readability
Port of arc90's readability project to rust
-
invisible-characters
A list of invisible characters
-
colonnade
format tabular data for display
-
enma
serving anime and manga information 📦
-
ricat
A Rust-Based implemenation of classic UNIX
catcommand -
pandoc_types
Rust port of pandoc-types
-
regextrie
High-performance Rust library designed for matching a large number of regular expressions against a given input string
-
djotters
Djot (Markdown) parser that runs at hyper speeds!
-
htmd-cli
The command line tool for htmd
-
chord3
Create pdf songbooks from chopro source
-
lindera-wasm
A morphological analysis library for WebAssembly
-
snips
Keep code snippets in markdown files in sync
-
pragmatic-segmenter
Rust port of pySBD v3.1.0
-
domrs
Document builder and serializer
-
mdbook-lint-rulesets
Modular rulesets for mdbook-lint - standard and mdBook-specific linting rules
-
plot3d
reading, writing, and manipulating NASA PLOT3D structured grids
-
spacemod
A easy to understand and powerful text search-and-replace tool
-
pspp
Statistical analysis software
-
csep
Cosine Similarity Embeddings Print
-
rjot
A minimalist, command-line jotting utility that's fast, private, and git-friendly
-
normalized-line-endings
Returns an iterator over characters with all line endings converted to \n (U+000A)
-
webdog
static site generator fit for a dog
-
opentalk-types-signaling-legal-vote
Signaling types for the OpenTalk legal vote module
-
yekdast
normalizing and cleaning up Persian (Farsi) text
-
timeblok-cli
A CLI for timeblok
-
yeslogic-ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched with support for additional tables
-
tdoc
assorted CLI tools for working with FTML (Formatted Text Markup Language) documents
-
vds
Visibly distinguishable string types for identifiers and codes
-
caco3
common lib
-
inlet_manifold
A general purpose highlighting library
-
vader_sentiment
Bindings for Rust from the original Python VaderSentiment analysis tool
-
mdloc
command-line tool for processing image links in Markdown files. Download remote images and convert them to Base64 embedded format or local file references.
-
wp-mini-epub
Minimal async WP to EPUB downloader | Extremely minimal
-
drova_sdk
Sdk for absolute converter of formats for dalet
-
json-carver
Digital forensics tool that reads (carves) JSON strings from a dump. Think of it as a more accurate and faster replacement for the strings(1) utility.
-
bpetok
CLI for tokenizing text input using Byte Pair Encoding (BPE)
-
soft-ascii-string
char/str/string wrappers which add a "is-ascii" soft constraint
-
grapheme-cluster-utils
Unicode-safe String manipulation utilities by extended grapheme cluster (UAX #29)
-
hexdump
Easy hexdump to stdout or as an iterator
-
tergo-tokenizer
R language tokenizer
-
recase
Changes the convention case of input text
-
piki-gui
FLTK-based GUI for Piki personal wiki application
-
mdmg
A scaffold prototype code tool
-
tetratto-shared
Shared stuff for Tetratto
-
udataframe_rs
A pure Rust library for data frame operations, particularly useful for processing data extracted from PDF files or OCR recognize
-
charx
A replacement for char::is_ascii*
-
pathmut
Command line utility for manipulating path strings
-
slugi
slugify text (e.g., 'Café au lait' → 'cafe_au_lait')
-
doxx
Terminal document viewer for .docx files
-
asimov-serpapi-module
ASIMOV module for data import powered by the SerpApi search data platform
-
odict
A blazingly-fast dictionary file format for human languages
-
artificial
Typed, provider-agnostic prompt-engineering SDK for Rust
-
case_insensitive_hashmap
A HashMap that uses case-insensitive strings as keys
-
markitdown
designed to facilitate the conversion of various document formats into markdown text
-
wg-ragsmith
Semantic chunking and RAG utilities for document processing and retrieval-augmented generation
-
tuicam
Terminal-based camera with switchable modes
-
bashdoc
generating documentation/help menu for user defined bash functions
-
md_check
Check markdown link validity
-
semantic-query
AI-powered schema validation with automatic JSON generation for type-safe responses
-
orgflow
managing documents with support for tasks and notes
-
pdfgen
PDF rendering library
-
bwrap
A fast, lightweight, embedded systems-friendly library for wrapping text
-
hat-splitter
HAT splitter
-
postcode_extractor
extract and identify postcodes
-
libchai
汉字编码优化算法
-
mdbook-cat-prep
a preprocessor for mdbook which provides teacher, subject, material and tag functionality
-
rzozowski
A regex crate using Brzozowski derivatives
-
duvet
A requirements traceability tool
-
ebg
Eric's Blog Generator, a simple static site generator
-
stenotype
Machine stenography primitives
-
pulldown-cmark-toc
Generate a table of contents from a Markdown document
-
shift_or_euc
Detects among the Japanese legacy encodings
-
furze
finite state transducers (fst) writen in rust
-
regex-charclass
Manipulate and convert regex character classes
-
rexify
trait based matcher alternative to Regex
-
verugent
Verilog: Hardware Discription DSL library
-
term_grid
formatting strings into a grid layout
-
skyspell_kak
skyspell - kakoune integration
-
colored_text
adding colors and styles to terminal text
-
ucfirst
Uppercase the first letter of a string
-
runiq
An efficient way to filter duplicate lines from input, à la uniq
-
yamake
yet another make tool
-
speedreader
A command-line speed reading tool
-
purlu
A full-text search engine
-
feedr
feature-rich terminal-based RSS/Atom feed reader written in Rust
-
utf-64
The next-generation text encoding standard using 64 bits per character
-
askama-markdown-cmark
Askama filter for markdown, using pulldown-cmark
-
inflector-plus
Adds String based inflections for Rust. Snake, kebab, camel, word, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
aranya-policy-text
The Aranya Policy Language textual types
-
mdbook-linkcheck
A backend for
mdbookwhich will check your links for you -
mdbook-hints
mdBook preprocessor to add hover hints to your book
-
count-md
configurable command-line tool and Rust library for Unicode-aware, Markdown-aware, HTML-aware word counting in Markdown documents
-
advent-ocr
Converts ASCII-art representations of letters generated by Advent of Code puzzles into a String containing those letters
-
ised
An interactive tool for find-and-replace across many files
-
herring-automata
Automata construction for Herring
-
bump-bin
Increments version with semver specification
-
unsafe-tools-mimic
Size and alignment matched opaque types
-
extstd
intended as an extension of the standard library
-
deucalion
High-performance Windows library for capturing decoded FFXIV packets
-
like
A SQL like style pattern matching
-
wikidot-normalize
provide Wikidot-compatible string normalization
-
codebook_config
Configuration handling for the Codebook spell checker
-
matrix-ui-serializable
Opinionated abstraction of the matrix-sdk crate with serializable structs
-
unicode-intervals
Search for Unicode code points intervals by including/excluding categories, ranges, and custom characters sets
-
overlap-chunk
splitting text into chunks of specified size with adjustable overlap percentage
-
text-editing
string with utilities for editing
-
openapi-nexus-config
OpenAPI 3.1 to code generator
-
files-to-text-cli
Files to text cli
-
minigrep_pack
A small clip clone of minigrep
-
meme_generator_memes
Meme generator built-in memes
-
loki_text
advanced string manipulation with pattern searching and replacement capabilities
-
ewts-cli
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (cli)
-
line-numbers
Find line numbers in strings by byte offsets, quickly
-
markdowndown
acquiring markdown from URLs with smart handling
-
pink_accents
Replacement of patterns in string to simulate speech accents
-
src2md
Turn source code into a Markdown document with syntax highlighting, or extract it back
-
url-cleaner-engine
The engine behind URL Cleaner
-
sniffer-rs
that simplifies fuzzy string matching in rust
-
vidyut-lipi
A Sanskrit transliterator
-
shimmytok
Pure Rust tokenizer for GGUF models with llama.cpp compatibility (SentencePiece + BPE)
-
translit
Transliteration for cyrillic languages that uses officially approved rules
-
stringmatch
Allow the use of regular expressions or strings wherever you need string comparison
-
hangeul
Korean alphabet manipulation library
-
reason-shell
Reason: A Shell for Research Papers
-
cli-boxes
Unicode box drawing characters for creating beautiful CLI interfaces
-
csml_interpreter
The CSML Interpreter is the official interpreter for the CSML programming language, a DSL designed to make it extremely easy to create rich and powerful chatbots
-
scraps_libs
Scraps is a portable CLI knowledge hub for managing interconnected Markdown documentation with Wiki-link notation
-
css_recess_order
Recess-based sort order for CSS properties
-
pukram2html
converting Pukram-formatted text to HTML
-
ftb
A fast CLI tool to format and align Markdown tables
-
sansaccent
Convertit les chaînes françaises en slugs URL-friendly en supprimant accents et caractères spéciaux
-
filecheck
writing tests for utilities that read text files and produce text output
-
ferret
A trigram-based tool for detecting similarity in groups of text documents or program code
-
harfshapedfa
Glue functionality between skrifa & harfrust
-
annoying
others and yourself
-
lformat
Clone of Lua
string.formatin Rust based on Cs(n)printf -
luau-lexer
A lexer for the luau language
-
re_view_text_document
view that shows a single text box
-
unified-diff
GNU unified diff format
-
uniquewords-rs
Count the frequencies of words in text file(s) or stdin
-
analiticcl
approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation
-
viddy
A modern watch command
-
expression_parser
minimal Rust parser and evaluator for arithmetic expressions with variables, built using the pest parsing library. It converts a text formula into an abstract syntax tree (AST) and…
-
codump
A straightforward and flexible code/comment dump tool
-
eddie
Fast and well-tested implementations of edit distance/string similarity metrics: Levenshtein, Damerau-Levenshtein, Hamming, Jaro, and Jaro-Winkler
-
trezoa-offchain-message
Trezoa offchain message signing
-
clipcount
Counting words from the clipboard content
-
gspell
Rust bindings for gspell
-
colored-hexdump
Create beautifuly colored hexdumps
-
anda_nexus
Anda -- an AI Nexus powered by the Knowledge Interaction Protocol (KIP)
-
commit_crafter
AI powered tool for Git commit message generator
-
lemmeknow
Identify any mysterious text or analyze strings from a file
-
zz-data
Data structures for Zanzarah apis
-
cymbal
search for symbols in a codebase
-
minigrep_xyz1
For finding strings in text
-
shellvetica
Bring your shell output to the web
-
unicount
Alphabetic counter supporting unicode
-
learner
learning stuff
-
pullup
Convert between markup formats
-
indentsort
Structure-preserving sorting of arbitrary indented text
-
wdl-lint
Lint rules for Workflow Description Language (WDL) documents
-
statisk
opinionated static site generator
-
string-width
Accurate Unicode string width calculation for terminal applications, handling emoji, East Asian characters, combining marks, and ANSI escape sequences
-
santoka
Translations of 668 of Taneda Santoka's free-verse haiku
-
vn-settings
Various settings intended to simulate visual novels
-
ident_case
applying case rules to Rust identifiers
-
probly-search
A lightweight full-text search engine with a fully customizable scoring function
-
mdbook-fs-summary
Summary generator for mdbook
-
yara-x-fmt
A code-formatting library for YARA rules
-
hexxy
all over the place!
-
chonkier
🦛 Chonkie, now in Rust 🦀: No-nonsense, ultra-fast, ultra-light chunking library
-
clipboard-stream
Async stream of clipboard change events
-
tracery
Text-expansion library
-
ndg-commonmark
Flavored CommonMark processor for Nix-related projects, with support for CommonMark, GFM, and Nixpkgs extensions
-
stam
powerful library for dealing with stand-off annotations on text. This is the Rust library.
-
eco
reasoning about breaking changes in Rust ecosystems
-
enum-ts
TypeScript Enum pattern matcher codegen
-
interslavic
in rust
-
case_clause
case clause macro for rust
-
tkrar
Count frequency of words in a file or a directory
-
unicode-language
detect language coverage given a list of codepoints
-
miku_best_friend_printer
Contains a MikuPrinter that can produce an ASCII art of Miku - your best friend - from the music video 'Monitoring (Best Friend Remix)' by DECO*27
-
europa
A lightweight AI utilities library for Rust
-
cron_clock
A cron expression parser and schedule explorer. Rich documentation and case studies and related upper-level libraries are available.
-
pretty_regex
🧶 Elegant and readable way of writing regular expressions
-
mdbook-quiz-schema
Schema for quizzes used in mdbook-quiz
-
zine
opinionated tool to build your own magazine
-
linebreak
breaking a given text into lines within a specified width
-
libphext
A rust-native implementation of phext
-
typoglycemia
A function to convert text to typoglycemic format
-
cwc
A word counter utility that properly handles CJK and Unicode text
-
mpdf
writing PDF files
-
languages
GitHub's language data, compiled into a tiny, fast Rust library
-
nsys-curses-utils
Rust *curses utilities
-
varion
텍스트 기반 DSL로 구성된 대화 시스템 파싱 라이브러리
-
my-notes
note taking app for taking hierarchical notes in markdown
-
mdbook-presentation-preprocessor
A preprocessor for utilizing an MDBook as slides for a presentation
-
spongebob
convert text to spongebob case a.k.a tHe MoCkInG sPoNgEbOb MeMe
-
grapheme-utils
Handy utils for working with utf-8 [unicode] Extended Grapheme Clusters
-
thoth-note
note-taking app written in Rust
-
regex_ansi
ANSI escape sequence regular expression (Rust port of ansi-regex behavior)
-
yuuang-test-napi
N-API bindings
-
block-list
A minimalist hosts-based tool for managing block lists and ad-blocking
-
ohos-ime
Bindings to the
inputmethodAPI of OpenHarmony -
linecache
高效读取文本文件,并自动缓存。Efficiently read text files and automatically cache them
-
opentalk-types-signaling-whiteboard
Signaling types the OpenTalk whiteboard module
-
unescape
Unescapes strings with escape sequences written out as literal characters
-
mcat
a powerfull extended cat command, to cat all the things you couldn't before
-
htmls
parsing HTML and extracting HTML elements or text
-
pomsky-bin
Compile pomsky expressions, a new regular expression language
-
etch
Not just a text formatter, don't mark it down, etch it
-
uwurs
UwUify your strings with uwurs!
-
dissolve-python
dissolve deprecated calls in Python codebases
-
vectorscan-rs
Ergonomic bindings to the Vectorscan high-performance regex library
-
voirs-g2p
Grapheme-to-Phoneme conversion for VoiRS speech synthesis
-
ystd
An opinionated and batteries included
stdmirror for convenient, correct code and pleasant error messages -
fast-str
A flexible, easy-to-use, immutable, efficient
Stringreplacement for Rust -
icondata_zi
Zi icon set for the icondata family of crates
-
html2pango
convert html to pango
-
galaxy_save_core
Defining common types and traits used in the save data of Super Mario Galaxy and Super Mario Galaxy 2
-
libappindicator-zbus
zbus implement for libappindicator
-
glk
Bindings for the Glk I/O interface for hosting interactive fiction interpreters
-
evp
work with evidence packages (*.evp)
-
input_py
you can use input like python3
-
textwrap-macros
procedural macros to use textwrap utilities at compile time
-
quranize
Encoding transliterations into Quran forms
-
charname
Incredibly simple library that just gives you the Unicode name for a character
-
tectonic
A modernized, complete, embeddable TeX/LaTeX engine. Tectonic is forked from the XeTeX extension to the classic “Web2C” implementation of TeX and uses the TeXLive distribution of support files.
-
twitter_text_config
Configuration for twitter-text in Rust
-
string-replace-all
String replacement utility inspired by JavaScript, allowing pattern-based substitutions with support for both exact matches and regex patterns
-
ansi-align
Text alignment library with ANSI escape sequence and Unicode support
-
array_tool
Helper methods for processing collections
-
koji
An interactive CLI for creating conventional commits
-
processors-rs
Embed anything at lightning speed
-
pager2
pipe your output through an external pager
-
gofmt
implemention
-
convert_case_extras
Extra features for convert_case
-
nova-forms
Build online forms with ease
-
xerg
Ultra-fast grep implementation in Rust - built for maximum speed with direct output and parallel processing
-
mdbook-typst-pdf
mdbook typst pdf backend
-
leptos_forge_ui_components
Components you can use in your leptos_forge application
-
ezemoji
Catigoryized Emoji's
-
promptify
A plaintext directory formatting tool for interacting with LLMs on the command line
-
tantivy-tokenizer-api
Tokenizer API of tantivy
-
modeling
tools to analysis different languages by Ctags
-
mind-tree
Organize your thoughts in a tree-like structure
-
asimov-module-cli
ASIMOV Module Command-Line Interface (CLI)
-
ast-grep-lsp
Search and Rewrite code at large scale using precise AST pattern
-
spcfile
.spc file format reading and writing
-
snailquote
Escape and unescape strings with shell-inspired quoting
-
rust_readability
A package to assess the complexity of texts using a variety of readability formulas
-
epcmanager
EPC text tool for RFID
-
opentalk-report-generation
OpenTalk report generation functionality
-
linurgy
Manipulate the output of multiple newlines. Replace/Insert/Append newlines with text. Input and output from stdio/files/buffers
-
zenpatch
A robust library for applying text-based patches, designed for AI coding agents with backtracking algorithm
-
qm-role-build
Role builder from markdown tables
-
md-kroki
Render kroki diagrams inside markdown
-
copier
A bidirectional file content aggregator and extractor that converts between files and markdown-formatted representations
-
streampager
pager for command output or large files
-
ascii_help
help you quickly convert ASCII codes
-
easy-regex
Make long regular expressions like pseudocodes
-
wildcard-trie
A space-efficient router / radix-trie that supports wildcards
-
fortune-rs
classic BSD fortune program
-
substring-replace
developer-friendly methods to manipulate strings with character indices
-
record-query
doing record analysis and transformation
-
tergo-formatter
Formatter for tergo
-
regex_generate
Use regular expressions to generate text
-
scan-rules
some macros for quickly parsing values out of text. Roughly speaking, it does the inverse of the print!/format! macros; or, in other words, a similar job to scanf from C.
-
afrim-translator
Manage the predication system of the afrim input method
-
rust_file_encode_mode_convert
这是一个rust的库,用于检测文件的编码格式。 支持GBK,GBK2312 , UTF8, UTF16LE, UTF16BE, UTF8+BOM,UTF32 等多种编码格式。
-
byteutils
that provides a collection of frequently used utility functions for working with bytes, strings, and vectors. It includes common tasks such as converting between strings and byte arrays…
-
turbo-json-checker
A pushdown automaton low memory JSON bytes stream checker returning the JSON root-type followed by its start and end index in the Reader
-
riimut
Transform latin letters to runes & vice versa
-
helia-strings
string storage and retrieval for Helia IPFS nodes
-
safe-string
safe interface for interacting with multi-byte strings in Rust, namely IndexedStr, IndexedString, and IndexedSlice
-
kotoba-formatter
Kotoba Code Formatter - Formats .kotoba files with consistent style
-
cesu8
Convert to and from CESU-8 encoding (similar to UTF-8)
-
book2pdf
CLI utility to turn published code documentation into PDFs
-
uclanr
A random word picker that gives you actually useful words
-
microcad-builtin
µcad Builtin Library
-
markdown-extract
Extract sections of a markdown file
-
csmlinterpreter
The CSML (Conversational Standard Meta Language) is a Domain-Specific Language developed for creating conversational experiences easily
-
aimds-detection
Fast-path detection layer for AIMDS with pattern matching and anomaly detection
-
mdbook-translator
A translation preprocessor plugin for mdBook that automatically translates documents using the DeepSeek API
-
gsm7-alt
GSM7 encoding and decoding by integer
-
mdbook-open-on-gh
mdbook preprocessor to add a open-on-github link on every page
-
maybe-regex
Wrapper for strings that may be either a regex or a plain-text string
-
maddi-recipe
parsing and scaling markdown recipes
-
cloc
Count, or compute differences of, lines of source code and comments
-
rst_parser
a reStructuredText parser
-
vtashkov-bf
Brainfuck interpreter
-
ngrammatic
Character-oriented ngram generator and fuzzy matching library
-
toolstr
formatting data into String's
-
artificial-prompt
Fluent builders and helpers for composing markdown prompt fragments
-
aho-corasick
Fast multiple substring searching
-
rargs
xargs + awk with pattern matching support
-
pandoc
API that wraps calls to the pandoc 2.x executable
-
bitutils2
A package of tools for bit manipulations, including bit indexing, bitfields, and a variation of regular expressions for binary data
-
pdfcr
render a codebase to a pdf
-
kathoey
text feminization using open corpus linguistics data
-
file-editor
Clean, elegant API for creating and editing text files
-
hebrew_unicode_utils
Some functions for processing Hebrew unicode characters
-
subslay
Text → emoji 💅🏻 Powered by Rust
-
kmdbook-driver
High-level library for running mdBook
-
biometrics
provide the vitals of a process in the form of counters, gauges, moments, and T-digests
-
arabic_pdf_to_text
A CLI tool to convert Arabic PDFs to text using Google's Gemini API
-
pretty-xmlish
Pretty print XML-ish data with unicode art
-
human-time-cli
A command-line tool for converting time durations to human-readable formats, built using the
human-timecrate -
cotitch
concatenate project files into a single Markdown file
-
tantivy-stemmers
A collection of Tantivy stemmer tokenizers
-
whichlicense_detection
detect licenses used by the WhichLicense project
-
utf16_iter
Iterator by char over potentially-invalid UTF-16 in &[u16]
-
evtx-msg
enrich evtx entries with messages
-
tcalc-rustyline
A fork of Rustyline for use specifically with tcalc
-
md2adf
Markdown to Atlassian Document Format converter
-
arinamcnulty-markdown-parser
Markdown parser - university project
-
mdbook-inline-highlighting
mdBook preprocessor that enables support for inline highlighting
-
mdi
markdown include
-
gigagei
random quote fetching console utility
-
text-tokenizer
Custom text tokenizer
-
cosmic-text
Pure Rust multi-line text handling
-
asimov-x-module
ASIMOV module
-
yozuk
Chatbot for Programmers
-
skill-tree
generate graphviz files to show roadmaps
-
fx-mistral
leverage the Mistral API for OCR and data extraction from PDFs
-
avatarsay
Beautiful quotes from Avatar: The Last Airbender
-
ingrid_core
Crossword-generating library and CLI tool
-
nu_plugin_emoji
a nushell plugin called emoji
-
rosie
Interface for the Rosie Pattern Language, for efficient and maintainable text pattern matching and search
-
antex
Styled text and tree in terminal
-
mdbook-blush
An mdBook small-caps preprocessor
-
thesaurus
An offline thesaurus library for Rust
-
keep-a-changelog
generating and manipulating CHANGELOG.md files that use the Keep A Changelog format
-
mdbook-nice
A mdbook plugin to add nice css to your book
-
glyphana
Quickly find, inspect & collect unicode glyps
-
real_time_note_taker
A terminal UI tool to take time stamped notes in real time
-
transportations_library
A comprehensive Rust-based library implementing transportation engineering methodologies (e.g. the Highway Capacity Manual (HCM)) with Python bindings.
-
weasel-gen
Random ascii generation animation until target string is met
-
mdbook-ocirun
mdbook preprocessor to run arbitrary commands and code snippets inside containers
-
waterui-str
String utilities for WaterUI
-
rsrusl
A really simple useful library ported to Rust
-
rustruut
Text-to-IPA converter and phonetic translator for Rust, powered by the Goruut phonemization engine
-
utf8-fix
Fix invalid UTF-8 sequences in-place while preserving buffer size - useful for fuzzing and mutation testing
-
rakugaki
rendering TTF/OTF font characters as ASCII art in the terminal
-
basen
Convert binary data to ASCII with a variety of supported bases
-
quickmark-core
Lightning-fast Markdown/CommonMark linter core library with tree-sitter based parsing
-
ascii_table_rs
Elegant ASCII table renderer for Rust CLI and terminal apps
-
ilyvion-util
Collection of utility functions and types for use in my personal projects
-
sesters
💱 Fast, offline currency converter 💴 💷 💶 💵
-
justcsv
CSV-file reader/writer
-
rust_iso15924
ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts (a "set of graphic characters used for the written form of one or more languages")…
-
agnostic-levenshtein
Levenshtein distance for ASCII or Unicode strings
-
libcommand
creating ascii command interpreters with builtin argument-splitting
-
mdlib
A beautiful markdown note-taking application
-
taboc
A table of contents generator for markdown documents
-
typos-cli
Source Code Spelling Correction
-
onig_sys
onig_syscrate contains raw rust bindings to the oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma… -
text-document
Text document structure and management
-
rust-functions
A collection of Rust utility functions (starting with format_number)
-
sayit
String replacements using regex
-
prefix-file-tree
Predictable file directory structures
-
glow_glyph
A fast text renderer for glow, powered by glyph_brush
-
yy1
Tiny utility to convert KiCad centroid files into Neoden YY1 pick and place machine format
-
pangu2
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
simple-xlsx-writer
memory efficient XLSX writer
-
fast_trie
A memory efficient trie library
-
pukram-formatting
A type to represent the formatting of the pukram markup language
-
utf58
High-tech encoding of the Unicode space in one quibble and up to 3 bytes
-
genpdfi
User-friendly PDF generator written in pure Rust
-
scanlex
lexical scanner for parsing text into tokens
-
goofy-animals
Generate a name in adjective-adjective-animal form
-
orly
Download O'Reilly books as EPUB
-
ojcmp
online judge comparer
-
scanix
search a text or pattern in files. A fast and lightwight text tool.
-
awful_book_sanitizer
CLI to clean up OCR-mangled book excerpts into readable text using OpenAI-compatible APIs
-
viks
vim-like key crate
-
blocks
A block editor library that converts to Markdown and HTML
-
rust-tfidf
calculate TF-IDF (Term Frequency - Inverse Document Frequency) for generic documents
-
textgridde-rs
dealing with Praat TextGrid files. MIT licensed.
-
what-rs
Identify what something is! A pyWhat reimplementation in Rust
-
tectonic_bridge_core
Exposing core backend APIs to the Tectonic C/C++ code
-
mdbook-summary
Summary parser for mdBook
-
rsnltk
Rust-based Natural Language Toolkit
-
afsort
American Flag sort implementation for faster sorting of Strings
-
diamond-types
The world's fastest text CRDT
-
ipa-translate
translating between IPA and ASCII text
-
loe
Very fast and yet another line ending (CRLF <-> LF) converter written in Rust
-
dingtalk
Robot Util, Send text/markdown/link messages using DingTalk robot, 钉钉机器人
-
magic-regexp
creating regular expressions with ease
-
readability-js
wrapper for Mozilla's Readability.js library
-
chatter
Translate a human-readable dialogue script into Rust
-
lowcharts
draw low-resolution graphs in terminal
-
convert-to-spaces
Convert tabs to spaces in a string
-
r-matrix
Rust port of cmatrix
-
ratwenture
atventure clone
-
animated-emojis-rs
Noto Animated emojis
-
mdbook_header_footer
mdBook preprocessor to prepend header and append footer to certain chapters
-
nstd_core
NSTD core crate
-
puppet-fmt
Automatic code formatter for puppet manifests
-
srx
A mostly compliant Rust implementation of the Segmentation Rules eXchange (SRX) 2.0 standard for text segmentation
-
ada-idna
Fast IDNA (Internationalized Domain Names in Applications) implementation in Rust
-
xi-rope
A generic rope data structure built on top of B-Trees
-
filenamify
Convert a string to a valid filename
-
mdbook-toc
mdbook preprocessor to add Table of Contents
-
anaso_site_api_models
API models for Ana.so
-
quake_text
Utils for Quake strings and characters
-
freetype
Bindings for Freetype used by Servo
-
mdplayscript
An extension of Markdown for play scripts
-
screenplay-doc-parser-rs
Tools to parse Screenplay-formatted documents into semantically-typed structs
-
lsp-ty
type definitons for LSP
-
deindent
A command line utility and Rust library to format overly-indented text
-
byte_string
Wrapper types for outputting byte strings (b"Hello") using the Debug ({:?}) format
-
kl-hyphenate
Knuth-Liang hyphenation for a variety of languages
-
markdown-toc
Markdown Table of Contents generator
-
aki-mcycle
mark up text with cycling color
-
tengwar
Transliterate text into J.R.R. Tolkien's Tengwar.
-
badascii
Backend rendering library for BadASCII diagrams. Block diagrams in ASCII.
-
yara-x-cli
A command-line interface for YARA-X
-
cgp-dispatch
Extensible data type dispatchers for CGP handlers
-
string-overlap
A helper crate for "layering" ASCII art
-
prometheus-32bitfix
Prometheus instrumentation library for Rust applications
-
asciidoctor-client
A kludge to improve the performance of static site generators that use asciidoc through its cli
-
strloin
copy on write slices of a string
-
vregex
Regular Expression engine
-
gpui_util
A collection of utility structs and functions used by Zed and GPUI
-
substr-iterator
Substring extractor based on characters without allocation
-
magic_string
magic string
-
asoiaf-api
Rust wrapper for the api of ice and fire
-
krile
A static site generator
-
regex-chunker
Iterate over the data in a
Readtype in a regular-expression-delimited way -
subtitles
A command line programme to generate SRT format subtitles for use in video players
-
smart-patcher
Patcher based on rules
-
timug
It has been created for personal blog creation purpose. Timus has its limits, but it fulfills the purposes for which it was created.
-
fontheight-cli
Find out the vertical extents your font reaches on shaped words
-
lindera-ipadic-builder
A Japanese morphological dictionary builder for IPADIC
-
nlprule
A fast, low-resource Natural Language Processing and Error Correction library
-
dianac
An emulator, compiler, and interpreter for the Diana Compiled Language
-
typship
A cli for typst packages
-
caseless
Unicode caseless matching
-
lingua-french-language-model
The French language model for Lingua, an accurate natural language detection library
-
mdzk
Plain text Zettelkasten based on mdBook
-
slugify
Macro for flexible slug generation
-
directwrite
A safe abstraction for interacting with DirectWrite, intended initially to be used with direct2d for easy text rendering
-
key-lang
To be the prettiest and simpliest script
-
u8char
An alternative to the char type that uses UTF-8 as its in-memory representation
-
rslug
fast, and configurable library to create URL-friendly slugs from strings
-
html-index
Generate an HTML index
-
mdbook-driver
High-level library for running mdBook
-
doc_loader
A comprehensive toolkit for extracting and processing documentation from multiple file formats (PDF, TXT, JSON, CSV, DOCX) with Python bindings
-
ethan-rs-wc
The ethan-rs-ws(erwc) is word, line, character, and byte count. Like wc command but not just wc command, more accurate and faster. Text can also be read from standard input for statistics.
-
man
Generate structured man pages
-
gdnative-doc
Documentation tool for gdnative
-
fsays
flavored replacement for the classic cowsay
-
hns
Human numeric sorting program — does what
sort -his supposed to do! -
readable-regex
Regex made for humans. Wrapper to build regexes in a verbose style.
-
stego_wps
text-based steganography using the Words Per Sentence (WPS) method. It allows encoding and decoding hidden messages within the structure of a given text.
-
laser-pdf
programmatic PDF generation with precise, predictable layout control
-
tagalyzer
A CLI tool to gather statistics on collections of plaintext-adjacent files
-
iregex
Intermediate representation for Regular Expressions
-
neo-mime
Strongly Typed Mimes
-
clarifai_grpc
The official Clarifai gRPC Rust client
-
swc_ecma_regexp_ast
AST definitions of ECMAScript regular expressions
-
catalog-of-markdown
Generate the catalog of markdown file
-
lexical-sort
Sort Unicode strings lexically
-
unicode-canonical-combining-class
Fast lookup of the Canonical Combining Class property
-
text-parsing
Hierarchical text processing preserving char position info
-
svgbob_cli
Transform your ascii diagrams into happy little SVG
-
cglue-bindgen
cleanup cbindgen headers for CGlue
-
ttpkit-utils
Various utilities for ttpkit
-
rehuman
Unicode-safe text cleaning & typographic normalization for Rust
-
bfom-lib
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
bstr
A string type that is not required to be valid UTF-8
-
mdast_util_to_markdown
Markdown to AST
-
vndb_tags_get
convert VNDB tag list (JSON to markdown)
-
libretranslate
A wrapper for the LibreTranslate web API
-
mdbook-auto-gen-summary
A preprocessor and cli tool for mdbook to auto generate summary
-
tpt
Pure Rust implementation of the Unix concatenate (cat), word-count (wc) and echo command
-
opstr
‘Operate on strings’ command line utility
-
steer-tui
Terminal UI library for Steer
-
sublime_fuzzy
Fuzzy matching algorithm based on Sublime Text's string search
-
cistring
A string type that preserve case, but compares insensitiveley
-
ascii-img
Convert images to ASCII
-
tectonic_engine_bibtex
The
bibtexprogram as a reusable crate -
natural-sort-rs
sorting strings based on Natural sort order
-
recvmsg
Traits for receiving datagrams reliably, without truncation
-
pithy
Ultra-fast, spookily accurate text summarizer that works on any language
-
somedoc
A very simple document model and markup generator
-
unicode-width-16
Determine displayed width of
charandstrtypes according to Unicode Standard Annex #11 rules -
ai-context-gen
A context generator for Rust repositories that creates structured markdown files with relevant information for LLMs and AI agents
-
hunspell-rs
Rust bindings to the Hunspell library
-
bgmtv
An API wrapper for bangumi.tv based on reqwest
-
fmtt
A diff-friendly text formatter that breaks lines on sensible punctuations and words to fit a line width
-
html-linter
An HTML linting library for checking HTML structure and semantics
-
codetypo-dict
Source Code Spelling Correction
-
chaf
A CLI filter tool that removes lines based on logical conditions using AND, OR, NOT, and parentheses
-
poriborton
Interconversion between Unicode and various Bengali ANSI encodings
-
ezstr
A String wrapper supporting negative indexing with grapheme indexing for slices and Regex::find_iter and Regex::find
-
gh-emoji
Convert
:emoji:to Unicode using GitHub’s emoji names -
flatlake
In the works
-
tweet-memo
A CLI tool to record Twitter-style short memos in Markdown files
-
rusk
a Specification Language
-
ncase
Enforce a case style
-
convert_string
A trait to convert Strings to safe non-keywords and/or convert a Strings case (snake_case, PascalCase, ...)
-
pulldown-cmark-mdcat
Render pulldown-cmark events to TTY
-
kbremap
Custom keyboard layouts for windows
-
mdopen
Preview markdown files in a browser
-
rumage
framework for making simple markdown sites
-
japanese-text
日本語テキスト正規化ライブラリ - 全角半角変換、ひらがなカタカナ変換
-
pandoc-ac
pandoc filter for converting acronym codes to LaTeX
-
stylish-plain
stylishhelpers for discarding styles -
ean-rs
generating and validating EAN barcodes
-
uapi-version
Compare versions according to the UAPI Version Format Specification
-
vestalia
Vestaboard API wrapper
-
getch-rs
getchfor Windows and Unix -
markdown-live-preview
A live Markdown preview server with WebSocket and HTML rendering
-
ftrace
trace files and paths
-
mutf8
Basic support for mutf8 strings
-
sbert
Sentence Bert (SBert)
-
bundle_repo
Pack a local or remote Git Repository to XML for LLM Consumption
-
zenbase-llml
Lightweight Language Markup Language - converts data structures to XML-like markup
-
boykisser
A collection of compile time ASCII art and images, especially boykisser
-
jira-clean
clean up Jira task description that is an output of jira-cli tool
-
choco
markup language for dialogue systems
-
strval
Parse strings into values
-
justify
plaintext while handling Unicode gracefully
-
ccase
Command line interface to convert strings into any case
-
safe-regex-compiler
Regex compiler for the safe-regex crate
-
rsmorphy
Morphological analyzer / inflection engine for Russian and Ukrainian (soon) languages (WIP)
-
eldenring
Raw structures and bindings for From Software's title Elden Ring
-
yeslogic-fontconfig-sys
Raw bindings to Fontconfig without a vendored C library
-
line-rs
Extract lines from files without hacks!
-
csv-groupby
execute a sql-like group-by on arbitrary text or csv files
-
letter-sequence
A method to create sequence displayed as uppercase or lower letters, or digits
-
boreal-cli
CLI utility to run boreal, a YARA rules engine
-
textnonce
Text based random nonce generator
-
reggy
friendly, resumable regular expressions for text analytics
-
abacus-rs
A simplified plain text cli accounting tool
-
linkcheck2
extracting and validating links
-
noctisroll
Text-based TRPG dice rolling system
-
mdbook-ai-pocket-reference
mdbook preprocessor for the ai-pocket-reference project
-
coverio
Better code coverage reporting for Rust crates
-
random-zh
generating random Chinese characters
-
waterui-text
Text and typography components for WaterUI
-
sakurs-cli
Command-line interface for Sakurs sentence boundary detection
-
re_view_text_log
A view that shows text entries in a table and scrolls with the active time
-
zettel-core
Core library for Luhmann-style Zettelkasten management
-
rfc9839
RFC 9839 specification
-
fuse-rust
Fuse is a super lightweight library which provides a simple way to do fuzzy searching. Fuse-Rust is a port of Fuse-Swift, written purely in rust
-
base64-lib
A base64 encoder / decoder
-
wildcard_ex
extended wildcards that allows VB-like specifications
-
repub-rs
binary for converting mhtml webpages into remarkable-style summarized epubs
-
wdict
Create dictionaries by scraping webpages or crawling local files
-
jq-rs
Run jq programs to extract data from json strings
-
mention
code highlighter
-
notion2md
converting Notion pages to Markdown
-
unveil-rs
Unveil Rs is a tool to create presentations from markdown files
-
mdbook-metadata
mdBook preprocessor to parse markdown metadata
-
typope
Pedantic source code checker for orthotypography mistakes and other typographical errors
-
matchpick
Find and replace multi-lines using a match-case
-
detect-newline-style
Determine a string's preferred newline character
-
md-to-incodoc
Convert markdown to incodoc
-
zhconv-cli
Convert Traditional/Simplified Chinese and regional words of Taiwan/Hong Kong/mainland China/Singapore based on Wikipedia and OpenCC rulesets 轉換中文簡體、繁體及兩岸、新馬地區詞,基於維基百科和…
-
asciimath-unicode
Convert asciimath to unicode
-
character-set
High performance
set.contains(char) -
libharu
Rust binding for libharu(http://libharu.org/) PDF library
-
easy_io
Fast and dead-simple IO for competitive programming in Rust
-
cargo-markdown
Local crates.io readme development server with ultra-fast hot reloading goodness
-
caseformat
Power flow case data format
-
latex
An ergonomic library for programatically generating LaTeX documents and reports
-
base256emoji
A multibase base256emoji implementation
-
diff-man
diff utility lib
-
plagiarismbasic_lib
Basic plagiarism checker written in Rust
-
pinyin-sort
sort pinyin
-
poly_doctest
Polyglot documentation snippet test generator - extract code snippets from docs and generate tests
-
escrit
learning languages by reading texts
-
superfold
A multilingual Rust library and CLI to process UTF-8 strings to exclude diacritics and fold non-phonetic graphemes into their phonetic ASCII representation
-
highly_sendable
Containers for sending Strings and results within your process and beyond
-
ari
ari. the extras library.
-
hfs_nfd
Handle Apple's unique NFD-like Unicode normalization, which is used in HFS+, in Rust
-
liwe
IWE core library
-
ramp-text
Pure Rust
-
ib-unicode
Unicode utils
-
crowbook-text-processing
some utilities functions for escaping text (HTML/LaTeX) and formatting it according to typographic rules (smart quotes, ellipsis, french typograhic rules)
-
assemblage_db
Distributed Document/Graph DB for Connected and Overlapping Pages
-
wtf8-rs
WTF-8 encoding
-
rust_tokenizers
High performance tokenizers for Rust
-
ob
A Blog and RSS system written in Rust
-
ya_advent_lib
Advent of Code Library
-
mdbook-infisearch
InfiSearch plugin for Mdbook
-
synthphonia-rs
DryadSynth solver for string transformation programs
-
mdbook-external-links
Open external links inside your mdBooks in a different tab
-
createqrcode
A CLI tool to convert data into QR codes
-
zalgo-codec-common
Convert an ASCII text string into a single unicode grapheme cluster and back
-
csvpeek-rs
A CLI tool to quickly peek into, list, and filter CSV data
-
parlib
A barebones parser combinator library, inspired by Haskells parsec
-
winsfs-cli
Site frequency spectrum estimation based on window expectation-maximisation algorithm
-
geoipsed
Inline decoration of IPv4 and IPv6 address geolocations
-
slicedisplay
Simplistic Display implementation for Vecs and slices
-
quillmark-typst
Typst backend for Quillmark
-
marktask
A CLI tool for parsing and manipulating Markdown tasks
-
character_converter
Turn Traditional Chinese script ot Simplified Chinese script and vice-versa and tokenize
-
kbnf-regex-automata
A forked version of regex-automata for kbnf
-
p4d-mdproof
Markdown to PDF converter
-
ironstorm_lookup
Lightning fast lookup table for auto completion, type ahead, suggestion engines
-
minigrep_101
Minigrep implementation of the Rust Book
-
rsonpath-lib
Blazing fast JSONPath query engine powered by SIMD. Core library of
rsonpath. -
searcher_txt
A copy of grep that I made to show that im bad at rust
-
tree-sitter-stack-graphs-python
Stack graphs definition for Python using tree-sitter-python
-
unicode_reader
Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters
-
topfew
CLI to find high frequency occurrences in structured text files
-
code-to-pdf
Generates a syntax-highlighted PDF of your source code
-
char-ranges
Iterate chars and their start and end byte positions
-
git2prompt
command-line tool that takes a GitHub repository URL, downloads its contents, and generates a single text file optimized for use as input to AI tools
-
east-asian-width
Determine the display width of Unicode characters in East Asian contexts
-
merge3
merge tool for three-way merges
-
binyl
A bitwise UTF-8 string inspection tool
-
gregex
Regex solver utilizing NFA
-
lookbook
Component preview framework for Dioxus
-
truncate_string_at_whitespace
Truncate a &str at the closest whitespace to a specified length with unicode safety
-
git-blamediff
A program to automatically annotate changes to a file in git(1)
-
rtss
A command-line tool to annotate stdout/stderr with elapsed times
-
cloudproof_anonymization
Cosmian Cloudproof Anonymization library
-
bbcode-tagger
BBCode tree parser and tagger
-
semchunk-rs
A fast and lightweight Rust library for splitting text into semantically meaningful chunks
-
unicode-vo
Unicode vertical orientation detection
-
mecab
Safe Rust wrapper for mecab a japanese language part-of-speech and morphological analyzer library
-
portmanteau
create portmanteaux
-
savagestr
SAVAGE string encoder/decoder. If can encode or decode by specifying the code page or the encoding name, it works, else it uses the savage way to encode/decode by using
String::from_utf8_lossy() -
path2regex
Express style path to RegExp utility
-
krilla-svg
Converting SVG files to PDF
-
replaxe
A command-line tool to replace text in files with easy patterns
-
text_unit
Newtypes for text offsets
-
subject-classifier
classifying a commit by it's subject
-
tiefdownlib
manage and convert TiefDown projects
-
turn-uppercase
Small command to uppercase text in command line and copy to clipboard
-
ranting
Linguistic formatting placeholder extensions for rust
-
bbx
A robust, performant BBCode pull parser
-
ib-pinyin
一个高性能拼音查询、匹配库
-
tmenu
TUI fuzzy finder
-
rustme
Automatically generate Rusty READMEs
-
arabic-script
An expressive API for the characters of the Arabic script
-
iconv-native
A lightweight text encoding converter based on platform native API or libiconv
-
ascii_converter
converting between different ascii representations
-
static-lang-word-lists
Runtime decompressed statically-included word lists
-
dec_from_char
Small library for converting unicode decimal into numbers
-
tgrep
Toy grep that honors .gitignore
-
sms_splitter
An SMS message splitter and part calculator with support for GSM and Unicode
-
ripmors
encoding and decoding international Morse code and several variants
-
quoted-string-parser
Quoted string parser for grammar defined in RFC3261
-
corpus-preproc
A preprocessor for text and HTML corpora
-
memkb
A local AI in memory RAG for markdown files in a folder exposed as MCP
-
mdbook-pagebreaks
A mdbook preprocessor to insert page breaks when rendering to HTML
-
esl01-renderdag
Render a graph into ASCII or Unicode text
-
mdbook-embed
A preprocessor that simplifies embedded URL
-
pgn-traits
Traits for chess variations that can be annotated with Portable Game Notation
-
mdbook-scientific
Enables inline equations for mdbook to set by $..$ signs and $$..$$
-
loro-internal
Loro internal library. Do not use it directly as it's not stable.
-
holy-carpet
customizable blog creator
-
kanuni
AI-powered legal intelligence CLI - The Ottoman Edition
-
fuzzy-datetime
Detects, completes and normalises fuzzy date and date-time strings for interoperability with chrono or direct output as ISO-8601-compatible strings
-
protokit_proto
Usable protocol buffers
-
docket
markdown to HTML documentation rendering
-
ansi-width
Calculate the width of a string when printed to the terminal
-
quickner-core
A fast and simple NER tool
-
rawstring
A string type that can hold data which may or may not be valid UTF-8
-
fastn-builtins
fastn: Full-stack Web Development Made Easy
-
context-notation
Featherweight semantic notation for text
-
fifthtry-mdbook
fork of mdbook, only for ft-cli
-
anslatortray
translate from English to Pig Latin!
-
see-cat
A cute cat(1)
-
bardic
A performance-focused toolkit for procedural narrative generation
-
bilingual
A cmdline tool used for markdown translation via calling Chinese translation api cloud services
-
raw-string
RawStr/RawString are like str/String, but without any guaranteed encoding
-
natural-date-rs
A parser to convert natural language date and time specifications into DateTime
-
kspconfigtool
KSP1 ConfigNode parser and block removal tool
-
asciifolding
ascii folding library
-
hina
:]
-
swot
community-driven or crowdsourced library for verifying that domain names and email addresses are tied to a legitimate university of college
-
hebrew_accents
finding, filtering, and displaying Hebrew accents, specifically focusing on the Tiberian accent system as documented by the Masoretes
-
mdbook-open-git-repo
mdbook preprocessor to add a open-on-git-repo link on every page
-
perm-text
curling straight/dumb quotation marks ("") and apostrophes (') into their curly/smart (“”’) equivalents
-
akiaki
A good old fashioned wiki engine with a flat-file database
-
fasta-cleaner
Transform fasta files by upper-casing all sequence characters and removing non-ACGT sequence characters
-
nonsense
Lorem ipsum placeholder text generator with clipboard integration
-
chicken_esolang
An interpreter for the esolang Chicken
-
gecliht
A disparate collection of text manipulation and formatting algorithms
-
rulet
figlet implementation
-
squishyid
Shorten and obfuscate IDs
-
awful_knowledge_synthesizer
Generate LLM-powered exam questions from YAML books, manpages, mdbooks, tealdeer pages, and code
-
xml1
sane, non compliant xml parser without allocations
-
owned_chars
Owned iterators with the same output as Chars and CharIndices
-
casespaetzle
This package introduces common identifier case traits and a macro for case trait generation
-
probe-code
AI-friendly, fully local, semantic code search tool for large codebases
-
mdbook-checklist
An mdBook preprocessor for generating checklists and indexes
-
langsan
sanitizing language model input and output
-
codebook_downloader
Dictionary downloading utility for the Codebook spell checker
-
tet_rs
A third-party implementation of Text Entry Throughput (ref. https://doi.org/10.1145/3290605.3300866) for Rust
-
kana-converter
converter for half-width/full-width Japanese language characters (katakana, hiragana, and ASCII)
-
textr
TeX-inspired plug-n-play interface for converting JSON documents into PDFs
-
simdutf
Unicode validation and transcoding at billions of characters per second
-
imperative
Check for imperative mood in text
-
grapheme
Abstractions for working with extended Unicode grapheme clusters
-
deck
A command line tool to generate HTML presentations from Markdown documents
-
blackboxmc_java
BlackboxMC bindings for java.util
-
mask-text
mask text with multiple masking options
-
zrbecker-minigrep
a mini version of the grep tool
-
udp-logger-rs
Log macro for log's kv-unstable backend and a UDP socket logger
-
yaml-include
A lib and a CLI for recursively parsing "!include" data in yaml files
-
mdbook-plugin-utils
mdBook plugins
-
speki-cli
cli version of speki
-
midstring
Create a string between two other strings, that is lexicographically halfway between them
-
gchemol-parser
Text parsing made simple
-
truecase
Restore correct letter casings in arbitrary text using a statistical model
-
lindera-cc-cedict-builder
A Chinese morphological dictionary builder for CC-CEDICT
-
text-size
Newtypes for text offsets
-
kyte
Extensible mechanics for operational transformation in Rust that are generic with respect to their value (not constrained to text), wire-compatible with Quill and fully fuzzed
-
hmd
Custom Markdown Engine for my personal blog
-
dbxcase
Dropbox-compatible case-folding algorithm
-
pseudo-cyrillic
Faux Russian typography
-
bottomify
Fantastic (maybe) CLI for translating between bottom and human-readable text
-
pillar
small tool to format lines into columns
-
enso-lazy-reader
An efficient buffered reader
-
az_minigrep
minigrep CLI program to search a query string from a file and print out the matched lines
-
quartz_commands
Generates a parser at compile-time for handling commands similar in structure to those of Minecraft
-
rustyink
Blazing fast static site generator
-
mdbook-dtmo
Creates a book from markdown files with added plugins
-
matcher_py
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
texting
string helpers
-
char_reader
Safely read wild streams as chars or lines
-
natord-plus-plus
Natural ordering for Rust
-
invisible-sun
program for generating svg "suns"
-
deliminator
Universal code documentation generator
-
grammateus
facilitate working with Ancient Greek words
-
dtools
A collection of tools for working with Commodore disk images
-
ptero-cli
A text steganography CLI tool for Social Media
-
fusefiles
Concatenate a directory full of files into a single prompt for use with LLMs
-
xee-ir
Xee intermediate representation and compilation to bytecode
-
stylish-style
Internal implementation details of
stylish-core -
sqdj
sqdj shortens delimited data
-
markov_str
Markov Chain implementation optimized for text generation
-
serde_ssml
A robust Rust library for parsing, manipulating, and generating Speech Synthesis Markup Language (SSML) documents
-
mango
programming language (UNDER CONSTRUCTION)
-
sonai_metrics
Text metrics for sonai
-
ascii_tree
generates ascii trees
-
r4d
Text oriented macro processor
-
uwu-rs
uwuifying library
-
sixbit
Small packed strings
-
naming_utils
generating naming conventions, pluralizing words, and rest api paths in Rust
-
bard
Creates PDF and HTML songbooks out of easy-to-write Markdown sources
-
unicode-normalization-alignments
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
gfm-autolinks
Parse GitHub Flavored Markdown autolinks
-
nfa_regex
NFA regex engine for text processing
-
mdbook-compress
Compress an mdBook project into a single PDF file
-
cursive_hexview
hexviewer for cursive
-
spandex-hyphenation
Knuth-Liang hyphenation for a variety of languages
-
g2-unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
valid_rust_char
A tiny library to check if a char it's valid in a rust file
-
opentalk-types-signaling-meeting-report
Signaling types the OpenTalk meeting-report module
-
strange
A static website generator
-
uwu_colors
language server to colorize hex color strings via textDocument/documentColor
-
win-utf8-rs
a function to enable UTF-8 for windows
-
nib
static site generator
-
bcdown
Bilibili漫画下载器,written in Rust,支持epub pdf zip格式
-
strip-prefix-suffix-sane
A small utility crate providing ‘sane’
strip_prefixandstrip_suffixmethods forstrthat returnselfinstead of anOptionwhen the prefix/suffix is not found -
fuzzywuzzy
A pure-Rust clone of the incredibly useful fuzzy string matching python package, FuzzyWuzzy
-
alass-ffi
FFI bindings for subtitle synchronization with alass
-
pdf_composer_definitions
PDF Composer definitions crate
-
curtana
Simplified zero-cost wrapper over llama.cpp powered by lama-cpp-2
-
hayro-font
A parser for CFF and Type1 fonts
-
inlinable_string
inlinable_stringcrate provides theInlinableStringtype – an owned, grow-able UTF-8 string that stores small strings inline and avoids heap-allocation – and theStringExttrait… -
cnpj
Brazilian CNPJ parsing, validating and formatting library
-
palpad
A really simple static site generator
-
m2h
Convert Markdown to HTML with syntax highlighting
-
kvu
The simplest command line tool to manage key-value pair lines
-
cfasttext-sys
fastText ffi binding
-
parser-cli
Command-line interface for extracting text from various file formats
-
xpath-cli
Evaluate XPath selectors on XML or HTML documents
-
unicode_extension
Don't use this crate
-
unaccent
remove accents from strings, inspired by PostgreSQL's unaccent extension
-
aki-xcat
concatenate files that are plain, gzip, xz and zstd
-
mdbook-davids_cooking
A preprocesor for whatever https://davidsotomarchena.gitlab.io/davids-cooking/ needs
-
dispnet-hash
Hash Library used for Dispnet
-
workstyle
Workspaces with style! This program will dynamically rename your workspaces to indicate which programs are running in each workspace. It uses the i3 ipc protocol, which makes it compatible with sway and i3…
-
icemelter
minimize files that trigger internal compiler errors (ICEs)
-
charclass
define and modify unicode character classes
-
fea-rs
Tools for working with Adobe OpenType Feature files
-
comp_input
Input library for competitive programming
-
text-transliterate
transliterate texts using the SO iconv from POSIX
-
kvarn-chute
A Markdown converter designed to use the Kvarn templating engine
-
stam-python
STAM is a library for dealing with standoff annotations on text, this is the python binding
-
dynfractal
A program to render any equation (in terms of z and c) into an ASCII fractal
-
unobtanium-segmenter
A text segmentation toolbox for search applications inspired by charabia and tantivy
-
text_distance
A collection of approximate string matching algorithms
-
nlf
A CLI to append newline characters (LF) at the end of text file
-
rusttyper
Basic text layout, using rusttype
-
clafrica
This application allows you to type most of the characters in the african alphabet in any text field
-
meet
building website
-
hitori
Generic compile-time regular expressions
-
utf8-bytes
bytes::Bytes, but UTF-8
-
timeblok
A language for event scheduling in plain text
-
xenon-lexer
The Xenon compiler's lexer
-
mors
converting morsecode to text and reverse
-
rustrings
Strings manipulation for Rust
-
porter-stemmer
Flexible and unicode friendly, Porter stemmer implementation
-
htmlescape
HTML entity encoding and decoding
-
jmdict-fast
Blazing-fast Japanese dictionary engine with FST-based indexing
-
extract-strings
Extract ascii strings from files
-
korean
hangul manipulation
-
datatroll
a robust and user-friendly Rust library for efficiently loading, manipulating, and exporting data stored in CSV files
-
ipwhois_rs
Rust wrapper for the ipwhois.io geolocation API
-
runefix-cli
Unified CLI frontend for runefix tools: width detection, markdown formatter, and terminal testing
-
chargrid_decorator
chargrid views for decorating other views
-
id32
32-character RFC 4648 lowercase Base32 IDs
-
bitflip
functions to generate bitflips of binary and UTF-8 strings
-
simplecc
Chinese Convert library (partially) compatible with OpenCC's dictionaries
-
code-agent
AI-Native Code Assistant Library
-
moodle_api
Moodle Webservice Client in Rust
-
aneurysm
Brainf**k interpreter
-
giff
Visualizes the differences between the current HEAD and a specified branch in a git repository using a formatted table output in your terminal. The differences are displayed with color-coded…
-
yinzhe9
喵喵隐者9
-
gstring
String with support for Unicode graphemes
-
biodiff-wfa2-sys
Rust bindings for the WFA2 library
-
pdf_form
programatically filling out pdf forms
-
rusty_regex
A minimalistic regex engine using the pipeline: Regex -> AST -> NFA -> DFA -> Match(String)
-
opencc-fmmseg
High-performance OpenCC-based Chinese conversion using FMM (Forward Maximum Matching) segmentation
-
swc-formatjs-visitor
formatjs custom transform visitor for SWC
-
kansuji
漢数字と数字の相互変換のためのライブラリ
-
ansi-escape-sequences
High-performance Rust library for detecting, matching, and processing ANSI escape sequences in terminal text with zero-allocation static regex patterns
-
text_layout
Text layout algorithms
-
ferrissay
cowsay
-
base16-rs
in Rust offers capabilities for encoding and decoding data in Base16 format. By utilizing the hex library, you can transform data into its hexadecimal representation and also decode…
-
detone
Decompose Vietnamese tone marks
-
cgp-monad
Monadic constructs for CGP computation
-
protobuf
Protocol Buffers - Google's data interchange format
-
mdxjs
Compile MDX to JavaScript in Rust
-
iso15924
ISO 15924 data
-
rubidium
Rb37
-
jfmt
command-line tool for formatting json files in both readable and compact formats. It supports stdin/stdout shell usage, as well as working on files directly.
-
help_crafter
help message generator without hussle
-
glyph-names
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
qpprint
console printing/formatting
-
anagrambot
find anagrams of words
-
asciis
ASCII base on RFC20. Just ord() and chr()
-
meztide
SDK for lotide
-
nmd-core
Official NMD compiler core
-
ps-str
String transcoding library
-
markov_strings
A simplistic Markov chain text generator
-
cuid1
CUID protocol in rust
-
tiny-clean
A lightweight, high-performance string sanitizer with configurable rules
-
mystem
Wrapper around Yandex Mystem for Rust
-
delay_writer
Wraps a writer and delays its output after each newline
-
string-box
Create Rust string from UTF-8 string, byte string or wide string
-
string_search
searching strings using substrings
-
cp437_constants
Defined constants covering the entirety of ASCII code page 437 (https://en.wikipedia.org/wiki/Code_page_437)
-
atrium-api
API library for AT Protocol (Bluesky)
-
ucd-raw
Uninterpreted access to the unicode UCD
-
regex-split
split_inclusive for the regex crate
-
c6o-obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
biblib
Parse, manage, and deduplicate academic citations
-
johalun/module
FreeBSD kernel module in Rust
-
snakecase
general purpose snakecase implementation supporting both ascii and unicode
-
pg_str
Adds str functions to Postgresql via an extension
-
correct_word
A No brainer 'did you mean' library for Rust
-
wrapr
wrap your code for ai
-
hello_lib
Demonstrate Generics Function
-
rst_renderer
a reStructuredText renderer
-
case-conv
Faster case conversion crate
-
reddish
A Rust utility library
-
open-protocol
Atlas Copco/Torque Open Protocol specification in Rust
-
cmark-syntax
Syntax highlighting for pulldown-cmark
-
ascii-hangman-backend
customizable Hangman game with ASCII-art rewarding for children (backend)
-
rust-regex-dsl-creator
Regular expression DSL derive macros
-
merge-whitespace
Procedural macros for merging whitespace in const contexts
-
ssort
CLI tool for suffix (inverse lexicographic) sorting
-
wcrs
GNU wc in Rust
-
mdbook-gitbook
mdBook preprocessor to properly render GitBook specific syntax
-
solr_post
command line tool and library to post files in a directory to a Solr server
-
matcher_c
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
docstr
Ergonomic multi-line string literals
-
comment-strip
Remove comments out of text files
-
pprint
Flexible and lightweight pretty printing library for Rust
-
flxy
Full-text searching and scoring of strings
-
nile-library
supporting nile
-
notegraf
Core library for building a graph-oriented notebook
-
strip_markdown
remove markdown syntax from markdown files
-
jp-location-relation
隣接する市区町村の一覧を取得
-
chinese
language nlp tools
-
guarding
guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.
-
harfbuzz
Rust bindings to the HarfBuzz text shaping engine
-
gbx
GBX (Grundbuch-Exchange) Dateiformat
-
idna-cli
Encode/decode Unicode domain names to/from IDNA ASCII
-
encoding-index-singlebyte
Index tables for various single-byte character encodings
-
CLI_Project_Scott_Coakley
CLI Project in Rust
-
owlz
"Owlz" ascii emojis, created randomly or by design
-
wcounter
Give the word and count the appearance
-
mdxbook
Fork of mdBook, with more customizations and flexibility for programmers
-
analyse-json
CLI tool for inspecting (Newline Delimited) NDJSON or JSON to understand the contents
-
sluggify
slug or clean url generator for rust. With default settings, you will get an hyphenized, lowercase, alphanumeric version of any string you please, with any diacritics removed, whitespace and dashes collapsed…
-
scrambler
command line tool to scramble letters
-
refac
Transform some text given a generic natural language prompt
-
makepad-rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
bin-rs
Binary Reader from multi source
-
brainfuck_interpreter
brainfuck interpreter written in Rust
-
mdbook-yml-header
mdBook preprocessor for removing yml header
-
mdbook-renderer
assist implementing an mdBook renderer
-
prefix_writer
writer that can prefix text that contains mulptiple lines or incomplete lines
-
fast_whitespace_collapse
Collapse consecutive spaces and tabs into a single space using SIMD
-
lindera-ko-dic
A Korean morphological dictionary for Ko-Dic
-
tfon
Bitmap font parsing / conversion
-
forgiving-htmlescape
HTML entity encoding and decoding, with support for leaving malformed entities intact
-
lindera-dictionary
A morphological dictionary library
-
chromalog
A customizable logger with dynamic color coding and file logging
-
deepl-openapi
The DeepL API provides programmatic access to DeepL’s machine translation technology
-
pdfrust
PDF parser
-
mdbook-indexing
mdbook preprocessor for index generation
-
lingua-german-language-model
The German language model for Lingua, an accurate natural language detection library
-
ctf-brute
Brute-force utilities for Rust
-
mdbook-footnote
mdbook preprocessor for footnotes
-
bible-io
working with Bible text data structures
-
translitrs
Transliteration utility for Serbian language
-
crustword
Crusty Crosswords
-
ruSTLa
A reStructuredText → LarST ⊂ LaTeX transpiler
-
lipgloss-tree
A tree component for terminal user interfaces, styled with Lip Gloss
-
bpack
Fast, simple and easy to use data compression library meant to work with byte slices of String and str types
-
needleman
needleman-wunsch algorithm implement
-
anystr
An abstraction over string encoding that supports ASCII, UTF-8, UTF-16 and UTF-32
-
swift-check
High-performance, robust, and expressive searching and validation (uses SIMD on x86_64, aarch64, and WASM)
-
thank
Shows info about crates used in your projects so you know who to thank for them and where to do the thanking
-
decondenser-cli
Pretty-print any text based on brackets nesting. CLI crate
-
mtf
Markdown Table Formatter
-
mdbook-svgdx
mdbook preprocessor to convert svgdx fenced code blocks into inline SVG images
-
an-rope
an rope for large text documents
-
mime_4
Strongly Typed Mimes
-
aprilasr
High-level wrapper for the april-asr C api (libaprilasr) using aprilasr-sys
-
baste64
A base64 codec
-
mdbook-typstpdf
An mdBook backend that generates PDF output using Typst
-
insim_core
Core types and traits for the insim crate
-
xml-string
String types for XML
-
strmatch
Conditionally match strings in Rust using regex without much boilerplate
-
libopenlipc-sys
Wrapper around liblipc to interact with Kindle dbus-based LIPC events
-
strcursor
string cursor type for seeking through a string whilst respecting grapheme cluster and code point boundaries
-
slugify-rs
generate slugs from strings
-
text_searcher
text searcher tool
-
story-dl
Story web scraping
-
mdbook-preprocessor-boilerplate
Boilerplate code for mdbook preprocessors
-
wdpe
WebDynpro Parse Engine
-
zw
encoding and decoding text using zero-width characters
-
flexstr
A flexible, simple to use, immutable, clone-efficient
Stringreplacement for Rust -
roman_numerals_fn
A function to convert integers to their roman numeral representation as strings. Values from 1 to 3999 are possible, otherwise it returns an OutOfRangeError. Zero has no representation in roman numerals.
-
alpha-counter
Alphabetic counter
-
minigrep_cargo_test
Just a simple grep exercise followed from the rust book and published in order to learn Cargo
-
yagenerator
Application that uses tinytemplate engine to generate text files. If you have a set of structured data, and need to generated a bunch of arbitrary types of files from it, this tool can help you to save some time.
-
fmt-extra
Extra formaters not shipped in the standard library
-
asimov-valkey-module
ASIMOV module
-
aki-unbody
output first or last n lines, like a head and tail of linux command
-
sourceannot
render snippets of source code with annotations
-
rust_stringz
string manipulation library for Rust that you should probably don't need. It's just for learning purposes.
-
cow-rewrite
Rewrite copy-on-write types copying only when it's neccessary
-
mdbook-bash-tutorial
A mdbook preprocessor that allows embedding Bash scripts as tutorials
-
bocu1
BOCU-1 compressed unicode encoding
-
num2en
For converting integer and decimal numbers into English cardinal or ordinal number words
-
qsv_vader_sentiment_analysis
Bindings for Rust from the original Python VaderSentiment analysis tool. Forked for use with qsv.
-
libabbs
aosc-os-abbs maintenance
-
mdbook-hide
A preprocessor for mdbook that adds support for hidden chapters
-
wqpl
The wq programming language
-
lesspub
CLI tool for downloading Sequences from LessWrong and exporting them as EPUB format ebooks
-
bge
Rust interface for BGE Small English Embedding Library
-
AsgoreCore
A small rust library to manipulate arabic text to fit in non-supporting arabic games or programes
-
bibutils-sys
Rust bindings for bibutils, a program for bibliography format interconversion
-
md-parser-kma
parser that converts basic Markdown syntax (headers, bold, italic, lists) into corresponding HTML tags
-
zoitei
alphabet conversions
-
yar_markdown
Markdown handling for yar
-
mdbook-chapter-zero
A mdBook preprocessor that allows 0th (sub-)chapter
-
scfmt
formatting code with structured comments
-
unindenter
unindent text
-
mdbook-treesitter
mdBook preprocessor for html adding tree-sitter highlighting support
-
best_utils_lib_ever
Best utility library/crate ever made!
-
rtml-rust
RTML for rust
-
komga-sdk
Async Rust SDK for the Komga REST API, generated from OpenAPI spec. Supports all endpoints, file upload, and strong typing.
-
llmtext
Turns any website into a single LLM-ready markdown file
-
bpe-tokenizer
A BPE Tokenizer library
-
committed
Nitpicking commit history since beabf39
-
sima-rs
A blazingly fast simple string matcher library
-
freesia
some string operators
-
widget-forge
A Widget Based Application Engine for Ascii-Forge
-
lucide-yew
Yew port of Lucide
-
minigrep_jeck
minigrepis a grep clone that takes a query and searches for the query in the file; with added support for regex -
md-todo
Work with todos in markdown files
-
character_text_splitter
splitting text into chunks with overlap, designed for handling large amounts of text efficiently. Implementation is identical to langchain's CharacterTextSplitter
-
paperless-api-client
Paperless-ngx API client
-
owoify
Text owoification library
-
fancy-regex-fork-pb
A custom fork of the fancy-regex crate. You probably don't want to use this.
-
show-bytes
Display bytes as printable ascii with escape sequences as needed
-
vape
full width aesthetics
-
jcalendar
Japanese Calendar for Rust
-
homoglyph-core
Domain layer for the homoglyph cli and service. Compute homoglyphs for an input sentence.
-
markdown_converter
html to markdown converter and flavored markdown to discord markdown converter
-
byte-num
converting numbers to bytes, and bytes to numbers in base 10!
-
proc-tools-core
过程宏工具核心
-
md_ol_util
Transform markdown unordered lists to ordered lists with current position expressions
-
tidyvcf
command-line tool to convert VCF files to tab/comma separated tables
-
manman
setup the on-line manual pages
-
group-similar
Group similar values based on Jaro-Winkler distance
-
scribe-webservice
Web service interface for Scribe repository analysis
-
target_info
Get text strings of attributes concernign the build target
-
mdja
日本語に最適化されたMarkdownパーサー - CommonMark + GFM対応、目次生成、読了時間計算
-
rupantor
A Bengali Phonetic Parser which is very flexible and supports Avro Phonetic
-
splutter
create/generate text efficiently
-
upodesh
generating Bengali word suggestions based on a given input string
-
msr-core
Industrial Automation Toolbox - Common core components
-
bloom-html
HtmlNode type for bloom-client and bloom-server
-
remake
writing maintainable regex and managing symbol soup
-
snekdown
A parser for the custom snekdown markdown syntax
-
ctrl-z
A composable reader to treat
0x1Aas an end-of-file marker -
serbzip
A quasi-lossless Balkanoidal meta-lingual compressor
-
caser
Change text between PascalCase, camelCase, and snake_case
-
unicode_names2_generator
Generates the perfect-hash function used by
unicode_names2 -
lingua-spanish-language-model
The Spanish language model for Lingua, an accurate natural language detection library
-
naromat
Convert text to narou novel format
-
fmtm
A diff-friendly Markdown formatter that breaks lines on sensible punctuations and words to fit a line width
-
gen3-charset
Pokemon Generation 3 Character Set Support (GBA)
-
hydroper_source_text
Source text containing line locations
-
gemmy
high performance orderbook written in rust
-
small-bwt
BWT construction in small space
-
regexy
lightweight Rust library for working with regular expressions. The regexy crate provides an easy-to-use interface for matching patterns in strings using regex
-
s3grep
A fast, parallel grep tool for searching logs and unstructured content in AWS S3 buckets
-
ripgrep
line-oriented search tool that recursively searches the current directory for a regex pattern while respecting gitignore rules. ripgrep has first class support on Windows, macOS and Linux.
-
trie-alg
Trie implementation
-
encode_rs_fs
Read and Write files with encoding_rs
-
find_unicode
Find Unicode characters, the easy way!
-
text_manipulation_rs
generating random placeholder text in different languages
-
trevordmiller
Personal CLI
-
wordshk_tools
A combination of parsers and other tools for words.hk (粵典)
-
fetch_mediawiki_configuration
Generate
parse_wiki_textconfiguration code for a MediaWiki wiki -
shaygrep
Minigrep from the Rust Book, for my practice
-
indoc
Indented document literals
-
opencc-sys
OpenCC bindings for Rust
-
transcript
A transcriber for European scripts
-
str_overlap
Methods for finding the overlap between two string slices
-
gh_page_tool
A github gh-pages tool for static blog site
-
pest3_core
Early WIP prototype version of pest 3.0
-
kicad-text-injector
A tiny CLI tool that replaces variables of the style
${KEY}within KiCad PCB (pcbnew) files -
rckive-genpdf
User-friendly PDF generator written in pure Rust
-
linkcheck
extracting and validating links
-
vidyut-kosha
A Sanskrit key-value store
-
slicestring
slicing Strings
-
whitespace-conf
Key-value configuration file delimited with whitespaces
-
sejong
Buffer is a buffer that can receive ASCII bytes different from keyboard and send out UTF-32 Hangul string. This buffer allows deletion by Jamo.
-
text-utils
Text utils for unescaping and align
-
mdbook-svgbob2
Alternative mdbook preprocessor for svgbob
-
dr
Command-line data file processing in Rust
-
igpay-atinlay
Translate text to Pig Latin
-
codetypo
Source Code Spelling Correction
-
ngram
Iterator adaptors for n-grams and k-skip-n-grams
-
simple-markdown-parser
handling markdown. May include extra utilities
-
hoedown
bindings for the Hoedown markdown processor
-
persian-tools-cli
cli for rust-persian-tools crate
-
pdf-rename
This script reads a list of PDF files from a specified directory and renames each file based on its content. The renaming logic uses the content of the PDF to generate a more descriptive and meaningful filename.
-
minigrep-cli-tool
cli tool to search upon a text file for lines containing specified string
-
lsp-document
Helpers to convert between LSP documents and Rust strings
-
fingers
a finger client library
-
viterbi_pos_tagger
A part-of-speech (POS) tagger using the Viterbi algorithm
-
google_translate_request
Google translate request to a spesific endpoint
-
telegram-escape
Escape text for Telegram's MarkdownV2 format
-
uwl
A management stream for bytes and characters
-
yini
extremely simple .ini file parser
-
llm-tui
A Terminal User Interface (TUI) for interacting with Language Learning Models (LLM) using llm-cli
-
mdbook-bibfile-referencing
An mdBook preprocessor to add bibfile referencing to each page
-
rew
A text processing CLI tool that rewrites FS paths according to a pattern
-
codes-iana-charset
This package contains an implementation of the IANA Character Set registry
-
translation-api-cn
Some useful structs for calling Chinese translation api cloud services. A helper tool for
bilingualcmdline tool. -
ucd-util
A small utility library for working with the Unicode character database
-
notmecab
tokenizing text with mecab dictionaries. Not a mecab wrapper.
-
csvsc
Build processing chains for CSV files
-
m_lexer
extensible regular expressions based lexer
-
rspy
Pythonic interface for Rust
-
kindle_formats
File formats used in the Amazon Kindle ecosystem
-
deno_3p_lib
deno executable
-
divvunspell-bin
Spellchecker for ZHFST/BHFST spellers, with case handling and tokenization support
-
markdown-tables
generating markdown-formatted tables
-
rust_baht_text
Convert number to Thai Baht text
-
mediawiki_parser
A strict parser for MediaWiki markdown
-
libphonenumber-sys
rust ffi bindings to libphonenumber
-
shift_or_euc_c
C API for shift_or_euc
-
simple-text-pattern
Matching simple text patterns
-
rfc9839-rs
RFC9839 to test for problematic Unicode code points
-
font-map
Macros and utilities for parsing font files
-
asciirend
no_std compatible ascii rendering core
-
e2m
A lightweight CLI tool that transforms English words into emojis using only Rust’s standard library
-
surt-rs
Sort-friendly URI Reordering Transform (SURT)
-
hunspell-sys
Bindings to the hunspell C API
-
unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
uwildmat
wildmat/uwildmat matching
-
lazy-string-replace
A lazy version of
String::replace, so that it can be formatted or recursively replaced without intermediate allocations -
llm-shield-scanners
Security scanners for LLM Shield toolkit
-
synox
Program synthesis of string transformations from input-output examples
-
hex_table
Format a byte slice into a hex table
-
py-regex
A small wrapper around the Python regex module via PyO3
-
emoji-commit
Make your git logs beautiful and readable with the help of emojis
-
integer_scaling
IntegerScaling wrapper for those who prefer snake_case
-
badascii-mdbook
Embed badascii diagrams in your mdbook. See badascii.me for the editor.
-
textcon
Template text files with file/directory references for AI/LLM consumption
-
utf8-supported
Determine the UTF-8 support of the current locale
-
bubble-bath
Small and quick HTML sanitizer
-
zuk
Yozuk command-line interface
-
rpdf
PDF command-line utils written in Rust
-
dtxt-detect
dangerous text detection, optimized for high speeds
-
cabocha
Safe Rust wrapper for cabocha a japanese language dependency structure analyzer library
-
runestr-pancjkv
rune-based Pan-CJKV support -
deface
Lightweight markup to HTML converter
-
omgwtf8
Optimized-Matching-Generalized Wobbly Transformation Format — 8-bit
-
tree-sitter-stack-graphs-java
Stack graphs for the Java programming language
-
code-splitter
Split code into semantic chunks using tree-sitter
-
kind-tree
Syntatic trees for Kind compiler
-
textcat
detect text categories. It can be used to detect the language of a given text
-
cl
A command line tool for recording changes to be collected for use in a Keep A Changelog formatted CHANGELOG.md
-
rjoin
joining CSV data on command line
-
mdbook-check-missing-md
A backend for
mdbookwhich will find Markdowns you forgot on SUMMARY.md -
words-count
Count the words and characters, with or without whitespaces
-
saville_encode
Encode UTF-8 text into Peter Saville's colour code
-
dedent
Procedural macro for stripping indentation from multi-line string literals
-
bm25-vectorizer
A minimal Rust library for creating sparse vector representations (embeddings) using the BM25 algorithm for information retrieval
-
bookrafter
This repository contains code related to bookrafter rendering
-
mdbook-webinclude
Preprocessor for mdBook that includes content from URLs
-
chunkr
A fast and quick chunking library for rust
-
runiq-lib
An efficient way to filter duplicate lines from input, à la uniq
-
artificial-openai
OpenAI backend adapter for the Artificial prompt-engineering SDK
-
slugmin
Slug version which allows to use spaces and big letters
-
pulldown-cmark-fork
A pull parser for CommonMark
-
macro_colors
colorful printing macros
-
rex-regex
A pedestrian library for regular expressions: Slow, but easy to understand
-
chardet
rust version of chardet
-
grace-cli
CLI tool for processing files and strings
-
grader
Stream-based CLI for binary sorting text files via a given shell command
-
flux-tui
Fast and lightweight Terminal UI drawing library
-
mdbook-quiz-validate
Input validation for quizzes used in mdbook-quiz
-
syntaxdot-encoders
Encoders for linguistic features
-
mdbook-merjong
A preprocessor for mdbook to add merjong support
-
abbreviation_extractor
extracting abbreviations from text
-
acorns
Generate an AsciiDoc release notes document from tracking tickets
-
pulldown_typst
A pull parser for Typst markup
-
incredimo
just another font for your terminal
-
tuilet
A textual user interface for Toilet, the ANSI-art text generator
-
grammalecte_client
Grammalecte HTTP client
-
alfrusco
building Alfred workflows with Rust
-
cli-colors
A CLI tool for outputting text in ANSI format with features like colors, underlining, boldening, and italicizing
-
align
aligning text
-
encoding_rs_transcode
Transcode text within writers using encoding_rs
-
tarzi
Rust-native lite search for AI applications
-
duca
Search and read Dante's Divine Comedy from your terminal
-
chinese-telegraph
unicode to chinese telegraph code conversion
-
tiny-ver
tiny version parser
-
minigrep_by_wjw
realization of console command line
-
encoding-next-index-simpchinese
Index tables for simplified Chinese character encodings
-
artificial-types
Reusable prompt fragments and helper types for the Artificial prompt-engineering SDK
-
base16384
Encode binary file to printable utf16be, and vice versa
-
campfire
A tiny static site generator, greatly inspired by Zola
-
lf_lint
Linter that checks that all files in <PATH> ends with a newline
-
couleur
Add the ablity to print colored text to the console
-
html-auto-p
function like
wpautopin Wordpress. It uses a group of regex replaces used to identify text formatted with newlines and replace double line-breaks with HTML paragraph tags. -
esc
Escape characters in strings
-
reflexo-vec2svg
Export a frame of Typst document into svg element
-
english-core
English language inflector
-
popgetter-core
Core library for popgetter
-
inslice
A command-line utility for filtering text input by columns and rows
-
chromazone
line-oriented match and colorization tool
-
esperanto-text
Convert Esperanto text between UTF-8, x-system and h-system transliterations
-
rustfmt_emitter
Rustfmt emitter library
-
markdown-it-heading-anchors
A markdown-it plugin for parsing GFM tasklists
-
ohos-input-method-sys
OpenHarmony's input method binding for rust
-
lindera-ipadic-neologd-builder
A Japanese morphological dictionary builder for IPADIC NEologd
-
computergeneration
compgen but all wrong
-
iregex-automata
Finite automata definitions for the
iregexcrate -
text-fx
A collection of text processing utilities for Rust
-
dcsv
Dyanmic csv reader,writer,editor
-
trie-match
Fast match macro
-
mdbook-latex
An mdbook backend for generating LaTeX and PDF documents
-
gramatika
A minimal toolkit for writing parsers with Rust
-
minigrep_necimye
Functions required to find the lines in file that contains the query. query and file path should be entered through command line preceded by two dashes. Ex: cargo run -- body filename.txt
-
kitoken
Fast and versatile tokenizer for language models, supporting BPE, Unigram and WordPiece tokenization
-
orthrus-core
core functionality for Orthrus
-
string_morph
string case transformations with an emphasis on accuracy and performance. The case conversions are available as functions as well as traits on String types.
-
indicator-extractor
Extract indicators (IP, domain, email, hashes, etc.) from a string or a PDF file
-
floating-ui-utils
Rust port of Floating UI. Utilities for Floating UI.
-
string-simple
containing some simple string utilities that I use in my other projects
-
grep-search
search query in contents lib
-
uniart
A CLI tool to convert images and gifs to terminal characters
-
rreplace
designed to streamline string replacements. It can handle multiple unique replacements and iterates the string only once.
-
uwu_cli
uwuifying the terminal
-
todo-to-issue
CLI tool that converts forgotten TODO comments into actionable GitHub issues
-
alrulab-core
Alrulab Core Library
-
mantra-lang-tracing
Contains functionality needed to collect requirement traces from code or plain text files for the
mantraframework -
markdown-table
Creating markdown tables with Rust!
-
veloci_levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
terraphim-markdown-parser
Terraphim Markdown Parser
-
hex-utilities
working with hexadecimal numbers
-
pdf_forms
programatically filling out pdf forms
-
zed-util
A collection of utility structs and functions used by Zed and GPUI
-
mitex-parser
Parser for MiTeX
-
mdbook-numeq
An mdbook preprocessor for automatically numbering centered equations
-
tiny-gradient
Make your string colored in gradient
-
clippers
Cross-platform clipboard management library
-
mdbook-twiki
twiki backend for mdbook
-
mupdf-sys
Rust FFI binding to MuPDF
-
wkhtmlapp
Convert html to pdf or image
-
cur
that will hunt for your regular expression
-
is_printable
Determine whether a given text-based value is printable
-
yozuk-sdk
Types used in the Yozuk ecosystem
-
mdbook-trunk
mdBook plugin which bundles packages using Trunk and includes them as iframes
-
catatau
A TUI EPUB reader with fuzzy search and navigation
-
console-traits
Basic console traits, such as setting char position and writing strings
-
unicode_types
A mapping of all the unicode characters into convenience types (one enum per block of characters with one variant per character)
-
ultron-syntaxes-themes
Syntaxes and themes dump for ultron
-
mdbook-extended-markdown-table
Preprocessor for mdBook that generates tables with merged cells from ASCII text
-
unicode-character-database
Unicode character database tables (Unicode Standard Annex #44) generated using
ucd-generate -
mdbook-tools
A collection of tools for mdbook
-
matchable
Check if text is matching against string or regex in one API
-
latinrs
Rust package that transliterates text and characters into thier latin form
-
anon-csv-cli
anonymise CSV files, providing various options to substitute real data with plausable fake data
-
abbreviator
abbreviating long words
-
lithe
A Slim template engine by using Pest
-
hubble
Official Hubble plugin SDK for Rust
-
glifnames
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
enc-check
inspect utf-8 and utf-16 character encodings
-
typst-ts-core
Core function of Typst.ts
-
markdown-formatter
Flavored Markdown (ZH) content formatter
-
cursive-async-view
A loading-screen wrapper for gyscos/cursive views
-
qp-trie
An idiomatic and fast QP-trie implementation in pure Rust, written with an emphasis on safety
-
basalt-core
core functionality for Basalt TUI application
-
levenshtein_lite
No-frills implementation of a Levenshtein Automata and the Levenshtein Distance function
-
textshrink
A minimalistic lib that reduces text size by replacing specific character sequences with single Unicode characters (e.g., 'mm' becomes '㎜'). On average, this compresses text by ~5%, with variation also ~5%.
-
mdbook-markdown
Markdown processing used in mdBook
-
trans-case
Transform case
-
litsea-cli
Litsea is an extreamely compact word segmentation and model training tool implemented in Rust
-
rsticle-cli
Command line tool to convert source files into narratives/articles
-
pandoc_ast
deserializes and serializes the markdown ast for writing pandoc filters
-
roxy_cli
A command-line static site generator
-
char-positions
Iterate chars and their positions, i.e. line, column, and byte ranges
-
squidge
shortens delimited data
-
lexikit
CLI that packs an entire project into a single LLM-ready document (Markdown or //-comment style) with UTF-8 detection, binary handling, glob include/exclude filters, and .gitignore awareness
-
print-positions
providing string segmentation on grapheme clusters and ANSI escape sequences for accurate length arithmetic based on visible print positions
-
verba
working with Latin words
-
opentalk-types-signaling-meeting-notes
Signaling types the OpenTalk meeting-notes module
-
glyph_brush_layout
Text layout for ab_glyph
-
mdbook-keeper
An improved testing experience for mdbook
-
retrofont-cli
CLI for retrofont: Render and convert retro ASCII/ANSI art fonts
-
regexgrep
ripgrep tool that suports regular expressions
-
vec-string-to-static-str
providing utilities for converting vectors of
Strings into vectors of&'static str -
jpreprocess-njd
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
validated-slice
Helper macros to implement std traits for custom validated slice types
-
twitter-text
in Rust
-
notedown_ast
Notedown Abstract Syntax Tree
-
raxa
String normalization and cleanup for structured text processing
-
brush-parser
POSIX/bash shell tokenizer and parsers (used by brush-shell)
-
gimme
Pull useful data out of your clipboard
-
igo-rs
Pure Rust port of the Igo, a POS(Part-Of-Speech) tagger for Japanese (日本語 形態素解析)
-
utf8proc-sys
Unsafe rust bindings to the utf8proc library
-
untex
Understand and manipulate TeX files with ease
-
chars_data
Build-dependency for chars, the unicode character information CLI
-
rs-tool
A command-line tool to perform reservoir sampling on a file or a stream
-
fmty
Composable
core::fmtutilities -
wtf8
WTF-8 encoding. https://simonsapin.github.io/wtf-8/
-
groop
Outline-aware grep with syntax highlighting
-
ufofmt
A fast, flexible UFO source file formatter based on the Norad library
-
leven-distance
Compute operational differences between two sequences using the Levenshtein algorithm
-
jiang_mini_grep
minigrep 查询文件的某个字符
-
literumilo
A spell checker and morphological analyzer for Esperanto
-
salvation-cosmic-text
Pure Rust multi-line text handling
-
wildflower
Wildcard matching against strings
-
mdbook-image-size
A mdbook preprocessor which support image size syntax
-
prototty_decorator
Prototty views for decorating other views
-
syllable
counter for use with reading level calculations
-
alemat
type-safe building of MathML
-
regex_static
Compile-time validated regex, with convenience functions for lazy and static regexes
-
humnum
Human numeric sorting program — does what
sort -his supposed to do! -
rss4mdbook
a generator for mdBook as CLI tool, export RSS.xml into u want path
-
dirgrab
CLI tool to concatenate file contents from directories, respecting Git context
-
deregex
show all possible and shortest strings which matches the given regex
-
ltxcut
formats a table-like stream into a LaTeX-table
-
spc-core
A command-line tool for processing and analyzing data from SPC files
-
opencc-jieba-rs
High-performance Chinese text conversion and segmentation using Jieba and OpenCC-style dictionaries
-
mdbook-skill-tree
mdbook plugin to show roadmaps
-
spellabet
Convert characters into spelling alphabet code words
-
b2c2-jis-x-201
UTF-8とJIS-X-201を雑に変換処理する
-
uklatn
Ukrainian Cyrillic transliteration to Latin script
-
marko
Programmtically format text with Markdown syntax
-
mark-html
efficient Markdown to HTML parser written in Rust
-
goodname
assist you with cool naming of your methods and software
-
old_icelandic_zoega
Old Icelandic dictionary for Rust. From "A Concise Dictionary of Old Icelandic" by Geir Zoëga
-
lightgrep
A fast, ergonomic grep-like tool in Rust
-
deno_doc
doc generation for deno
-
typeline_ext_utils
operators for typeline
-
pencil_case
but effective library for writing GUI in bevy
-
crawdad-rkyv
Crawdad: ChaRActer-Wise Double-Array Dictionary with rkyv support
-
md-inc
Include files in Markdown docs
-
to_fraktur
Function that converts any string to fraktur font
-
rustfmt_configuration
Rustfmt configuration library
-
rk-utils
A collection of utility functions and data structures for rust
-
lang-unicodes
Effortlessly convert and validate text across various Unicode ranges, supporting multiple languages for seamless internationalization in your applications
-
glcp
Small library to help you set and get text from a global clipboard
-
mdbook-najan
Preprocessor for the Najan mdBook
-
delim
parsing values from delimited strings
-
informa
font compiler for digital typography
-
arbitrator
Format text based on a set of rules and regexes
-
libanubhav
management system written in Rust
-
seven_seg
Seven-segment digital display for terminal
-
markdown-gen
generating Markdown files
-
pomsky-macro
Macro for converting pomsky expressions to regexes
-
ut1_blocklist
UT1 blocklist URL/domain filters
-
spongemock
CLI-tool for random text capitalization
-
minisearch
A mini search which can handle both - case sensitive or in-sensitive both
-
molybdenum
Recursive search and replace CLI application
-
no-comment
Remove rust-style line and block comments from a char iterator
-
mdbook-shiftinclude
mdbook preprocessor for file inclusion with shift
-
rrename
" Opinionated tool to rename files in batch. Match regular expression, replace some characters I consider noise to kebab case
-
unidecode
pure ASCII transliterations of Unicode strings
-
unicode-ellipsis
truncate Unicode strings to a certain width, automatically adding an ellipsis if the string is too long
-
compute-file-server-cli
Uploads files to Fastly for serving directly from within Fastly Compute applications. Upload any type of file: images, text, video etc and serve directly from Fastly. It is ideal for…
-
mdbook-morsels
Morsels plugin for Mdbook
-
semantic-commands
A lightweight Rust framework for defining and executing semantic commands using text embeddings
-
yeslogic-unicode-script
Fast lookup of the Unicode Script property
-
minigrep21
A program to catch arguments pertaining to query and file path, and to display its search results
-
pulldown-cmark-escape
An escape library for HTML created in the pulldown-cmark project
-
addbib
An app to add linked bibliographies to markdown files
-
aki-txpr-macro
the more easy to use libaki-*
-
chinese_detection
Classify a string as either English, Chinese, or Pinyin
-
rsbook
Generate mdBook-ready Markdown documentation from rustdoc JSON
-
Raifus
view an Ascii waifu
-
dokkoo
Mokk (Macro Output Key Kit) implementation written in Rust
-
rust_lemmatizer
A lemmatizing package for use with a .csv dictionary of lemmas and their corresponding words
-
oeb-rs
oeb电子书解析库
-
glob-match
An extremely fast glob matcher
-
string-parser
String parsing crate
-
kincaid
A word statistics library in Rust
-
base58check
encoding
-
regex-automata
Automata construction and matching using regular expressions
-
unic-char-property
UNIC — Unicode Character Tools — Character Property taxonomy, contracts and build macros
-
auto-regex
Automagically finds a regex that best matches an example and a sample list
-
csv_to_table
pretty print CSV as a table
-
twitter_text_parser
Parser for twitter-text in Rust
-
hashmoji_generator
Code generation tool for hashmoji
-
retest
Command-line regular expression tester
-
xee-xpath
XPath 3.1 library API
-
remove-markdown-links
Turns
[example](https://example.com)intoexample. That’s it -
flerp
CLI tool that does XYZ
-
mdbook-obsidian
mdBook preprocessor to render Obsidian specific syntax
-
chinese-ner
A CRF based Chinese Named-entity Recognition Library written in Rust
-
toshi
A full text search engine based on Tantivy
-
gematria_rs
Gematria, a traditional Hebrew numerology system
-
segtok
Sentence segmentation and word tokenization tools
-
rcut
replacement for GNU cut that supports UTF-8
-
rins_markdown_parser
markdown parser written on Rust
-
jlu
Command-line utilities for on-the-fly investigation of JSON Lines
-
tree-formatter-rs
format hierarchical structures as text-based trees
-
zspell-cli
Command line interface for the ZSpell spellchecking library
-
garde
Validation library
-
rexpaint
functionality for reading and writing .xp files of the Grid Sage Games REXPaint ASCII art editor
-
techlead
CLI is a command-line interface that enables developers to chat with an AI assistant powered by the OpenAI GPT language model, designed specifically to help with your Rust project
-
html-compare
compare html files
-
mdtranslation
prepare multi-lingual Markdown documents
-
romulus
a stream editor like sed
-
json2markdown
convert JSON data into well-structured Markdown format
-
marisa-rs
Safe Rust wrapper for the marisa-trie C++ library
-
pcre2
High level wrapper library for PCRE2
-
uniwhat
Display the unicode characters text
-
untanglr
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies
-
tokenizers-enfer
today's most used tokenizers, with a focus on performances and versatility
-
ab-radix-trie
A compressed radix trie implementation supporting matching rules
-
minigrepx719
A grep implementation in Rust
-
sauron-md
parsing markdown into sauron node
-
tform
format plain text into well-structured Markdown or HTML
-
names-changer
Convert a names of sql schemes from camelcase to snake case
-
companies-house-api
consuming various APIs from Companies House
-
rep-cli
Replace text file in bulk
-
google-fonts
Download and cache TTF fonts from Google
-
irg-kvariants
wrapper around kvariant from hfhchan/irg
-
stringsext
find multi-byte-encoded strings in binary data
-
rustfits
A light-weight FITS file reader in Rust
-
gpt4ocr
Extract structured text from PDFs using OpenAI's GPT4o
-
hangul
manipulate Hangul Syllables
-
utf8-io
Traits and types for UTF-8 I/O
-
git-ref-format-core
Core types for the git-ref-format crate
-
futf
Handling fragments of UTF-8
-
crate2bib-cli
A CLI tool for the crate2bib crate
-
vndb_rs
Bare-bones VNDB api
-
lilgrep
A minimal command-line tool that searches for lines matching a pattern in a file, similar to grep. Built for simplicity and learning purposes.
-
ra_rustc_lexer
lexer used by rustc, zero stability gurantees
-
md-include
include any file in markdown files
-
kurtbuilds_regex
Wraps the regex library to also provide macros
-
excavate
A command line tool to excavate fields from multi-column texts
-
stringedits
Edit trait and associated iterators for small edits to strings
-
aho-corasick-unsafe
Fast multiple substring searching
-
encoding_c
C API for encoding_rs
-
mdbook-langtabs
An mdbook preprocessor that adds language tabs for code blocks
-
pulldown-html-ext
Extended HTML rendering capabilities for pulldown-cmark
-
tantivy-czech-stemmer
Czech stemmer as Tantivy tokenizer
-
table_to_html
interface to convert a
tabled::Tableinto a HTML table (<table>) -
html_to_epub
A command line converts .html file to .epub file
-
encoding8
various 8-bit encodings
-
bitranslit
Bi-directional transliterator for Rust. Transliterates strings according to the rules specified in the language packs.
-
whisperer
将文本编码为简短的中文字符, 防和谐
-
lingua-chinese-language-model
The Chinese language model for Lingua, an accurate natural language detection library
-
sqlify
CLI tool for formatting SQL queries
-
warrah
command-line utility and Rust library that sloppily removes code comments from a text file, supporting 60+ programming languages
-
komga
REST API Client generated from OpenAPI specification
-
hayro-write
rewriting pages of a PDF file
-
custard_lib
A core library for Custard
-
svgbobdoc
Renders ASCII diagrams in doc comments as SVG images
-
tectonic_io_base
Basic types for Tectonic's pluggable I/O backend system
-
decondenser
Pretty-print any text based on brackets nesting
-
mdbook-diataxis
A preprocessor for mdBook which helps apply the Diátaxis documentation framework
-
minigrep_elijahkx
MiniGrep is a Rust-based command-line tool, with a (current) size of 588KB that lets users search files for a given query string and shows matching lines with their line numbers
-
mr-regex
Minimalist ASCII Regex-engine with 300 lines of Rust
-
minigrep_xiaoai
一个简单的命令行工具,用于在文件中搜索字符串。
-
curlicue
Helix keybinding utilities
-
transition-table
transition table utilities for keyword parser
-
wikitext_table_parser
implement for parse the wikitext's table
-
ascii-hangman-webapp
customizable Hangman game with ASCII-art rewarding for children (webapp version)
-
mdbook-tabs
mdBook plugin for rendering content in tabs
-
escape-bytes
Escapes bytes that are not printable ASCII characters
-
glowpub
A glowfic to epub converter
-
re2
Wrapper for the re2 C++ regex library
-
shallow
long text
-
noodler
A port of the python-ngram project that provides fuzzy search using N-gram
-
shopping-parser
A Rust-based parser for parsing structured product information and shopping lists, supporting multiple currencies and units
-
markdown-it-footnotes
Creates footnotes and lists of footnotes in Markdown documents
-
rust-md2html
CLI to convert Markdown to HTML with file watching
-
benda
Use Bend language functions and capabilities in Python!
-
marker
finding issues in CommonMark documents
-
mdbook-typst-math
An mdbook preprocessor to use typst to render math
-
tex-rs
create latex documents in Rust
-
doccy
brace based markup language
-
libxdiff
Rust bindings for the libxdiff C library
-
formatjson
Formats JSON files
-
vl-convert-pdf
convert SVG to PDF with embedded text
-
css_math
Tokenizes and validates strings as CSS Math statements
-
scribe-patterns
Advanced pattern matching and search algorithms for Scribe
-
typos-dict
Source Code Spelling Correction
-
didaskalia
over-complicated language to store text entries and sequences across different languages
-
text-diff
text diffing and assertion library
-
minigrep_unlimate_test_of_rustbook
minigrep from rust book lul
-
afrim-memory
handle of sequential codes easier for an input method
-
codetypo-vars
Source Code Spelling Correction
-
render_readme
Render Markdown or reStructuredText with syntax highlighting and image filtering similar to GitHub's
-
replace_cli
copy contains of given file to another
-
asimov-openai-module
ASIMOV OpenAI module
-
mdbook-spec
An mdBook preprocessor to help with the Rust specification
-
inkly-core
Core PII detection engine for Inkly SecureText
-
clippy_lints
A bunch of helpful lints to avoid common pitfalls in Rust
-
string_art
Convert images into thread patterns for creating string art. It generates detailed instructions in text format and provides graphical previews of the resulting patterns.
-
invisible_unicode
finding invisible unicode characters
-
homoglyph-service
homoglyph-cli
-
sesdiff
Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings in column A to the strings in column B. Also provides the edit distance (levenshtein).
-
mini-openai
An OpenAI API client with minimal dependencies
-
pdf-sign
extract signed date from pdf file
-
asciimath-text-renderer
Render asciimath in terminal
-
wkhtmltopdf
High-level bindings to wkhtmltopdf
-
rescue-blanket
Escape values while they are being formatted
-
meme_generator_utils
Meme generator utils
-
bingrep123
tutorials project of fbbin
-
admerge
Merge multiply sources into one, with advanced options
-
xhtmlchardet
Character set detection for XML and HTML
-
ragtime
Easy Retrieval Augmented Generation
-
termio
styling terminal output with CSS-like syntax
-
ragzilla
providing tools for RAG (Retrieval-Augmented Generation) pipelines
-
rexis-rag
Rexis RAG - High-performance Retrieval-Augmented Generation framework with memory-first agents, vector search, and LLM integration
-
sttx
belt for transforming speech-to-text data
-
raw_str
type which contains a byte slice that is believed to contain UTF-8 string data, but which has not been validated
-
ssbm_utils
A collection of useful tools for working with data from Super Smash Bros Melee
-
tre-regex-sys
Rust bindgen bindings to the TRE regex module
-
lazy-transform-str
Lazy-copying lazy-allocated scanning
strtransformations. This is good e.g. for (un)escaping text, especially if individual strings are short. -
supercat
A syntax highlighting alternative to cat
-
ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched
-
mdbook-preprocessor
assist implementing an mdBook preprocessor
-
presciidoc
Preprocessing AsciiDoc for other tools
-
transliterate1234
UTF-8 to ASCII transliteration
-
samvadsetu
LLM API for commonly used LLM services including Gemini, ChatGPT, and Ollama. The name implies a bridge for dialogue since the library facilitates communication and interaction between…
-
dequote
Remove nested quotes around text
-
pattern-3
Needle API (née Pattern API 3.0), generalization of
std::str::pattern -
moenster
mønster (n) - pattern. simple glob-style pattern matching for strings
-
zummi
fun lib that produces spoonerisms
-
popgetter-py
Python bindings for popgetter
-
markovish
Markov chain implementation for text generation
-
regex-cli
A command line tool for debugging, ad hoc benchmarking and generating regular expressions
-
tectonic_bridge_harfbuzz
Expose the Harfbuzz C/C++ APIs to Rust/Cargo
-
mini__grep
grep tool
-
mdbook-mathpunc
An mdbook preprocessor that prevents line breaks between inline math blocks and punctuation marks when using katex
-
hulk
An ultra simple no hassle static site generator
-
unicodeit-cli
The command line interface to unicodeit
-
ik-rs
chinese segment, ik-analyzer for rust
-
generic-lexer
A generic lexer using a simple match function
-
lindera-unidic
A Japanese morphological dictionary for UniDic
-
wordnik_list
Quick, local library to check valid words within the Wordnik word list
-
kpathsea
Rust interface to the kpathsea TeX file management library
-
tiny-grep
grep-like text search utility written in Rust
-
staple
powerful static site generator
-
termwrap
Wrap Unicode text with ANSI color codes
-
pray
A tui tool for preparing a prompt to the llms
-
limace
Slugify some strings
-
ispell
Easily run ispell, hunspell or aspell from Rust
-
markdown-table-formatter
Markdown table formatter fully compliant with Unicode 15.1.0
-
assert-text
the testing macro tools
-
repub
convert markdown documents to epub
-
yozuk-core-skillset
Set of default Yozuk skills
-
grepmini
grep-like command line tool written in Rust to search for text patterns in files. Built following the Rust book's example.
-
scripter
A screenplay compiler
-
static_table
creates pretty tables at compiler time
-
crypto-invert
Unicode Upside-Down Mapping
-
engish
A language utility for sampling letters and building words
-
pdftotext
High-level library that binds to Poppler to extract text from a PDF
-
advanced_string_generator
A command-line tool for generating strings based on customizable regex patterns
-
indentation
Formatter
-
github-slugger
A slugger for GitHub headings
-
spezilinter
spezifisch's linter for different file formats, linting for weirdly specific stuff
-
selmr
Package to create and use Simple Explainable Language Multiset Representations
-
timfmt
A small utility for formatting code as Tim likes it
-
emoji-printer
Replace emoji shortcodes in string with emoji unicode (":sushi:" -> 🍣)
-
const_format_proc_macros
detail of the
const_formatcrate -
framework
detector for different frameworks in one projects
-
jpreprocess-jpcommon
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
smol-base-x
no-std centric implementation of base-x
-
enpsrlib
English Phrase Structure Rules library
-
wz
Count words, fast
-
ahtml-from-markdown
Convert Markdown to ahtml HTML element trees
-
kindleclip
CLI tool to export a file per book in kindle 'My Clippings.txt'
-
json-peek
Amature JSON parser library designed for my specific need
-
pra
Print Random ASCII
-
xur
Toy langage made to be used in a REPL
-
node-emoji
Convert
:emoji:to Unicode using GitHub’s and EmojiDB’s emoji names -
wordbreaker
A Unicode-aware no_std crate (requires alloc) that rapidly finds all sequences of dictionary words that concatenate to a given string
-
ryaspeller
lib for searching typos in text, files and websites
-
libfmt
A text formating package for rust
-
ascii-img-cli
Command-line tool for using ascii-img
-
angr
analyse ngrams in text files
-
git-busy
A wrapper around "git commit" that generates the commit messages for you
-
aki-mline
match line, regex text filter like a grep of linux command
-
bullshit-rs
A BullshitGenerator implementation in Rust
-
ligotab
Format delimited data with lightweight markup
-
sm-search
way of searching through text - for people who are too lazy to use Regex
-
struckdown
A structured markdown / commonmark library for Rust
-
leven
Make your own blog!
-
cindex
CSV indexing library
-
unidok
A powerful, readable, easy-to-learn markup language
-
mitex-spec
Specification Library for MiTeX
-
paltoquet
rule-based general-purpose tokenizers
-
kradical_static
Ready-to-use EDRDG radical decompositions
-
bk-tree
A Rust BK-tree implementation
-
blockcounter
Counts the blocks in a stream
-
oxidized-json-checker
A pushdown automaton low memory JSON bytes stream checker
-
markdown-composer
composing markdown documents
-
morse-nostd
A nostd version of the morse crate
-
fcowsay
working with cowsay
-
mul
Bengali stemmer
-
lindera-cli
A morphological analysis CLI
-
aki-stats
output the statistics of text, like a wc of linux command
-
soup
Inspired by the python library BeautifulSoup, this is a layer on top of html5ever that adds a different API for querying and manipulating HTML
-
markdown-extract-cli
Extract sections of a markdown file with a regular expression
-
bbd-lib
Binary Braille Dump
-
yassgy
A static site generator
-
mark-rs
A Markdown parser and Static Site Generator
-
character_frequency
counting character frequencies in a string concurrently
-
unic-ucd-normal
UNIC — Unicode Character Database — Normalization Properties
-
toml_to_table
pretty print TOML as a table
-
md-localizer
Localize markdown with remote links
-
opentalk-roomserver-types-legal-vote
OpenTalk RoomServer Types Legal Vote
-
word_filter
A Word Filter for filtering text
-
kth-lines
Command line tool for filtering stdin lines that just work
-
rustex
auto-generated LaTeX files in Rust
-
pups_text
Pretty Understandable Parsers
-
tectonic_engine_xetex
The XeTeX engine as a reusable crate
-
intname
Full English name for any integer of any primitive integer type
-
lindera-sqlite
Lindera tokenizer for SQLite FTS5 extention
-
lindera-ipadic
A Japanese morphological dictionary for IPADIC
-
sparklet
small flashcards library
-
moenarchbook
Creates a book from markdown files
-
pdf-create
low-level, strongly-typed PDF creation library
-
md-designer
A CLI tool for creating design docs in Markdown
-
simple-xml-builder
XML builder/writer
-
gregex-logic
Logic for the gregex crate
-
fiberplane-markdown
convert Fiberplane Notebooks to and from Markdown
-
asimov-readwise-module
ASIMOV module
-
iata_bcbp
IATA BCBP Parser in Rust Based on Resolution 792
-
asimov-sdk
ASIMOV Software Development Kit (SDK) for Rust
-
izihawa-tantivy-tokenizer-api
Tokenizer API of tantivy
-
pdf_composer_base
PDF Composer base functionality crate
-
highlight-pulldown
Process pulldown-cmark events to apply syntax highlighting to code blocks
-
rust-crlf
-
tradukisto
Kinda useful natural language translation library and utility
-
mitex-lexer
Lexer for MiTeX
-
heyiwei
何意味
-
webreg
A CLI tool for testing regexes against web pages
-
nlprule-build
Build tools for a fast, low-resource Natural Language Processing and Error Correction library
-
skribo
low-level text layout
-
clxrity_lilgrep
grep-like utility written in Rust
-
kanpyo
Japanese Morphological Analyzer
-
schmfy
Schmfication library
-
mdbook-tectonic
An mdbook backend for generating LaTeX and PDF documents
-
xml_magic
A reasonably fast XML formatter
-
chisel-lexers
Chisel backend lexers/scanners
-
kasedenv
Read environment variables by lower, upper case or case-insensitive keys
-
lithe-cli
A cli of lithe
-
toresy
term rewriting system based on tokenization
-
llmvm-codeassist
A LLM-powered code assistant that automatically retrieves context (i.e. type definitions) from a Language Server Protocol server.
-
rust_nickname_generater
that generates user/nick names based on the rust language
-
every-range
Iterator for interleaving included and excluded ranges
-
pattern-generator
solving Sudoku puzzles. It takes a Sudoku puzzle input and provides the solved grid.
-
rep-grep
wgrep/write-grep CLI
-
scie
research about how to build simple code identify engine for different languages
-
endf_parser
parsing ENDF-6 format nuclear data
-
findtext_textfile
Search text in text file
-
ipsae-core
markdown parser for DIY lover
-
gqlog
👾 filter your json logs with graphql 👾
-
tzgrep
grep tar.gz
-
bpmf_py
A Bopomofo and Pinyin library
-
xxxxx_rust_sts
A collection of useful string and file utilities for Rust
-
tokengrams
Compute n-gram statistics and model language over pre-tokenized text corpora used to train large language models
-
cli_tool
A CLI tool
-
prettythanks
frontend to dtolnay/prettyplease library
-
quicksilverx
easy to use grep clone
-
jtool-grep
notebook-specific grep tool for jtool
-
gen-epub-book
Generate an ePub book from a simple plaintext descriptor
-
mdbook-preprocessor-utils
writing mdBook preprocessors
-
bukvalno
A cli tool for converting images to ascii art
-
strfry
randomize a string
-
seq2xypic
Turn a text sequence diagram into a LaTeX xypic diagram
-
typeline
Efficient, Type-Safe Pipeline Processor
-
writings
The Bahá’í Sacred Writings for use in Rust projects and APIs
-
epub2mdbook
convert EPUB files to MDBook format
-
swc_ecma_lexer
Feature-complete es2019 parser
-
korrektor
work with Uzbek language text processing
-
rmw-utf8
Short text compression algorithm for utf-8 (optimized for Chinese , developed based on rust programming language). 面向utf-8的短文本压缩算法(为中文压缩优化,基于rust编程语言开发)。
-
kaomoji-ru
The full http://kaomoji.ru database, encoded and escaped as byte literals
-
szovegertesimutato-score
Calculate szovegertesimutato score for a given text and language
-
single_source
Generate code files from snippets in md tutorial files
-
caribon
A repetition detector program and library
-
mdbook-summary-generate
A mdbook preprocessor to generate SUMMARY.md from a directory structure
-
aki-resort
sort lines of text. You can use regex to specify the KEY.
-
make_it_braille
A CLI tool and library to manipulate unicode braille images
-
jellybean
Syntax highlighting with tree-sitter. Sweet colors.
-
unicode_skeleton
detects unicode strings that look nearly identical once rendered, but do not compare as equal. It defines "confusable" and "skeleton" based on Unicode Standard Annex #39
-
ngrams
Generate n-grams from sequences
-
sauron-markdown
parsing markdown into sauron node
-
cattocol
Combine two text into one text as columns
-
p101_enc
convert Olivetti P101 program to and from different encodings
-
findcar
An extensible CLI tool that filters, sorts, and aggregates search results from car websites. Because of course you want to search for cars from the command line.
-
mdbook-iced
An mdBook preprocessor to turn iced code blocks into interactive examples
-
yitizi
異體字查詢 Get variant Chinese characters
-
zalgo-text
A command line tool for generating zalgo text
-
boringascii
Strings that can only be constructed to contain non-whitespace, non-control ASCII characters
-
kakasi
Romanize hiragana, katakana and kanji (Japanese text)
-
aki-json-pick
The json pick out command
-
cli_app_capo
CLI application with Unix-like tools
-
rustsay
CLI tool in Rust that mimics the classic cowsay program, allowing a cow to speak your text in the terminal
-
character-stream
Helper data structures for reading UTF-8 characters from a stream
-
mdbook-translation
prepare multi-lingual mdBook books
-
wimbd
A CLI for inspecting and analyzing large text datasets
-
uo_rst_parser
fork of rst_parser with fixes for upstream-ontologist
-
text-to-json
Convert text to json in rust
-
yozuk-helper-english
English NLP utilities for Yozuk
-
contractions
expand contractions in English
-
codegenrs
Moving code-gen our of build.rs
-
detect-indent
Detect the indentation of code
-
asimov-dataset-cli
ASIMOV Dataset Command-Line Interface (CLI)
-
unflow
DSL to convert design to code
-
encoding-index-japanese
Index tables for Japanese character encodings
-
pranav_minigrep
Mini Version of grep regex command. It finds out the given string and returns the line containing it.
-
is_utf8
functions to determine if a sequence of bytes is valid utf-8
-
gret
command line tool to search for patterns and show matches in a tree structure
-
unicode_names
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
repa
Peak Performance Pattern Seeker
-
triangle-from-scratch
An educational series on how to do stuff yourself
-
kmdbook-html
mdBook HTML renderer
-
easy_random
Generate random data easily with easy_random :)
-
tabwriter
Elastic tabstops
-
opengrep
Advanced AST-aware code search tool with tree-sitter parsing and AI integration capabilities
-
sudachiclone
sudachiclone-rs is a Rust version of Sudachi, a Japanese morphological analyzer
-
parattice
Recursive paraphrase lattice generator
-
darn
Extras for the Book on Data Analysis with Rust Notebooks
-
rosie-sys
build or link to librosie to access the Rosie Pattern Language
-
unicodeit
Converts LaTeX to Unicode (rust port)
-
ghimlink
Generates a Github-Flavoured-Markdown formatted image link to an image in the same repository
-
screen_printer
displaying rectangular blocks of text to a terminal
-
epubparse
Parse epub and convert to text-only Book structure
-
skyspell_core
skyspell core library
-
varcon-core
Varcon-relevant data structures
-
lix-score
Calculate LIX score for a given text and language
-
widest-line
Find the widest line in a string with proper Unicode and ANSI escape code support
-
bazaar
formats and protocols
-
alpino-tokenizer
Wrapper around the Alpino tokenizer for Dutch
-
arabic_text_utils
Arabic text processing and manipulation
-
lindera-cc-cedict
A Chinese morphological dictionary for CC-CEDICT
-
literate
programming tool that extracts code written in your Markdown files
-
custard-sock
Unix socket interface for a frontmatter-querying server
-
ansinator_ansi_image
defining specific structures for representing and converting an image in ansi
-
ende
encoding/decoding unicode/utf-8/utf-16(ucs-2) code points
-
tectonic_xetex_layout
XeTeX's font loading and layout interface encapsulation, as a crate
-
unic-emoji-char
UNIC — Unicode Emoji — Emoji Character Properties
-
varcon
Source Code Spelling Correction
-
random_access_unicode
Provide random access to the unicode characters in a file
-
forming
lightweight architecture as code language. 架构描述语言
-
tinytoken
tokenizing text into words, numbers, symbols, and more, with customizable parsing options
-
harfbuzz-sys
Rust bindings to the HarfBuzz text shaping engine
-
lingua-japanese-language-model
The Japanese language model for Lingua, an accurate natural language detection library
-
regex_quote_fixer
Rewrites grep regexpressions for the use in the regex crate
-
soundex-rs
that calculates soundex value
-
exit-with-code
A CLI which returns the exit code of your choice
-
texrender
Thin wrapper around running
latexmkto render LaTeX documents. Also supports generating Tex documents. -
math-text-transform
Transform greek letters, latin letters, or decimal digits into certain variants from the mathematical alphanumeric symbols Unicode block (U+1D400–U+1D7FF). For example to bold, italic, script or double-struck.
-
saurus
Markdown to LaTeX Transpiler
-
text-tables
A terminal/text table prettifier with no dependencies
-
dictcc
Rust API for reading and querying the dict.cc offline translation database
-
difference
text diffing and assertion library
-
chunk_norris
splitting large text into smaller batches for LLM input
-
stylish-stringlike
API for string-like objects that have styles applied
-
pinot
Fast, high-fidelity OpenType parser
-
cesu8-str
CESU-8 and Java CESU-8 string validation and manipulation
-
hashtag-regex
regex matching hashtags accoding to the unicode spec: http://unicode.org/reports/tr31/#hashtag_identifiers
-
unicode_clusters
variable width unicode characters as single items, allowing for array like indexing etc
-
crossandra
A straightforward tokenization library for seamless text processing
-
hms-common
Common lib for Hold my Snip!
-
yeslogic-unicode-blocks
Functions to access and search Unicode blocks
-
minigrep_bourne
classic grep command
-
strizer
minimal and fast library for text tokenization
-
sims
Simplistic string search
-
dtex
Better TeX
-
jpreprocess-naist-jdic
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
font-awesome
A typed manifest of constants of Font Awesome unicode code points
-
japhonex
Japanese phone number checker for Rust
-
write16
A UTF-16 analog of the Write trait
-
grep-clone
A mini grep clone from the Rust-lang official tutorial
-
typos
Source Code Spelling Correction
-
film_parser
application for parsing detailed film data from raw text, including information such as title, release year, director, screenwriter, cast, genre, and runtime, into a structured Film…
-
vaporetto_tantivy
Vaporetto Tokenizer for Tantivy
-
pdf_encoding
Font related encodings
-
lindera-decompress
A morphological analysis library
-
stringsort
Pathological sorting of string characters
-
yeslogic-ucd-parse
parsing data files in the Unicode character database
-
rcut-lib
rcut is a Rust replacement for GNU cut that supports UTF-8
-
kanji_hanzi_converter
Translate Japanese Kanji to Chinese Hanzi
-
tattoy-wezterm-char-props
Unicode and other character properties
-
minigrep_flict
Simplest text-in-file search engine from rust book
-
find-typos
Find typos in your project
-
mdbookshelf
Create epubs from a list of mdbook repositories
-
csvre
replacing data in CSV columns with regular expressions
-
heart-strings
Quickly get random heart emojis to copy!
-
clparse
A command line tool for parsing CHANGELOG.md files that use the Keep A Changelog format
-
pygmentize
wrapper for syntax highlighting
-
betacode
conversion
-
write-html
writing HTML in Rust
-
fvoid
Void file generator, supports many formats
-
ipynb-to-md
Convert Jupyter Notebooks to Markdown files
-
notion2html
Convert Notion pages to HTML
-
ttf_word_wrap
Wraps text based on character width
-
roxy_markdown_parser
Roxy plugin for parsing Markdown
-
mon-artist
Render ASCII art to SVG, with user-customizable rendering rules
-
streamparser
Adapt simple stream parsers for different types of streams
-
mdbook-llms-txt-tools
convert mdbook to llmstxt.org format
-
xot
Full-featured XML tree library for Rust
-
encoding
Character encoding support for Rust
-
jg
Jeff Goldblum (jg) is a command-line JSON processor. jg searches for structural patterns in json input and prints each json object that matches the pattern.
-
rahat3062_minigrep
A light-weight & minimal implementation of the grep cli app
-
novalyn_core
Core library for novalyn changelog generation
-
wordfreq
port of wordfreq for looking up the frequencies of words in many languages
-
bos_books_codes
that handles 3-character Bible Books Codes
-
rizzer
Fuzzy matching tool to find string similarity
-
asimov-chromium-module
ASIMOV module for Chromium (and Brave, Google Chrome) bookmark import
-
csv_coincidence
Tool designed to efficiently search for and identify specific patterns within CSV files
-
codetypo-cli
Source Code Spelling Correction
-
unicode_converter
CLI tool to convert data between various Unicode encodings
-
august
& program for converting HTML to plain text
-
iconv-compat-win-sys
iconv bindings for Rust
-
mdoc
Modern PDF creation through Markdown and LaTeX
-
base1112031
way to represent numbers using all Unicode characters except ASCII control characters
-
argot
Parse documentation from codebases into Markdown for easy doc creation
-
bookkeeping
Minimal, safe and flexible in-memory bookkeeping API
-
tagsearch
Filter plaintext files based on @keyword tags
-
hashlogs
Command-line utility that hashes the part before a space on each line from stdin with blake2b keyed with an ephemeral randomly-generated key and writes to stdout
-
jp_utils
Utils for working with Japanese text
-
asimov-ollama-module
ASIMOV Ollama module
-
top-english-words
retrieve top words from the English language
-
crawdad
ChaRActer-Wise Double-Array Dictionary
-
malvolio
programmatically generating HTML
-
latin1str
Windows-1252 string types
-
demoji
Remove all emojis from a string
-
mdbook-sitemap-generator
generate a sitemap.xml file for an mdbook project
-
docfmt
A document formatter using Handlebars templates
-
yamc
Markdown Converter - A fast and feature-rich markdown to HTML and PDF converter
-
moscato
Outline scaler for OpenType glyphs
-
csv-sanity
Sanitize and transform large CSVs with millions of records quickly and efficiently
-
besida
Language for defining branching dialogue
-
LatexImgGen-lib
generating Latex images of equations with an edition number to avoid cache flushing and timestamp attributes
-
word-star-puzzle-solver
A CLI tool for solving VGs word star puzzle
-
mystsh
Mystical shell parser, formatter, and interpreter with Bash support
-
cw
Count Words, a fast wc clone
-
charwise
This lightweight, dependency-free rust library provides a convenient way to read characters from different resources
-
term-rustdoc
A TUI for Rust docs
-
pdfutil
PDF document manipulation
-
harfbuzz-traits
Rust Traits for the HarfBuzz text shaping engine
-
ngram-search
Ngram-based indexing of strings into a binary file
-
base100
Encode your data into emoji
-
perspicuity_formula
Calculate Flesh Reading Ease for a given text and language
-
litegrep
A basic tool for searching in files for lines of text, based on a query
-
dekor
styling and character repository in Rust
-
mepple
English dictionary as a library
-
kwindex
based "keyword index" library crate that maintaining an index of words from texts
-
claude-tokenizer
tokenizing text with the Anthropic Claude models
-
pulldown_mdbook
A pull parser for mdBook
-
bigstr
A command-line tool to make string BIG
-
autoruby
Easily generate furigana for various document formats
-
hexify
Format hex
-
tex
The νTeX typesetting engine
-
chanoma
Characters Normalization library. 文字列正規化処理用のライブラリです。
-
randem
Print a random emoji optionally with the given string as seed
-
bqrs
apply boolean query to text
-
reedy
A terminal-based RSS reader with a clean TUI interface
-
cutters
Rule based sentence segmentation library
-
fancy_print
Easily print animated ASCII text to the terminal!
-
blitztext
fast keyword extraction and replacement in strings
-
jmbl
A high performance CRDT
-
swappy
An anagram generator
-
conllu-utils
working with the CoNLL-U dependency format
-
yarner-lib
build plugins for the Literate Programming tool Yarner
-
is-vowel
Heuristically test whether a character is a vowel letter
-
ed_join
Implemtation of Ed-Join Algorithm for string similarity join
-
jmemo
CUI tool for maintaining memos
-
utf8-command
UTF-8 encoded
std::process::Commandoutput -
looneygrep
A powerful command-line search tool with web page support and replace functionality
-
kytea-tokenizer
Wrapper of tokenization by KyTea
-
minigrep_maybe
Just another minigrep from Rust book
-
uniaxe
replace Unicode letters with Ascii equivalents
-
goya
morphological analyzer for Rust and WebAssembly
-
inf-rs
INF file parse library
-
lazy-char-iter
Lazily iterate over Unicode characters from a u8 slice in Rust
-
glyphrs
A text art generator written in Rust
-
textframe
query plain text documents by unicode offset without loading them all into memory
-
nerd_font
Enumerated contents of the JetBrainsMono Nerd Font
-
aklat
create books from markdown files (like Gitbook)
-
harper
implement SSGs
-
diacritics
Remove diacritics from letters, for example when standardizing input for a search
-
bible-reference-rs
Extract Bible references from plain text
-
lexmatch
lexicon matching tool that, given a lexicon of words or phrases, identifies all matches in a given target text. Uses suffix arrays.
-
braille_pics
producing text-art pictures using Braille characters
-
unic-ucd-segment
UNIC — Unicode Character Database — Segmentation Properties
-
fuzzy-string-distance
Fuzzy string distance comparisons
-
ssml-parser
parsing speech sythnesis markup language
-
cli_app_capo15
CLI application with Unix-like tools
-
mdbook-html
mdBook HTML renderer
-
textos
Texts, strings, formatting, unicode…
-
codevis
turning your code into one large image
-
namesake
A CLI tool and library to generate names by combining words
-
supergrep
A super grepper
-
unicode_categories_zheyuye
Query Unicode category membership for chars
-
spongedown
Converts markdown to html with svgbob support
-
strings
String utilities, including an unbalanced Rope
-
okane-core
support parsing, emitting and processing Ledger (https://www.ledger-cli.org/) format files
-
lingua-latvian-language-model
The Latvian language model for Lingua, an accurate natural language detection library
-
allsorts-subset-browser
Temp fork of allsorts 0.15 - includes patch for subsetting fonts for browsers
-
typora-img-to-base64
Typora image to base64
-
text2art
translate text to ascii art
-
khat
A cat clone, nothing more nothing less
-
chinese_segmenter
Tokenize Chinese sentences using a dictionary-driven largest first matching approach
-
utf-8
Incremental, zero-copy UTF-8 decoding with error handling
-
marcus
An experimental Markdown parser written in Rust
-
markdownx
⇟
-
just-enough-emojis
text to emoji cli
-
boxy
Declarative builder for Unicode box-drawing characters
-
crate-starter
starter
-
ron_to_table
pretty print RON as a table
-
gmi2html
Convert text/gemini into HTML
-
kaomji-ru
The full http://kaomoji.ru database, encoded and escaped as byte literals
-
outerspace
Methods for prefixing and suffixing the non-whitespace characters in a string
-
quewuigrep
grep-like tool written in Rust
-
tb_normalization
normalization utf8 string, loc dau vietnamese and some language
-
japanese-ruby-filter
Japanese ruby notation parser
-
charmap
one-to-(none/one/many) character mapping
-
bookgrep
Basic grep equivalent, minor mods to Chapter 12
-
stringutils
A collection of various and (hopefully) useful String utility functions
-
jieba-macros
jieba-rs proc-macro
-
termbook
behind the
termbook-cli -
squ
command-line utility for converting quotation marks in plaintext files to "smart quotes"
-
morc
Dead simple, minimal markdown generator library written in Rust
-
csv2qr
command line tool for generating QR codes from a CSV file
-
prefixtree
Hash-table-based prefix tree library
-
jpreprocess-dictionary
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
borderrs
Add stylish borders around your text and datastructures
-
cologne_phonetics
generate phonetic cologne codes for utf8 strings
-
dd
a clone of the unix coreutil dd
-
lignin-html
HTML renderer for lignin VDOM Nodes. This crate is primarily for static and server-side rendering. For client-side use, see lignin-dom.
-
utf8_reader
A UTF-8 reader that read UTF-8 characters from object that implement Read trait
-
ogrep
searching in indentation-structured texts
-
lines_lossy
extension to
BufReadwith a functionlines_lossythat works likeBufRead::linesbut with lossy UTF-8 decoding -
rustyword
An anagram finder
-
grep-regex
Use Rust's regex library with the 'grep' crate
-
const-utf16
Utf8 to utf16 conversion functions for use in const contexts
-
atm_parser_helper_common_syntax
Parsing utilities for the common syntax specification
-
common-words-all
Most common words sorted by ngram frequency
-
oxcomm
using Google Translate on the fly
-
mdbook-force-relative-links
An
mdbookpre-processor to transform all local links to relative ones -
myanmar_util
A collection of tools for processing Myanmar text including syllable breaking and other utilities
-
lindera-dictionary-builder
Shared code for building Lindera dictionary files
-
readable-readability
Really fast readability
-
kpathsea_sys
Bindings to libkpathsea for efficient file management in the TeX ecosystem
-
typst-ts-svg-exporter
Export a frame of Typst document into svg element
-
tectonic_pdf_io
Xdvipdfmx’s PDF, XDV, and image I/O APIs in C, as a Rust crate
-
stfu
Shut The Ferris Up - profanity filtering for Rust
-
ascii_utils
handle ASCII characters
-
butterkups-minigrep
Mini grep utility; very weak application, use grep instead
-
horrible-katex-hack
A way of using KaTeX with rustdoc by abusing the doc(html_favicon_url) attribute
-
binatime
A binary clock in the terminal
-
scenic
Presentation tool that utilizes LaTex Beamer package
-
latex-to-html
Latex to html converter
-
strip-tags
Strip HTML and PHP tags from strings
-
rmbs
Remove any fluff, corporate speak, or other bullshit from input text and print the TL;DR essence of what's being said, using the www.bullshitremover.com public LLM API
-
dictionary-1024
a mnemonic dictionary with 1024 words, 10 bits per word
-
kanabake
Turn Romaji strings into Hiragana or Katakana
-
rsonpath-test-codegen
Blazing fast JSONPath query engine powered by SIMD. TOML-based test codegen for rsonpath-lib.
-
economic_indicator_finder
A finder for extracting economic indicators from paragraphs
-
chisel-parsers
Chisel parser front ends
-
recode_rs
Command-line tool for converting between the character encodings defined in the Encoding Standard
-
mdbook-to-github-wiki
Turns an mdbook book into a github wiki
-
flw
Process text via configurable tasks
-
env2toml
Convert env vars to toml text
-
lindera-ipadic-neologd
A Japanese morphological dictionary for IPADIC NEologd
-
hsk
Return HSK Level for Simplified Chinese Characters
-
caido-convert
Encoding library for web security applications
-
tectonic_bridge_graphite2
Expose the graphite2 library to Rust/Cargo
-
hexd
converting from varios encodings
-
demoji-rs
demoji implementation for Rust
-
utf8_slice
Lightweight UTF8 Slice Utilities
-
tantivy-object-store
A tantivy Directory implementation against object stores (S3, GCS, etc.)
-
indexable_str
creating indexable strings
-
unicode_escape
decoding escape sequences in strings
-
html2runes
An HTML to Text converter
-
lib-ruby-parser-ast
AST for lib-ruby-parser
-
note-mark
A markdown parser
-
cjieba-sys
unsafe ffi to cppjieba
-
llmvm-outsource-lib
outsource backend for llmvm
-
nipah_tokenizer
A powerful yet simple text tokenizer for your everyday needs!
-
rustextile
Textile markup language parser for Rust
-
basic-text-internals
Basic Text string literal implementation details
-
catmark
Console printer for CommonMark
-
ced
Dead easy csv editor
-
rbatis_xml_parser
Push based streaming parser for xml
-
cautious-octo-funicular
Test: shipping an mdbook with API docs
-
text-sanitizer
convert text to plain ASCII text
-
unicode-line-stacker
Tiny library for stacking Unicode line-drawing characters on top of each other
-
base_u256
base-u256 is to utf-8 as base-64 is to ascii
-
xsv
A high performance CSV command line toolkit
-
tfidf-summarizer
Basic tf-idf compute for documents
-
asimov-snapshot-cli
ASIMOV Snapshot Command-Line Interface (CLI)
-
decline-word
Choose word form based on given number
-
grep-printer
grep crate's Sink trait that provides standard printing of search results, similar to grep itself
-
mdtranslation-cli
Command-line tools for using mdTranslation, which can be used to prepare multi-lingual Markdown documents
-
minigrep_yunzhu
A fun toll to search text for a pattern
-
STKLR
STKLR: pronounced 'stickler'. Is a cli tool to automatically link functions, enums, structs, traits etc in rust-doc docstrings. I couldn't find a tool like this when I needed it so... here we are.
-
asimov-luma-module
ASIMOV module
-
lingua-portuguese-language-model
The Portuguese language model for Lingua, an accurate natural language detection library
-
truncrate
truncate strings economically without splitting your characters, symbols, emojis etc
-
quill_delta_pdf
Convert Quill Delta to PDF
-
terminal-emoji
safely displaying emoji inside of terminals
-
hemoglobin-search
Hemoglobin search utilities
-
unic-char-range
UNIC — Unicode Character Tools — Character Range and Iteration
-
wz-conf
Configuration options for wz
-
vaporetto_rules
Rule-base filters for Vaporetto
-
nutrimatic
Tools for reading Nutrimatic (https://nutrimatic.org) index files
-
code-span
Add additional infomation to code character
-
sgrep
grep util for those lazy to remember many command line options
-
grep-reader
short text for crates.io
-
terminal-supports-emoji
Check whether the current terminal supports emoji
-
utf8toipv4
Convert UTF-8 to ipv4 addresses and vice versa
-
flesh-reading-ease
Calculate Flesh Reading Ease for a given text and language
-
ansi-to-tui-forked
convert ansi color coded text into ratatui::text::Text type from ratatui library
-
deinflect
japanese deinflection
-
fozzie
A quick fuzzy finder for the terminal
-
htmlfixinator
A composable HTML transformation library with filters for cleaning, modifying, and standardizing HTML content
-
char_stream
Unified character reading interface to str, String, bytes, File and Stdin for Rust language
-
encoding_c_mem
C API for encoding_rs::mem
-
minigrep_improved
A command line tool to search for a string in a file
-
poetry-book
Create a poetry book in latex, starting from plain text
-
stopwords
from popular text processing frameworks
-
jpreprocess-window
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
asciir
Print ASCII table/values
-
pretok
A string pre-tokenizer for C-like syntaxes
-
bgrep
grep tailored to handle binary patterns and files
-
cowsay
program that generates ASCII art pictures of a cow with a message
-
xgrepx
xgrep is a rust implementation of grep. This is a follow up from the rust book
-
rmgrep
rust min grep text tools
-
kilo
small, fast utility crate/library for manipulating strings and generating sourcemaps with all in Magic 🪄
-
TextToEmoji
converting words to emoji representations
-
asimov-core
ASIMOV Software Development Kit (SDK) for Rust
-
password-characters
help with the "enter the 12th, 35th, and 63rd characters from your password" situations
-
simple-word-count
word count function, try to get same result with Microsoft Office Word application
-
hex_d_hex
HexDHex is a Rust Crate that encodes and decodes byte data to and from its hexidecimal representation. For instance, one may wish, on ocasion that is, to translate a utf8 or ASCII string…
-
inclip
terminal utility for clipboard interaction and convenience
-
lll_toon
convert JPG images to ASCII art with colors
-
yuto51942-servant
cli
-
czv
performing CSV-related operations for data engineering and analysis
-
simple_grep
command-line text search tool in Rust, inspired by grep
-
morsels_lang_ascii
Basic ascii tokenizer for morsels
-
procedural-masquarade
Incorrect spelling for procedural-masquerade
-
raekna-common
contains common types and functionality for the raekna project
-
Grep_Matt_C
clone of grep
-
base_emoji
Convert everything to Emojis (and back)! - 🐫🔦🔥🎲🐬🍀🍟🔋🐬🎲🐬🍀🎁🍟💧👂🔥🚪🔋🍟🔦🔋🚗👣🔦🌍👂🍤🐗
-
iterlower
Final-sigma-correct lowercasing iterator adapter with option for Turkish/Azeri I behavior
-
niho
A command-line tool for converting romanized Japanese text to Japanese characters
-
rure
A C API for Rust's regular expression library
-
ammonia
HTML Sanitization
-
anagram
A collection of anagram utility functions
-
ibm1047
String conversions to and from IBM-1047
-
cha-rs
Extract specific characters from an input
-
liendl_tokenizer
BPE tokenizer for Rust
-
fzy-tmux
A rewrite of fzy's fzy-tmux script in Rust
-
libnewsboat
libraries used by Newsboat reader
-
mdbook-core
The base support library for mdbook, intended for internal use only
-
llmvm-core-lib
llmvm core application
-
conv-rs
CLI converter for multi extensions
-
mojimoji-rs
fast converter between Japanese hankaku and zenkaku characters, mojimoji
-
asimov-maildir-module
ASIMOV module for Maildir email import
-
markdown2unicode
Converter from markdown notation to unicode characters
-
lingo
detect text languages
-
jposta
A fast and intuitive Terminal User Interface (TUI) tool for searching Japanese postal codes and addresses
-
libgrep-rs
searching through text
-
meaningsearch
package that helps you find meaningful lines of any given input. Especially useful in CTFs.
-
alpino-tokenize
Wrapper around the Alpino tokenizer for Dutch
-
finl-charsub
Character substitution module for in-line replacement of fixed strings
-
peppergrep
grep utility written following the 12th chapter of the Rust book. Some little modifications were made.
-
tectonic_bridge_icu
Expose a subset of the ICU Unicode APIs to Rust/Cargo
-
lingua-slovak-language-model
The Slovak language model for Lingua, an accurate natural language detection library
-
nb2fr
Convert integer number to literal French text
-
naming_clt
Extract and convert the naming format(case|notation) of identifiers from files or stdin. Use this tool to prepare identifier name strings for further operations (matching,replacing...) on relative files
-
mdbook-multicode
Allows you to give multilanguage code examples, toggled by a spinner
-
color-convert
Support RGB,RGBA,HEX,HSL,HSLA,HSV,CMYK to convert each other, write by rust
-
ascii-alphabetic-char
Traits for ASCII alphabetic characters
-
mdtransform
program that renders Markdown as HTML for websites
-
read_chars
An iterator over characters read from some I/O source
-
wn-parser
parser for WordNet database files
-
lindera-filter
Character and token filters for Lindera
-
askama-filters
Extra template filters for Askama
-
asaru
Asana Tasks Search CLI
-
diff_match_patch
Diff Match Patch is a high-performance library in Rust-lang that manipulates plain text
-
writedown
format parser
-
minigrep_desonglll
grep implementation from The Rust Programing Book
-
utfx
-
ssam
short for split sampler, splits one or more text-based input files into multiple sets using random sampling. This is useful for splitting data into a training, test and development sets, or whatever sets you desire.
-
figlet-rs
FIGlet to create ascii art
-
grepster
command-line tool for searching text in files
-
lindera-compress
A morphological analysis library
-
umlauts
text transformation of german umlauts
-
asimov-patterns
ASIMOV Software Development Kit (SDK) for Rust
-
suffix
arrays
-
html_to_markdown
Convert HTML to Markdown
-
folia
High-performance library for handling the FoLiA XML format (Format for Linguistic Annotation)
-
search-in-terminal
A terminal-based search tool
-
fimdoc
Firendship is Magic Document, converts Markdown into FIMFiction BBCode
-
minigrep5
grep implementation in Rust
-
allsorts_no_std
Font parser, shaping engine, and subsetter for OpenType, WOFF, and WOFF2
-
trexter
Text progression tracking library
-
freddo
An opinionated library to access the Federal Reserve of St Louis data API
-
grep-searcher
Fast line oriented regex searching as a library
-
asimov-mbox-module
ASIMOV module for Mbox email import
-
lingua-russian-language-model
The Russian language model for Lingua, an accurate natural language detection library
-
uchr
translate char values to the characters themselves, and vice versa
-
regex-test
Infrastructure for testing regexes. You probably don't want to use this crate unless you're working on a regex implementation
-
lingua-czech-language-model
The Czech language model for Lingua, an accurate natural language detection library
-
lingua-italian-language-model
The Italian language model for Lingua, an accurate natural language detection library
-
xmldecl
Extracts an encoding from an ASCII-based bogo-XML declaration in text/html in a Web-compatible way
-
tectonic_engine_xdvipdfmx
The
xdvipdfmxprogram as a reusable crate -
ucd-parse
parsing data files in the Unicode character database
-
deepfrog
A deep learning NLP suite (PoS,lemmatiser,NER) with FoLiA XML support
-
milligrep
Custom simplified implementation of grep
-
ddvm
Document to Document Virtual Machine
-
pulldown-html-ext-cli
CLI tool for extended HTML rendering of Markdown with pulldown-cmark
-
text-search
easy to use plug and play wrapper around tantivy for simple search scenarios
-
fontconfig-rs
Safe, higher-level wrapper around the fontconfig library
-
h_hangul
Korean Characters
-
marmite
easiest static site generator
-
cyware-convert
Encoding library for web security applications
-
tiniestsegmenter
Compact Japanese segmenter
-
clippy-to-md
cli tool to convert clippy json reports to markdown files
-
slicer
that slices string slices into smaller string slices
-
drawille-nostd
A no_std version of the drawille crate
-
guarding_parser
Guarding is a guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.
-
lindera-tokenizer
A morphological analysis library
-
slidedeck
Create an HTML slide deck from Markdown
-
rut
A small UTF-8 parsing library for applications that need to parse individual
chars -
llmvm-chat
An llmvm frontend that acts as a CLI chat interface
-
findtext_sheet
Search text in SpreadSheet
-
iasthk
Harvard-Kyoto to IAST conversion
-
emojito
Find all the Emoji in a string. Supports composed emoji.
-
pdf-min
Very minimal crate for writing PDFs
-
emoji_converter
Converts text to emojis
-
indexrs
inefficient multi-language search index
-
lingua-arabic-language-model
The Arabic language model for Lingua, an accurate natural language detection library
-
rigrep
grep from Rust Book
-
my_mini_grep
A mini application that aims to replicate the behavior of the grep shell command
-
pikchr-cli
PIC-like diagramming language to SVG converter
-
darts
A double array trie, A Forward Maximum Matching Searcher
-
asimov-xai-module
ASIMOV xAI module
-
morsels_lang_chinese
Chinese tokenizer for morsels
-
docstring
manipulating and parsing documentation strings
-
fst-subseq-ascii-caseless
An automaton that matches if the input contains a specific subsequence ignoring ASCII case to be used with fst
-
lindera-py
Python binding for Lindera
-
uiuifree-text-data
csv and excel convert
-
rand-hira
CLI tool to generate random hiragana characters
-
stamd
Webservice for working with stand-off annotations on text (STAM)
-
mini-grep
A test crate with mini grep as in The Book
-
markdown_to_html_parser
parses Markdown syntax into HTML
-
pdf-annotations-converter
Converts annotations found in PDF files to different formats
-
indentation_flattener
From indented input, generate plain output with indentation PUSH and POP codes
-
morel
A string searching library
-
wwdc-parser
convert WWDC video pages to markdown
-
jpreprocess-dictionary-builder
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
font-map-core
Core font-parsing capabilities for font-map
-
lingua-polish-language-model
The Polish language model for Lingua, an accurate natural language detection library
-
findtext_pdf
Search text in PDF
-
asimov-anthropic-module
ASIMOV Anthropic module
-
emojicons-2021
Parse :emoji: notation to unicode representation
-
unicode-utf8
that converts utf-8 bytes to a unicode scalar value, and vice versa
-
simplearrayhash
-
grep-table-converter
A cli utility to convert grep result to table (csv, markdown, textile)
-
yeslogic-fontconfig
RENAMED: use the fontconfig crate instead
-
grep
Fast line oriented regex searching as a library
-
chardetng_c
C bindings for chardetng
-
flx
Full-text searching and scoring of strings
-
any2utf8
Convert any enncoding to utf-8
-
grep-pcre2
Use PCRE2 with the 'grep' crate
-
rust-cedar
efficiently-updatable double-array trie in Rust (ported from cedar)
-
asimov-config-cli
ASIMOV Configuration Command-Line Interface (CLI)
-
utf8reader
wrapper around Reader that returns a stream of UTF-8 characters
-
jpreprocess-core
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
asimov-gemini-module
ASIMOV Gemini module
-
lindera-analyzer
A morphological analysis library
-
backslash
parsing escape characters
-
md2pdf
A small utility to convert markdown files to pdf exploiting tectonic
-
lingua-persian-language-model
The Persian language model for Lingua, an accurate natural language detection library
-
lingua-dutch-language-model
The Dutch language model for Lingua, an accurate natural language detection library
-
ergrep
grep strings within a line from a text file
-
lindera-core
A morphological analysis library
-
mathml-latex
Convert between MathML and LaTeX
-
findtext_doc
Search text in Document
-
lindera-assets
A helper crate to fetch assets and build dictionary for lindera
-
asimov-mlx-module
ASIMOV MLX module
-
asimov-repository-cli
ASIMOV Repository Command-Line Interface (CLI)
-
lingua-turkish-language-model
The Turkish language model for Lingua, an accurate natural language detection library
-
x1b
State tracker for 0x1b terminal escape codes. Also gives random characters names.
-
text_to_emoji
Convert text to emoji
-
worcher
full-text search for static websites
-
lingua-greek-language-model
The Modern Greek language model for Lingua, an accurate natural language detection library
-
asimov-http-module
ASIMOV module for HTTP/HTTPS protocol support
-
asimov-construct-cli
ASIMOV Construct Command-Line Interface (CLI)
-
minigrep1800
command-line tool for searching text files
-
spyglass
Search engine for documents, inspired by bioinformatics
-
lingua-hindi-language-model
The Hindi language model for Lingua, an accurate natural language detection library
-
lingua-bokmal-language-model
The Bokmal language model for Lingua, an accurate natural language detection library
-
asimov-account-cli
ASIMOV Account Command-Line Interface (CLI)
-
lingua-indonesian-language-model
The Indonesian language model for Lingua, an accurate natural language detection library
-
lingua-danish-language-model
The Danish language model for Lingua, an accurate natural language detection library
-
lingua-ukrainian-language-model
The Ukrainian language model for Lingua, an accurate natural language detection library
-
lingua-mongolian-language-model
The Mongolian language model for Lingua, an accurate natural language detection library
-
lingua-malay-language-model
The Malay language model for Lingua, an accurate natural language detection library
-
lingua-vietnamese-language-model
The Vietnamese language model for Lingua, an accurate natural language detection library
-
lingua-lithuanian-language-model
The Lithuanian language model for Lingua, an accurate natural language detection library
-
lingua-hungarian-language-model
The Hungarian language model for Lingua, an accurate natural language detection library