-
regex
regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
unicode-width
Determine displayed width of
char
andstr
types according to Unicode Standard Annex #11 rules -
textwrap
word wrapping, indenting, and dedenting strings. Has optional support for Unicode and emojis as well as machine hyphenation.
-
encoding_rs
A Gecko-oriented implementation of the Encoding Standard
-
similar
A diff library for Rust
-
fancy-regex
regexes, supporting a relatively rich set of features, including backreferences and look-around
-
const_format
Compile-time string formatting
-
convert_case
Convert strings into any case
-
ropey
A fast and robust text rope for Rust
-
lazy-regex
lazy static regular expressions checked at compile time
-
pulldown-cmark
A pull parser for CommonMark
-
deunicode
Convert Unicode strings to pure ASCII by intelligently transliterating them. Suppors Emoji and Chinese.
-
unicode-normalization
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
widestring
wide string Rust library for converting to and from wide strings, such as those often used in Windows API or other FFI libaries. Both
u16
andu32
string types are provided, including support for UTF-16 and UTF-32… -
emojis
✨ Lookup emoji in *O(1)* time, access metadata and GitHub shortcodes, iterate over all emoji, and more!
-
termimad
Markdown Renderer for the Terminal
-
rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
ammonia
HTML Sanitization
-
lopdf
PDF document manipulation
-
html2text
Render HTML as plain text
-
mdbook
Creates a book from markdown files
-
printpdf
reading and writing PDF files
-
lngcnv
linguistics: display pronunciation, translate between dialects, convert between orthographies; support for multiple languages: English, Latin, Polish, Quechua, Spanish, Tikuna
-
prettydiff
Side-by-side diff for two files
-
fuzzy-matcher
Fuzzy Matching Library
-
linkify
Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.
-
regress
A regular expression engine targeting EcmaScript syntax
-
slug
Convert a unicode string to a slug
-
comrak
A 100% CommonMark-compatible GitHub Flavored Markdown parser and formatter
-
onig
Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.
-
text-splitter
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
-
unicode-segmentation
Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules
-
unicode-bidi
Unicode Bidirectional Algorithm
-
finl_unicode
handling Unicode functionality for finl (categories and grapheme segmentation)
-
unicode-general-category
Fast lookup of the Unicode General Category property for char
-
pulldown-cmark-to-cmark
Convert pulldown-cmark Events back to the string they were parsed from
-
lindera
A morphological analysis library
-
garde
Validation library
-
Inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
sqlformat
Formats whitespace in a SQL string to make it easier to read
-
diff
An LCS based slice and string diffing implementation
-
synoptic
low-level, syntax highlighting library with unicode support
-
charabia
detect the language, tokenize the text and normalize the tokens
-
shell-escape
Escape characters that may have a special meaning in a shell
-
unescaper
Unescape strings with escape sequences written out as literal characters
-
indenter
A formatter wrapper that indents the text, designed for error display impls
-
str_indices
Count and convert between indexing schemes on string slices
-
unicode-script
exposes the Unicode
Script
andScript_Extension
properties from UAX #24 -
diffy
Tools for finding and manipulating differences between files
-
const-str
compile-time string operations
-
roff
ROFF (man page format) generation library
-
chardetng
A character encoding detector for legacy Web content
-
titlecase
Capitalize text according to a style defined by John Gruber for Daring Fireball
-
unicode_names2
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
cruet
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
smartcat
Putting a brain behind
cat
. CLI interface to bring language models in the Unix ecosystem 🐈⬛ -
autocorrect
A linter and formatter for help you improve copywriting, to correct spaces, words, punctuations between CJK (Chinese, Japanese, Korean)
-
wana_kana
checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji
-
ferris-says
flavored replacement for the classic cowsay
-
line-index
Maps flat
TextSize
offsets to/from(line, column)
representation -
prefix
A customizable pretty printer for FIX messages
-
stringzilla
Faster SIMD-accelerated string search, sorting, fingerprints, and edit distances
-
uuhelp_parser
A collection of functions to parse the markdown code of help files
-
eloran
Comics and Ebook web library written in rust, with reading, search, reading status, bookmarks
-
entities
raw data needed to convert to and from HTML entities
-
jieba-rs
The Jieba Chinese Word Segmentation Implemented in Rust
-
regex-cursor
regex fork that can search discontiguous haystacks
-
charset
Character encoding decoding for email
-
presenterm
A terminal slideshow presentation tool
-
mdbook-katex
mdBook preprocessor rendering LaTeX equations to HTML
-
tiefdownconverter
A CLI tool to manage and convert Markdown-based projects
-
mdbook-pdf
A backend for mdBook written in Rust for generating PDF based on headless chrome and Chrome DevTools Protocol
-
epub-builder
generating EPUB files
-
scraps
A static site generator that makes it easy to write internal links (Wiki links) in Markdown files
-
stringsext
find multi-byte-encoded strings in binary data
-
typst-kit
Common utilities for Typst tooling
-
text-size
Newtypes for text offsets
-
hgrep
grep tool with human-friendly search output. This is similar to
-C
option ofgrep
command, but its output is enhanced with syntax highlighting focusing on human readable outputs. -
matchers
Regex matching on character and byte streams
-
hck
A sharp cut(1) clone
-
uncased
Case-preserving, ASCII case-insensitive, no_std string types
-
wildcard
matching
-
unicode-case-mapping
Fast lowercase, uppercase, and titlecase mapping for characters
-
pluralizer
Rust package to pluralize or singularize any word based on a count inspired on pluralize NPM package
-
mkrs
Build automation tool
-
prop-check-rs
A Property-based testing Library in Rust
-
mdbook-admonish
A preprocessor for mdbook to add Material Design admonishments
-
rphonetic
Rust port of phonetic Apache commons-codec algorithms
-
font-types
Scalar types used in fonts
-
rumdl
A fast Markdown linter written in Rust (Ru(st) MarkDown Linter)
-
unicode-id
Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31
-
languagetool-rust
LanguageTool API bindings in Rust
-
pdf-extract
extract content from pdfs
-
unicode-xid
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
anthropic-text-editor
A micro-CLI to apply tool calls from Anthropic for their text_editor_20250124 built-in computer use tool
-
decancer
that removes common unicode confusables/homoglyphs from strings
-
stop-words
Common stop words in many languages
-
unicode-joining-type
Fast lookup of the Unicode Joining Type and Joining Group properties
-
ascii
ASCII-only equivalents to
char
,str
andString
-
netidx
Secure, fast, pub/sub messaging
-
vaporetto
pointwise prediction based tokenizer
-
marmite
easiest static site generator
-
difflib
Port of Python's difflib library to Rust
-
unindent
Remove a column of leading whitespace from a string
-
llmvm-core
The core application for llmvm
-
stringcase
Converts string cases between camelCase, COBOL-CASE, kebab-case, and so on
-
unicode-blocks
contains a list of all unicode blocks and provides some functions to search across them
-
slice-command
slice is a command-line tool that allows you to slice the contents of a file using syntax similar to Python's slice notation
-
thoth-note
note-taking app written in Rust
-
mdbook-yapp
A mdBook preprocessor for simple text replacements
-
blurhash
A pure Rust implementation of blurhash
-
hyperlink
Very fast link checker for CI
-
wezterm-bidi
The Unicode Bidi Algorithm (UBA)
-
newdoc
Generate pre-populated module files formatted with AsciiDoc that are used in Red Hat and Fedora documentation
-
cow-utils
Copy-on-write string utilities for Rust
-
any_ascii
Unicode to ASCII transliteration
-
norad
Read and write Unified Font Object files
-
omekasy
Decorate alphanumeric characters in your input with various font; special characters in Unicode
-
idna
IDNA (Internationalizing Domain Names in Applications) and Punycode
-
text_io
really simple to use panicking input functions
-
mdbook-pandoc
A pandoc-powered mdbook backend
-
unicode_categories
Query Unicode category membership for chars
-
escrit
learning languages by reading texts
-
aho-corasick-unsafe
Fast multiple substring searching
-
pad
padding strings at runtime
-
unicode-reverse
Unicode-aware in-place string reversal
-
tossicat
입력된 단어에 맞게 같이 입력된 토시(조사)를 적절하게 변환하는 라이브러리
-
hyphenation
Knuth-Liang hyphenation for a variety of languages
-
mdbook-catppuccin
🎊 Soothing pastel theme for mdBook
-
aki-resort
sort lines of text. You can use regex to specify the KEY.
-
hypher
separates words into syllables
-
create_broken_files
Create broken files from other ones
-
collclean
Clean up collaboration commands in LaTeX files
-
xi-unicode
Unicode utilities useful for text editing, including a line breaking iterator
-
jx
An interactive JSON explorer for the command line
-
boreal
evaluate YARA rules, used to scan bytes for textual and binary pattern
-
efmt
Erlang code formatter
-
mdcat
cat for markdown: Show markdown documents in terminals
-
near-facsimile
Find similar or identical text files in a directory
-
topiary-cli
CLI app for Topiary, the universal code formatter
-
unicode-ccc
Unicode Canonical Combining Class detection
-
ascii-canvas
canvas for drawing lines and styled text and emitting to the terminal
-
chewing
(酷音) intelligent Zhuyin input method
-
spellbook
A spellchecking library compatible with Hunspell dictionaries
-
gibberish-or-not
Figure out if text is gibberish or not
-
deno_media_type
Media type used in Deno
-
lindera-ko-dic-builder
A Korean morphological dictionary builder for ko-dic
-
mlc
The markup link checker (mlc) checks for broken links in markup files
-
sd
An intuitive find & replace CLI
-
bullet_stream
Bulletproof printing for bullet point text
-
stam
powerful library for dealing with stand-off annotations on text. This is the Rust library.
-
igrepper
The interactive grepper
-
mdbook-combiner
combine mdbook summaries from multiple source into one mdbook
-
unic-char-property
UNIC — Unicode Character Tools — Character Property taxonomy, contracts and build macros
-
mdbook-epub
An EPUB renderer for mdbook
-
skyspell
Fast and handy spell checker for the command line
-
tbll
tbll outputs data in tabular format
-
moonwave
generating documentation from comments in Lua source code
-
file-organiser
Command line file manager to list, move or delete large numbers of files in nested folders filtered by age, file extension, file name pattern and/or size range
-
gotenberg_pdf
client for the Gotenberg PDF API
-
fm
Non-backtracking fuzzy text matcher
-
reword
some utility functions for human-readable formatting of words
-
mdbook-environment
A preprocessor for MdBook for working with environment variables
-
yaxp-cli
<yaxp-cli ⚡> Yet Another XML Parser CLI
-
vidyut-prakriya
A Sanskrit word generator
-
nvl-cli
A program to download webnovels
-
probly-search
A lightweight full-text search engine with a fully customizable scoring function
-
ncount
A word count tool intended to derive useful stats from markdown
-
molybdenum
Recursive search and replace CLI application
-
aws-sdk-georoutes
AWS SDK for Amazon Location Service Routes V2
-
stfu8
Sorta Text Format in UTF-8
-
dptran
run DeepL translations on command line written by Rust
-
frontmatter-gen
generating and parsing frontmatter in various formats
-
annatto
Converts linguistic data formats based on the graphANNIS data model as intermediate representation and can apply consistency tests
-
yake-rust
Yake (Yet Another Keyword Extractor) in Rust
-
svgdx-pandoc
pandoc filter for svgdx codeblocks in Markdown
-
mdbook-quiz
Interactive quizzes for your mdBook
-
subplot
tools for specifying, documenting, and implementing automated acceptance tests for systems and software
-
fax
Decoder and Encoder for CCITT Group 3 and 4 bi-level image encodings used by fax machines TIFF and PDF
-
bashtestmd
Compiles shell commands in .md files into Bash scripts for testing
-
line-ending
Detect, normalize, and convert line endings across platforms, including support for character streams. Ensures consistent handling of LF, CRLF, and CR line endings in text processing.
-
sapling-renderdag
Render a DAG into ASCII or Unicode text
-
htmd-cli
The command line tool for htmd
-
mdbook-typst
An mdBook backend to output Typst markup, pdf, png, or svg
-
tibco_ems
A high level API for the Tibco EMS
-
latex-thebib
Clean and sort legacy TeX bibliographies written using ‘thebibliography’ via the
refactor
sub-command. Compile BibTeX files to legacythebibliography
TeX code using thecompile
sub-command… -
COXave
Instruments for codings
-
fasttext
binding
-
stylin
Convert markdown to pandoc markdown with custom styles
-
lipsum
lorem ipsum text generation library. It generates pseudo-random Latin text. Use this if you need filler or dummy text for your application. The text is generated using a simple Markov chain…
-
fontfor
find fonts which can show a specified character and preview them in terminal or browser
-
autumnus
Syntax highlighter powered by Tree-sitter and Neovim themes
-
qpdf
Rust bindings to QPDF C++ library
-
jetscii
A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes
-
wordcut-engine
Word segmentation/breaking library
-
unicode-bidi-mirroring
Unicode Bidi Mirroring property detection
-
regex-literal
delimited regular expression literals
-
sublime_fuzzy
Fuzzy matching algorithm based on Sublime Text's string search
-
repgrep
An interactive command line replacer for
ripgrep
-
llguidance
Super-fast Structured Outputs
-
mdbook-graphviz
mdbook preprocessor to add graphviz support
-
dvd-term
A bouncing ASCII art DVD logo (or custom text) for the terminal
-
levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
krafna
terminal-based alternative to Obsidian's Dataview plugin, allowing you to query your Markdown files using standard SQL syntax
-
matcher_rs
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
fastn-type
fastn: Full-stack Web Development Made Easy
-
codetypo-dict
Source Code Spelling Correction
-
vlazba
Lojban words generator and analyzer
-
indefinite
Prefix a noun with an indefinite article - a or an - based on whether it begins with a vowel
-
PrintLib
The ultimative printing libary for Rust #cli #string #color #term_color
-
line-numbers
Find line numbers in strings by byte offsets, quickly
-
bbd
Binary Braille Dump
-
roman-numerals-rs
Manipulate well-formed Roman numerals
-
none-shall-pass
Artifact for GitHub Action to validate hyperlinks in all markdown files
-
obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
simple-string-patterns
Makes it easier to match, split and extract strings in Rust without regular expressions. The parallel string-patterns crate provides extensions to work with regular expressions via the Regex library
-
tuicam
Terminal-based camera with switchable modes
-
rustkorean
processing Korean characters. It provides functionalities to check if a character is Korean, classify Korean characters, verify if a character is a leading consonant (choseong), a medial vowel (jungseong)…
-
arf-strings
Encoding and decoding for ARF strings
-
inflections
High performance inflection transformation library for changing properties of words like the case
-
pager
pipe your output through an external pager
-
gh-emoji
Convert
:emoji:
to Unicode using GitHub’s emoji names -
zhconv
Traditional/Simplified and regional Chinese variants converter based on MediaWiki & OpenCC rulesets and powered by AC automata 轉換简体、繁體及兩岸、新馬中文地區詞,基於MediaWiki和OpenCC之字詞轉…
-
minimizer
Minimize files to find minimal test case
-
dmos
Djot HTML renderer with advanced features
-
autotex
Continuously compile TeX and LaTeX
-
instant-segment
Fast English word segmentation
-
gline-rs
Inference engine for GLiNER models
-
vmks-exam-generator
CLI program for pseudo-randomly generating different variants of an embedded programming exam
-
arrow-row
Arrow row format
-
emojic
Emoji constants
-
picodiff
Tiny GUI app to compare text easily
-
zspell
Native Rust library for spellchecking
-
pathmut
Command line utility for manipulating path strings
-
grok
popular java & ruby grok library which allows easy text and log file processing with composable patterns
-
asciidork-parser
Asciidork parser
-
linelint-cli
Line lint cli
-
kelp
A convert tool for Japanese
-
aki-mcolor
mark up text with color
-
shwild
(for Rust)
-
chinese_dictionary
A searchable Chinese / English dictionary with helpful utilities
-
lindera-unidic-builder
A Japanese morphological dictionary builder for UniDic
-
unidoc
Unite all Markdown
-
duvet
A requirements traceability tool
-
utf16_iter
Iterator by char over potentially-invalid UTF-16 in &[u16]
-
jayce
tokenizer 🌌
-
nlpo3
Thai natural language processing library, with Python and Node bindings
-
yeslogic-ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched with support for additional tables
-
nlf
A CLI to append newline characters (LF) at the end of text file
-
whitespacesv
parser/writer for the Whitespace-Separated Value format, as defined by Stenway. See https://dev.stenway.com/WSV/. WSV offers an unambiguous alternative to CSV.
-
ripgrep_all
rga: ripgrep, but also search in PDFs, E-Books, Office documents, zip, tar.gz, etc
-
quickmd
Quickly preview a markdown file
-
cskk
C ABIから使う事を目的とした SKK(Simple Kana Kanji henkan)方式のかな漢字変換ライブラリ
-
secular
No Diacr!
-
sapling-streampager
streampager is a pager for command output or large files
-
adrs
Architectural Decision Record command line tool
-
precis-tools
Tools and parsers to generate PRECIS tables from the Unicode Character Database (UCD)
-
dec-sixbit
DEC SIXBIT encoding
-
uwc
Counts things in unicode text files
-
fasta-cleaner
Transform fasta files by upper-casing all sequence characters and removing non-ACGT sequence characters
-
rustic_print
A versatile Rust library for enhancing console output. It offers a range of features to create a more engaging and informative command-line interface.
-
unicode-security
Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules
-
wchar
Procedural macros for compile time UTF-16 and UTF-32 wide strings
-
huggingface/tokenizers-python
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
-
inkjet
A batteries-included syntax highlighting library for Rust, based on tree-sitter
-
plsfix
Text cleaner upper
-
quixote
Quizzes and tests in Markdown
-
addbib
An app to add linked bibliographies to markdown files
-
minimo
terminal ui library combining alot of things from here and there and making it slightly easier to play with
-
regexml
XPath compatible regex engine
-
nmd
Official NMD CLI and compiler
-
mdbook-mermaid
mdbook preprocessor to add mermaid support
-
serpl
terminal UI for search and replace, ala VS Code
-
pinyin-parser
Parses a string of pinyin syllables. Covers marginal cases such as
ẑ
,ŋ
andê
. -
rsrpp-cli
project for research paper pdf
-
vader-sentimental
A faster Rust version from the original Python VaderSentiment analysis tool
-
fuzzt
Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.
-
herring-automata
Automata construction for Herring
-
harfbuzz_rs
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
texted
A markdown blog platform that lets you in control of your data
-
tesseract-rs
Rust bindings for Tesseract OCR with optional built-in compilation
-
deduplicator
find,filter,delete Duplicates
-
colornames
An enum of color names, with a catchall RGB variant
-
swift-check
High-performance, robust, and expressive searching and validation (uses SIMD on x86_64, aarch64, and WASM)
-
latkerlo-jvotci
Tools for creating and decomposing Lojban lujvo
-
bpetok
CLI for tokenizing text input using Byte Pair Encoding (BPE)
-
mut-str
A toolkit for working with mutable string slices (&mut str)
-
files-to-text-cli
Files to text cli
-
mandown
Markdown to groff (man page) converter
-
clipcat
A command line tool for copying the contents to clipboard of multiple files in one go
-
textra
type less, write more
-
mdbook-llms-txt-tools
convert mdbook to llmstxt.org format
-
hlight
dedicated to delivering exceptional syntax highlighting capabilities
-
epcmanager
EPC text tool for RFID
-
hanja_hangul
that converts Chinese characters to Korean characters. That is, convert hanja to hangul
-
ripsecrets
A command-line tool to prevent committing secret keys into your source code
-
cai
The fastest CLI tool for prompting LLMs
-
r2md
Entire codebase to single markdown or pdf file
-
asciimath-unicode
Convert asciimath to unicode
-
tabprinter
creating and printing formatted tables in the terminal. It supports various table styles and offers both color and non-color output options.
-
text2num
Parse and convert numbers written in English, Dutch, Spanish, Portuguese, German, Italian or French into their digit representation
-
termdiff
Write a diff with color codes to a string
-
tasty
A CLI that runs API tests defined and grouped in TOML files
-
neofile
linux file command for windows. PATCH 4.
-
mdbook-angular
mdbook renderer to run angular code samples
-
pray
A tui tool for preparing a prompt to the llms
-
unicode-truncate
Unicode-aware algorithm to pad or truncate
str
in terms of displayed width -
capitalize
Change first character to upper case and the rest to lower case, and other common alternatives
-
mdbook-theme
A preprocessor and a backend to config theme for mdbook, especially creating a pagetoc on the right and setting full color themes from the offical ace editor
-
rapidfuzz
rapid fuzzy string matching library
-
simple_term_rewriter
A basic interface to rewrite concrete tree terms according to a set of rules
-
diacritics
Remove diacritics from letters, for example when standardizing input for a search
-
oxford_join
Join string slices with Oxford Commas!
-
gen-mdbook-summary
generate SUMMARY.md for mdbook project
-
dcsv
Dyanmic csv reader,writer,editor
-
kas-text
Text layout and font management
-
analiticcl
approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation
-
zet
zet finds the union, intersection, set difference, etc of files considered as sets of lines
-
notion2html
Convert Notion pages to HTML
-
frawk
an efficient Awk-like language
-
redpen-linter
Rust linter
-
iepub
epub、mobi电子书读写
-
unicode_titlecase
add Unicode titlecase and Turkish and Azeri locale upper/lowercase utilities to chars and strings
-
textalyzer
Analyze key metrics like number of words, readability, and complexity of any kind of text
-
rustyink
Blazing fast static site generator
-
dnd-character
A Dungeons and Dragons character generator
-
soundex-rs
that calculates soundex value
-
mktoc
Generate Table of Contents from Markdown files
-
lll_toon
convert JPG images to ASCII art with colors
-
mdbook-d2
D2 diagram generator plugin for MdBook
-
svgbob_cli
Transform your ascii diagrams into happy little SVG
-
lindera-cc-cedict-builder
A Chinese morphological dictionary builder for CC-CEDICT
-
ricat
A Rust-Based implemenation of classic UNIX
cat
command -
mdbook-pdf-headless_chrome
Control Chrome programatically
-
pulldown-cmark-toc
Generate a table of contents from a Markdown document
-
tantivy-stemmers
A collection of Tantivy stemmer tokenizers
-
dprint-plugin-markdown
Markdown formatter for dprint
-
furigana
Map furigana to a word given its reading
-
uast
Unicode Aware Saṃskṛta Transliteration in Rust 🦀
-
float-pretty-print
Format f64 for showing to user, not for serialisation
-
string-auto-indent
Normalizes multi-line string indentation while preserving platform-specific line endings
-
mdbook-nice
A mdbook plugin to add nice css to your book
-
hebrew_unicode_script
A low-level library designed to ascertain whether a character belongs to the Hebrew Unicode script. It supports checks for individual characters as well as for membership within collections
-
utf8_iter
Iterator by char over potentially-invalid UTF-8 in &[u8]
-
casile
The command line interface to the CaSILE toolkit, a book publishing workflow employing SILE and other wizardry
-
mago-docblock
Analyzes PHP docblocks to extract annotations, tags, and documentation comments, aiding tools that rely on inline documentation
-
regex-syntax
A regular expression parser
-
b2c2-tokenizer
b2c2のBASICコードのトーカナイザー?
-
ipset_lookup
ipset is a command-line tool that takes networks or IPs and searches through a lot of different threat feeds quickly. It can also download the feed data necessary to perform the queries…
-
line-span
Find line ranges and jump between next and previous lines
-
gst-plugin-textahead
GStreamer Plugin for displaying upcoming text buffers ahead of time
-
pulldown-cmark-mdcat
Render pulldown-cmark events to TTY
-
kathoey
text feminization using open corpus linguistics data
-
substring
method for string types
-
what-rs
Identify what something is! A pyWhat reimplementation in Rust
-
mdbook-alerts
mdBook preprocessor to add GitHub Flavored Markdown's Alerts to your book
-
bogrep
Full-text search for bookmarks from multiple browsers
-
vi
An input method library for vietnamese IME
-
upstream-ontologist
tracking of upstream project metadata
-
indent
Functions for indenting multiline strings
-
colored_text
adding colors and styles to terminal text
-
md-tui
A terminal markdown viewer
-
easy_reader
easily navigating forward, backward or randomly through the lines of huge files
-
tantivy-jieba
that bridges between tantivy and jieba-rs
-
whitespace-sifter
Sift duplicate whitespaces away!
-
rust-ai
A collection of 3rd-party AI APIs for Rust
-
timug
It has been created for personal blog creation purpose. Timus has its limits, but it fulfills the purposes for which it was created.
-
kbremap
Custom keyboard layouts for windows
-
mdbook_fork4ls
Fork of mdBook for mdBook_LS
-
in_definite
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
-
simpleml
parser/writer for the Simple Markup Language format, as defined by Stenway. See https://dev.stenway.com/SML/. SML builds on top of WSV but allows hierarchical structuring of data.
-
regex-charclass
Manipulate and convert regex character classes
-
mdbook-pagebreaks
A mdbook preprocessor to insert page breaks when rendering to HTML
-
safe-string
safe interface for interacting with multi-byte strings in Rust, namely IndexedStr, IndexedString, and IndexedSlice
-
shell2batch
Coverts simple basic shell scripts to windows batch scripts
-
zp
Copy the contents of the source file or the standard output buffer to the clipboard, with support for maintaining a history of copied content, allowing users to easily paste into another file or program
-
ib-pinyin
一个高性能拼音匹配库
-
ewts
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (lib)
-
unaccent
remove accents from strings, inspired by PostgreSQL's unaccent extension
-
ascii_help
help you quickly convert ASCII codes
-
cron_clock
A cron expression parser and schedule explorer. Rich documentation and case studies and related upper-level libraries are available.
-
rutie
The tie between Ruby and Rust
-
sgrep
grep util for those lazy to remember many command line options
-
eliza
natural language processing program developed by Joseph Weizenbaum in 1966
-
percent-encoding
Percent encoding and decoding
-
mdbook-aquascope
Interactive Aquascope editor for your mdBook
-
utf64
encode utf-8 strings into utf-64, and decode them back
-
vidyut-lipi
A Sanskrit transliterator
-
overlap-chunk
splitting text into chunks of specified size with adjustable overlap percentage
-
text-editing
string with utilities for editing
-
header-parsing
Simplifies parsing the headers of markdown inspired file formats
-
ragit
git-like rag pipeline
-
pink_accents
Replacement of patterns in string to simulate speech accents
-
lcat
lolcat in rust! Full unicode support, escapes for ANSI escape sequences, hue shift in Cubehelix
-
libabbs
aosc-os-abbs maintenance
-
deindent
A command line utility and Rust library to format overly-indented text
-
kataru
YAML-based dialogue engine for game development
-
sk-skimmer
Fuzzy Finder in rust!
-
multimarkov
generic tool for training and using multi-order Markov chains for procedural generation applications such as generating randomized but authentic-sounding names for characters and locations
-
flat_string
FlatString is fixed allocated size String that that can be created direcly on the stack
-
twars-url2md
A powerful CLI tool that fetches web pages and converts them to clean Markdown format using Monolith for content extraction and htmd for conversion
-
hh-highlight
hh: The text highlighter. Read from stdin, and highlight selected words printed to stdout
-
mdbook-linkcheck2
A backend for
mdbook
which will check your links for you -
pukram2html
converting Pukram-formatted text to HTML
-
armnod
random string generator
-
bwrap
A fast, lightweight, embedded systems-friendly library for wrapping text
-
goofy-animals
Generate a name in adjective-adjective-animal form
-
mdbook-chess
An mdbook preprocessing plugin to generate chess boards
-
avatarsay
Beautiful quotes from Avatar: The Last Airbender
-
rust_file_encode_mode_convert
这是一个rust的库,用于检测文件的编码格式。 支持GBK,GBK2312 , UTF8, UTF16LE, UTF16BE, UTF8+BOM,UTF32 等多种编码格式。
-
uwurs
UwUify your strings with uwurs!
-
rwalk
A blazing fast web directory scanner
-
pandoc
API that wraps calls to the pandoc 2.x executable
-
natural
Pure rust library for natural language processing
-
fortune-rs
classic BSD fortune program
-
seshat-unicode
A Unicode Library for Rust. Unicode 16.0.0 ready. XID_Start and XID_Continue are also available.
-
keep-a-changelog
generating and manipulating CHANGELOG.md files that use the Keep A Changelog format
-
reason-shell
Reason: A Shell for Research Papers
-
portmanteau
create portmanteaux
-
date_time_parser
Rust NLP library for parsing English natural language into dates and times
-
hyperscan
bindings for Rust with Multiple Pattern and Streaming Scan
-
tfon
Bitmap font parsing / conversion
-
minspan
a package for determining the minimum span of one vector within another
-
shvar
POSIX-compliant shell variable substitution routine
-
tenx
An AI-assisted coding tool
-
rsonpath-lib
Blazing fast JSONPath query engine powered by SIMD. Core library of
rsonpath
. -
obmrs
As a participant, you will create a structure to receive and hold the exchange-distributed order book. This structure will be called the OrderBoard, and will hold the order book's bids and asks as a price-sorted map…
-
url_encor
A lightweight library to encode and decode special characters in urls
-
stam-tools
Command-line tools for working with stand-off annotations on text (STAM)
-
seismic
designed for effective and efficient KNN retrieval over learned sparse embeddings
-
aki-xcat
concatenate files that are plain, gzip, xz and zstd
-
mantra-lang-tracing
Contains functionality needed to collect requirement traces from code or plain text files for the
mantra
framework -
flawless-http
HTTP client for https://flawless.dev
-
indent_write
Write adapters to add line indentation
-
rust_string_utils
String utilities for rust based on org.apache.commons.lang3
-
colonnade
format tabular data for display
-
hanconv
Convert between Chinese characters variants
-
see-cat
A cute cat(1)
-
termfactor
is your terminal sick, mid or sus? test your terminal features & find out
-
quranize
Encoding transliterations into Quran forms
-
stylish-style
Internal implementation details of
stylish-core
-
cranberry
Transliteration of Russian Cyrillic into Latin Script
-
filenamify
Convert a string to a valid filename
-
mdmodels
generate models, code and schemas from markdown files
-
treegrep
A pattern matcher frontend or backend which displays results in a tree
-
catalog-of-markdown
Generate the catalog of markdown file
-
frida-build
Rust bindings for Frida
-
string-patterns
Makes it easier to work with common string patterns and regular expressions in Rust, adding convenient regex match and replace methods (pattern_match and pattern_replace) to the standard…
-
commit_crafter
AI powered tool for Git commit message generator
-
iirs
Inverted repeats finder
-
affinidi-messaging-text-client
Affinidi Messaging SDK
-
pprint
Flexible and lightweight pretty printing library for Rust
-
puid
A unique ID generator with 'ch_'-style prefix
-
glu
Unpacks a Markdown document full of code snippets into a temp directory
-
inlet_manifold
A general purpose highlighting library
-
linebreak
breaking a given text into lines within a specified width
-
rumage
framework for making simple markdown sites
-
heatseeker
A fast, robust, and portable fuzzy finder
-
asimov-cli
ASIMOV Command-Line Interface (CLI)
-
mago-fixer
Applies automated fixes and transformations to text
-
fast_symspell
Spelling correction & Fuzzy search
-
rewrite
Safely rewrite file contents from stdin, even when file is open as an input
-
tiny-ver
tiny version parser
-
agentai
designed to simplify the creation of AI agents
-
character_converter
Turn Traditional Chinese script ot Simplified Chinese script and vice-versa and tokenize
-
pulldown-html-ext-cli
CLI tool for extended HTML rendering of Markdown with pulldown-cmark
-
unidown
Convert Markdown to Unicode
-
to_markdown_table
An easy way to format any data structure into a Markdown table
-
bashdoc
generating documentation/help menu for user defined bash functions
-
clima
A minimal Markdown reader in the terminal
-
extract_anchors
Утилита для извлечения из исходных кодов всех помеченных отрывков
-
htmd
A turndown.js inspired HTML to Markdown converter
-
ident_case
applying case rules to Rust identifiers
-
dodo-zh
operate on pinyin and cedict.u8 file
-
words-count
Count the words and characters, with or without whitespaces
-
ncase
Enforce a case style
-
arrow-string
String kernels for arrow arrays
-
typedoc_json_to_md
Converts TypeDoc JSON generated by typedoc-json-parser to Markdown
-
regexnight
Command-line tool to print syntax-highlighted versions of regular expressions and spot errors
-
uniquewords-rs
Count the frequencies of words in text file(s) or stdin
-
uclanr
A random word picker that gives you actually useful words
-
codepack
A *lightning-fast* tool that transforms a directory into a single, LLM-friendly text file, perfect for easy processing and analysis
-
mtc-token-healing
Token healing implementation
-
html-compare
compare html files
-
broken-md-links
A command-line tool and library to detect broken links in Markdown files
-
mdsh
Markdown shell pre-processor
-
str_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
tectonic_bridge_core
Exposing core backend APIs to the Tectonic C/C++ code
-
lemmeknow
Identify any mysterious text or analyze strings from a file
-
reggy
friendly, resumable regular expressions for text analytics
-
zalgo-codec
Convert an ASCII text string into a single unicode grapheme cluster and back. Provides a macro for embedding Rust source code that has been encoded in this way.
-
semsimian
Sematic similarity calculations for ontologies implemented in Rust
-
mdbook-linkcheck
A backend for
mdbook
which will check your links for you -
mdbook-cmdrun
mdbook preprocessor to run arbitrary commands
-
unicodeit
Converts LaTeX to Unicode (rust port)
-
tree-sitter-stack-graphs
Create stack graphs using tree-sitter parsers
-
cbfr
A buffer that run on stack, focusing on performance and speed
-
aki-gsub
substitude text command, replace via regex
-
charname
Incredibly simple library that just gives you the Unicode name for a character
-
tapir-bf
Tapir is a moderately useful Brainfuck interpreter
-
antex
Styled text and tree in terminal
-
command-error
Detailed error messages and status checking for
std::process::Command
-
uklatn
Ukrainian Cyrillic transliteration to Latin script
-
adana-cache-command
namespaces aliases for command lines & basic scripting language
-
santoka
Translations of 668 of Taneda Santoka's free-verse haiku
-
abbreviator
abbreviating long words
-
zipcodes
Query US zipcodes without SQLite
-
cloc
Count, or compute differences of, lines of source code and comments
-
charx
A replacement for char::is_ascii*
-
abbreviation_extractor
extracting abbreviations from text
-
fontconfig
Safe, higher-level wrapper around the Fontconfig library
-
crowbook-text-processing
some utilities functions for escaping text (HTML/LaTeX) and formatting it according to typographic rules (smart quotes, ellipsis, french typograhic rules)
-
loc
Count lines of code (cloc) fast
-
yy1
Tiny utility to convert KiCad centroid files into Neoden YY1 pick and place machine format
-
crowbook
Render a Markdown book in HTML, PDF or Epub
-
poppler-sys-rs
Low-level (FFI) bindings for poppler-glib
-
arabic_reshaper
Reconstruct Arabic sentences to be used in applications that doesn't support Arabic script
-
semchunk-rs
A fast and lightweight Rust library for splitting text into semantically meaningful chunks
-
scanlex
lexical scanner for parsing text into tokens
-
natural-date-parser
A parser that converts natural language date and time expressions into Rust-compatible DateTime formats
-
nom-grapheme-clusters
Adapter that allows nom to account for unicode grapheme clusters
-
url_parser
URL Parser is a Rust parser developed to parse URLs into structured components such as scheme, domain, path, query and fragment
-
human_regex
A regex library for humans
-
deliminator
Universal code documentation generator
-
gigagei
random quote fetching console utility
-
autour_core
AUTOmata Utilities and Representation (AUTOUR) is a small toolbox to experiment with various kinds of automata and draw them
-
readability
Port of arc90's readability project to rust
-
repvar
A tiny CLI tool that replaces variables of the style
${KEY}
in text with their respective value. It can also be used as a rust library -
textwrap-macros
procedural macros to use textwrap utilities at compile time
-
my_rust_css_parser
CSS parser using the Pest library for Rust
-
runiq
An efficient way to filter duplicate lines from input, à la uniq
-
advent-ocr
Converts ASCII-art representations of letters generated by Advent of Code puzzles into a String containing those letters
-
text-tokenizer
Custom text tokenizer
-
mkwebsite
build websites using markdown
-
product-os-random
Product OS : Random provides a suite of random generator tools for different contexts. This includes random text, number and key generators including crypotgraphically secure random generation.
-
utilprint
easily using colors or unicode in your terminal printing
-
simple-ssg
Plain and simple static site generator for Djot and Markdown light markup languages
-
unicode-canonical-combining-class
Fast lookup of the Canonical Combining Class property
-
linoleum
but ergonomic line editor
-
apt-parser
A parser for the APT package manager's package lists
-
diffy-imara
Tools for finding and manipulating differences between files
-
pandoc_types
Rust port of pandoc-types
-
ipynb-to-md
Convert Jupyter Notebooks to Markdown files
-
mini-openai
An OpenAI API client with minimal dependencies
-
cesu8
Convert to and from CESU-8 encoding (similar to UTF-8)
-
clipboard-substitutor
CLI tool to monitor clipboard changes and perform operations based on the contents
-
asoiaf-api
Rust wrapper for the api of ice and fire
-
prompt-input
lightweight library for user input prompts in Rust, designed to make input handling straightforward
-
paltoquet
rule-based general-purpose tokenizers
-
substr-iterator
Substring extractor based on characters without allocation
-
fast_whitespace_collapse
Collapse consecutive spaces and tabs into a single space using SIMD
-
tower_allowed_hosts
tower allowed hosts layer
-
mdbook-codeblocks
A mdbook preprocessor to prepend customizable vignette to code blocks
-
slugify
Macro for flexible slug generation
-
clarifai_grpc
The official Clarifai gRPC Rust client
-
gosyn
Golang syntax parser in Rust
-
lexical-sort
Sort Unicode strings lexically
-
forbidden-bands
8-bit string handling library
-
mnemnk-application
Mnemnk agent for monitoring application usage
-
syllabize-es
Syllabize Spanish text, and much more
-
cssparser-color
Color implementation based on cssparser
-
puppet-fmt
Automatic code formatter for puppet manifests
-
vesti
A preprocessor that compiles into LaTeX
-
crlify
A std::io::Write wrapper that replaces with on Windows
-
substring-replace
developer-friendly methods to manipulate strings with character indices
-
spayd
Text handling for Short Payment Descriptors (SPAYD or SPD)
-
chord3
Create pdf songbooks from chopro source
-
kashida
Insert Kashidas/Tatweel into Arabic text, e.g. for justification purposes.
-
rust-tfidf
calculate TF-IDF (Term Frequency - Inverse Document Frequency) for generic documents
-
rustdoc-md
Convert Rust documentation JSON into clean, organized Markdown files
-
isbn
handling ISBNs
-
mini_markdown
Small markdown renderer
-
pragmatic-segmenter
Rust port of pySBD v3.1.0
-
cqtool
converting between CQ strings and message segment arrays
-
svgc
svgc
(previously SvgCompress) is a tool for compressing SVG files by removing unnecessary whitespace, comments, metadata and some other data. It also supports optimization with SVGO and compression into SVGZ. -
allsorts-subset-browser
Temp fork of allsorts 0.15 - includes patch for subsetting fonts for browsers
-
figlet-comment
quickly create banner to use as comments
-
mdopen
Preview markdown files in a browser
-
array_tool
Helper methods for processing collections
-
iconv-native
A lightweight text encoding converter based on platform native API or libiconv
-
rust-express
An ExpressJS inspired rust crate that handles http request in a similar way to ExpressJS. Created entirely out of bordom, you can pull and expand it as you like!
-
diary-cli
A powerful cli for documenting and keeping a diary
-
vibrato
viterbi-based accelerated tokenizer
-
codespan_preprocessed
Beautiful diagnostic reporting for M4 (or cpp) preprocessed text files
-
mdbook-private
An mdbook preprocessor that controls visibility of private chapters and sections within them
-
ethan-rs-wc
The ethan-rs-ws(erwc) is word, line, character, and byte count. Like wc command but not just wc command, more accurate and faster. Text can also be read from standard input for statistics.
-
metatron
core library
-
open-protocol
Atlas Copco/Torque Open Protocol specification in Rust
-
merge3
merge tool for three-way merges
-
reedy
A terminal-based RSS reader with a clean TUI interface
-
minigrep_jeck
minigrep
is a grep clone that takes a query and searches for the query in the file; with added support for regex -
fmtm
A diff-friendly Markdown formatter that breaks lines on sensible punctuations and words to fit a line width
-
markov
A generic markov chain implementation in Rust
-
jpreprocess
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
bump-bin
Increments version with semver specification
-
html-auto-p
function like
wpautop
in Wordpress. It uses a group of regex replaces used to identify text formatted with newlines and replace double line-breaks with HTML paragraph tags. -
rsrusl
A really simple useful library ported to Rust
-
aki-xtee
copy standard input to each files and standard output
-
pineappl_applgrid
PineAPPL's interface to APPLgrid
-
lingua-english-language-model
The English language model for Lingua, an accurate natural language detection library
-
orly
Download O'Reilly books as EPUB
-
sigrs
Interactive grep (for streaming)
-
enma
serving anime and manga information 📦
-
smol-symbol
compile-time globally unique, 25-character, 128-bit symbols similar to the Symbol type in Crystal Lang
-
matcher_c
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
rs-tool
A command-line tool to perform reservoir sampling on a file or a stream
-
turn-uppercase
Small command to uppercase text in command line and copy to clipboard
-
eternaltwin_constants
Hammerfest store implementation
-
word-tally
Output a tally of the number of times unique words appear in source input
-
lowcharts
draw low-resolution graphs in terminal
-
linkcheck2
extracting and validating links
-
epub2mdbook
convert EPUB files to MDBook format
-
naming_utils
generating naming conventions, pluralizing words, and rest api paths in Rust
-
byteutils
that provides a collection of frequently used utility functions for working with bytes, strings, and vectors. It includes common tasks such as converting between strings and byte arrays…
-
adobe-cmap-parser
parse Adobe CMap files
-
ean-rs
generating and validating EAN barcodes
-
wildcard_ex
extended wildcards that allows VB-like specifications
-
correct_word
A No brainer 'did you mean' library for Rust
-
simstring_rust
A native Rust implementation of the SimString algorithm
-
opentalk-types-signaling-meeting-notes
Signaling types the OpenTalk meeting-notes module
-
rust_readability
A package to assess the complexity of texts using a variety of readability formulas
-
uo_rst_parser
fork of rst_parser with fixes for upstream-ontologist
-
mdbook-callouts
mdBook preprocessor to add Obsidian Flavored Markdown's Callouts to your book
-
streampager
pager for command output or large files
-
unfuck
Python 2.7 bytecode deobfuscator
-
context-notation
Featherweight semantic notation for text
-
fuzzy-datetime
Detects, completes and normalises fuzzy date and date-time strings for interoperability with chrono or direct output as ISO-8601-compatible strings
-
tokenizers-enfer
today's most used tokenizers, with a focus on performances and versatility
-
mdbook-open-on-gh
mdbook preprocessor to add a open-on-github link on every page
-
svgbob
Transform your ascii diagrams into happy little SVG
-
named_entity_parsing
Named entity parser. Used in Rusev to parse a list of tokens into a list of entities.
-
utf16_lit
macro_rules to make utf-16 literals
-
xml_magic
A reasonably fast XML formatter
-
hangeul
Korean alphabet manipulation library
-
solr_post
command line tool and library to post files in a directory to a Solr server
-
fastn-resolved
fastn: Full-stack Web Development Made Easy
-
unicount
Alphabetic counter supporting unicode
-
hi-doc
Diagnostics formatting library
-
egg-mode-text
Text parsing for Twitter: character counting, hashtag/mention extraction
-
vidyut-kosha
A Sanskrit key-value store
-
sourceannot
render snippets of source code with annotations
-
vectorscan-rs
Ergonomic bindings to the Vectorscan high-performance regex library
-
wikidot-normalize
provide Wikidot-compatible string normalization
-
pulldown-cmark-escape
An escape library for HTML created in the pulldown-cmark project
-
sysit
System Sit! Check on the system with a quick glance
-
findtext_doc
Search text in Document
-
iregex
Intermediate representation for Regular Expressions
-
bt-echo
implemenation of the echo command-line utility
-
inflector-plus
Adds String based inflections for Rust. Snake, kebab, camel, word, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
cgp
Context-generic programming meta crate
-
markdown_converter
html to markdown converter and flavored markdown to discord markdown converter
-
fr_alebref_libbrefdata
BrefData library
-
ironsmith-parser
Transforms Smithy 2.0 IDL files into an abstract syntax tree
-
samvadsetu
LLM API for commonly used LLM services including Gemini, ChatGPT, and Ollama. The name implies a bridge for dialogue since the library facilitates communication and interaction between…
-
zh_num
Convert ASCII numbers and zh words
-
csml_interpreter
The CSML Interpreter is the official interpreter for the CSML programming language, a DSL designed to make it extremely easy to create rich and powerful chatbots
-
twitter_text_config
Configuration for twitter-text in Rust
-
qwit
A cli to work with dsv files
-
csv_to_table
pretty print CSV as a table
-
toolstr
formatting data into String's
-
enum-ts
TypeScript Enum pattern matcher codegen
-
loco-cli
loco cli website generator
-
ferret
A trigram-based tool for detecting similarity in groups of text documents or program code
-
case
A set of letter case string helpers
-
codetypo-vars
Source Code Spelling Correction
-
writings
The Bahá’í Sacred Writings for use in Rust projects and APIs
-
casespaetzle
This package introduces common identifier case traits and a macro for case trait generation
-
heart-strings
Quickly get random heart emojis to copy!
-
wikidump
parsing Mediawiki XML dumps
-
typo_checker
Searches for typos of English words and returns matches or similar words.(英単語のタイポを検索し、合致もしくは類似する単語を返却します)
-
tracery
Text-expansion library
-
grammalecte_client
Grammalecte HTTP client
-
surt-rs
Sort-friendly URI Reordering Transform (SURT)
-
string-alloc
Allocator-aware no-std-compatible String implementation
-
fluxcap
time expression parsing inspired on duckling
-
detone
Decompose Vietnamese tone marks
-
tgrep
Toy grep that honors .gitignore
-
wikipedia_prosesize
Count Wikipedia prose size
-
tfidf-text-summarizer
extractive text summarization system which uses TF-IDF scores of words present in the text to rank sentences and generate a summary
-
ragit-korean
korean tokenizer for ragit
-
gregex
Regex solver utilizing NFA
-
eddie
Fast and well-tested implementations of edit distance/string similarity metrics: Levenshtein, Damerau-Levenshtein, Hamming, Jaro, and Jaro-Winkler
-
srx
A mostly compliant Rust implementation of the Segmentation Rules eXchange (SRX) 2.0 standard for text segmentation
-
typope
Pedantic source code checker for orthotypography mistakes and other typographical errors
-
tergo-formatter
Formatter for tergo
-
bfom-lib
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
re_view_text_document
view that shows a single text box
-
capysay
A Rust-based CLI tool for customizable Capybara ASCII art with colorful messages
-
lingua-french-language-model
The French language model for Lingua, an accurate natural language detection library
-
djotters
Djot (Markdown) parser that runs at hyper speeds!
-
simdutf
Unicode validation and transcoding at billions of characters per second
-
unicode-intervals
Search for Unicode code points intervals by including/excluding categories, ranges, and custom characters sets
-
asimov-sdk
ASIMOV Software Development Kit (SDK) for Rust
-
like
A SQL like style pattern matching
-
treebender
An HDPSG inspired symbolic NLP library for Rust
-
nibi
static site generator. (WIP, implementing)
-
rust-regex-dsl-creator
Regular expression DSL derive macros
-
markov_str
Markov Chain implementation optimized for text generation
-
id32
32-character RFC 4648 lowercase Base32 IDs
-
man
Generate structured man pages
-
bubble-bath
Small and quick HTML sanitizer
-
crates_tut
following the rust book tutorial
-
ADA_Standards
help you handle checks on your ADA projects, especially good to build scripts to check coding standards conformity
-
mdtrans
Markdown parser and transformer using
pest.rs
, focused on flexibility to a project’s needs -
mdbook-dtmo
Creates a book from markdown files with added plugins
-
aki-mcycle
mark up text with cycling color
-
lucide-yew
Yew port of Lucide
-
uwl
A management stream for bytes and characters
-
linurgy
Manipulate the output of multiple newlines. Replace/Insert/Append newlines with text. Input and output from stdio/files/buffers
-
mdxt
Custom Markdown Engine for my personal blog
-
text_lines
Information about lines of text in a string
-
lindera-ipadic-builder
A Japanese morphological dictionary builder for IPADIC
-
rasciigraph
function to plot ascii graphs
-
thesaurus
An offline thesaurus library for Rust
-
digman
Ticket management systems for solo players
-
pdfgen
PDF rendering library
-
whichlicense_detection
detect licenses used by the WhichLicense project
-
b2c2-casl2
b2c2におけるCASL2の雑なパーサー?
-
block-list
A minimalist hosts-based tool for managing block lists and ad-blocking
-
libanubhav
management system written in Rust
-
trpl
A support crate for The Rust Programming Language book
-
yozuk
Chatbot for Programmers
-
rtss
A command-line tool to annotate stdout/stderr with elapsed times
-
choco
markup language for dialogue systems
-
dmos-cli
Djot HTML renderer with advanced features - CLI
-
alphabet_detector
Natural language alphabet detection library
-
analyse-json
CLI tool for inspecting (Newline Delimited) NDJSON or JSON to understand the contents
-
utf58
High-tech encoding of the Unicode space in one quibble and up to 3 bytes
-
mdbook-pikchr
A mdbook preprocessor to render pikchr code blocks as images in your book
-
mdbook-typstpdf
An mdBook backend that generates PDF output using Typst
-
markdown-extract
Extract sections of a markdown file
-
char-ranges
Iterate chars and their start and end byte positions
-
dicexp
A Dice Expression Interpreter program and library for parsing (and rolling) role-playing game style dice notations (e.g. "2d8+5")
-
gh-spray
generate a lot of GitHub activity
-
kbnf-regex-automata
A forked version of regex-automata for kbnf
-
unicode-matching
match Unicode open/close brackets
-
screen_printer
displaying rectangular blocks of text to a terminal
-
gh_page_tool
A github gh-pages tool for static blog site
-
cglue-bindgen
cleanup cbindgen headers for CGlue
-
xee-interpreter
Interpreter for XPath and XSLT
-
snailquote
Escape and unescape strings with shell-inspired quoting
-
sedregex
Sed-like regex library
-
protoviz
A fast and intuitive way of creating figures for protocol documentation
-
mdi
markdown include
-
ragegun
Performs lexica based analysis on text (i.e. age, gender, PERMA, OCEAN personality traits, ..)
-
wdl-doc
Documentation generator for Workflow Description Language (WDL) documents
-
litua
Read a text document, receive its tree in Lua and manipulate it before representing it as string
-
rustichar
cli tool made to check the occurrences of characters in a string
-
yara-x-parser
A parsing library for YARA rules
-
wimbd
A CLI for inspecting and analyzing large text datasets
-
texcore
Create LaTeX documents using native Rust types
-
unified-diff
GNU unified diff format
-
color-your-life
color data representations with a Display-like interface
-
csmlinterpreter
The CSML (Conversational Standard Meta Language) is a Domain-Specific Language developed for creating conversational experiences easily
-
hms-test_utils
Testing utils for Hold my Snip!
-
spongebob
convert text to spongebob case a.k.a tHe MoCkInG sPoNgEbOb MeMe
-
zoitei
alphabet conversions
-
rust-texas
generate latex documents
-
sre-engine
A low-level implementation of Python's SRE regex engine
-
font-map
Macros and utilities for parsing font files
-
datatroll
a robust and user-friendly Rust library for efficiently loading, manipulating, and exporting data stored in CSV files
-
unicode-range
UnicodeRange is a Rust library for parsing and stringifying Unicode ranges. It provides functionality to convert a string representation of Unicode ranges into a vector of code points and vice versa.
-
diff-man
diff utility lib
-
aho-corasick
Fast multiple substring searching
-
unicode_extension
Don't use this crate
-
mdbook-hints
mdBook preprocessor to add hover hints to your book
-
sbert
Sentence Bert (SBert)
-
tuisky
TUI client for Bluesky
-
mdbook-keeper
An improved testing experience for mdbook
-
tagsearch
Filter plaintext files based on @keyword tags
-
bin-rs
Binary Reader from multi source
-
anslatortray
translate from English to Pig Latin!
-
fiberplane-markdown
convert Fiberplane Notebooks to and from Markdown
-
annoying
others and yourself
-
auto-regex
Automagically finds a regex that best matches an example and a sample list
-
clafrica
This application allows you to type most of the characters in the african alphabet in any text field
-
colorize-rs
terminal text colorisation using ansi characters
-
unicode-ellipsis
truncate Unicode strings to a certain width, automatically adding an ellipsis if the string is too long
-
mdbook-plantuml
A preprocessor for mdbook which will convert plantuml code blocks into inline SVG diagrams
-
rustsay
CLI tool in Rust that mimics the classic cowsay program, allowing a cow to speak your text in the terminal
-
matcher_py
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
lookbook
Component preview framework for Dioxus
-
acorns
Generate an AsciiDoc release notes document from tracking tickets
-
aki-mline
match line, regex text filter like a grep of linux command
-
hashmoji_generator
Code generation tool for hashmoji
-
tcalc-rustyline
A fork of Rustyline for use specifically with tcalc
-
pdfcr
render a codebase to a pdf
-
babel
Map Rust enums to different LLM providers' model strings
-
viterbi_pos_tagger
A part-of-speech (POS) tagger using the Viterbi algorithm
-
textpod
Local, web-based notetaking app inspired by 'One Big Text File' idea
-
java_string
Java strings, tolerant of invalid UTF-16 encoding
-
runi
a CLI tool to generate unicode fonts
-
file-size
a function formatting file sizes in 4 chars
-
sanitizer
A collection of methods and macros to sanitize struct fields
-
harfbuzz-traits
Rust Traits for the HarfBuzz text shaping engine
-
recase
Changes the convention case of input text
-
freesia
some string operators
-
mdbook-pagetoc
A mdbook plugin that provides a table of contents for each page
-
roxy_cli
A command-line static site generator
-
typeline_ext_utils
operators for typeline
-
plagiarismbasic_lib
Basic plagiarism checker written in Rust
-
tre-regex
Rust safe bindings to the TRE regex module
-
vndb_tags_get
convert VNDB tag list (JSON to markdown)
-
leptos-markdown
A component which can render markdown as html element in leptos
-
manchu-converter
Converts transcripted Manchu text to Manchu script with Manchu alphabet
-
pediferrous
PDF rendering library
-
opentalk-types-signaling-whiteboard
Signaling types the OpenTalk whiteboard module
-
ssbm_utils
A collection of useful tools for working with data from Super Smash Bros Melee
-
srake
RAKE Algoritm
-
sayit
String replacements using regex
-
repub-rs
binary for converting mhtml webpages into remarkable-style summarized epubs
-
hr-id
A human-readable unique ID which supports Unicode, safe for URLs and file paths
-
console-utilities
handling console input and output utilities
-
chromalog
A customizable logger with dynamic color coding and file logging
-
poriborton
Interconversion between Unicode and various Bengali ANSI encodings
-
nlprule
A fast, low-resource Natural Language Processing and Error Correction library
-
palmdoc-compression
Fast & safe implementation of PalmDoc/MOBI/AZW/Kindle flavored LZ77
-
mdbook_rash
Binary to create doc from
rash
code -
crate-starter
starter
-
chunk_norris
splitting large text into smaller batches for LLM input
-
mecab
Safe Rust wrapper for mecab a japanese language part-of-speech and morphological analyzer library
-
class-rs
Reads/writes a .class file into/from an almost 1-to-1 matching struct
-
to-binary
Converts From Hex, UTF-8, Vector of Bytes, or a Byte Slice to a Binary String And Implements Many Useful Functions For Binary Strings
-
enc-check
inspect utf-8 and utf-16 character encodings
-
mdbook-external-links
Open external links inside your mdBooks in a different tab
-
tectonic_xetex_format
Tectonic/XeTeX engine data structures and their expression in TeX "format" files
-
common-words-all
Most common words sorted by ngram frequency
-
fish-printf
printf implementation, based on musl
-
ruby_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
workstyle
Workspaces with style! This program will dynamically rename your workspaces to indicate which programs are running in each workspace. It uses the i3 ipc protocol, which makes it compatible with sway and i3…
-
lindera-filter
Character and token filters for Lindera
-
quickner-core
A fast and simple NER tool
-
etch
Not just a text formatter, don't mark it down, etch it
-
bk-tree
A Rust BK-tree implementation
-
dbxcase
Dropbox-compatible case-folding algorithm
-
fastn-builtins
fastn: Full-stack Web Development Made Easy
-
env2toml
Convert env vars to toml text
-
nu_plugin_emoji
a nushell plugin called emoji
-
bstr
A string type that is not required to be valid UTF-8
-
chardet
rust version of chardet
-
character-set
High performance
set.contains(char)
-
squidge
shortens delimited data
-
roman_numerals_fn
A function to convert integers to their roman numeral representation as strings. Values from 1 to 3999 are possible, otherwise it returns an OutOfRangeError. Zero has no representation in roman numerals.
-
hns
Human numeric sorting program — does what
sort -h
is supposed to do! -
unflow
DSL to convert design to code
-
unescape
Unescapes strings with escape sequences written out as literal characters
-
chewing_capi
The Chewing (酷音) intelligent Zhuyin input method
-
dingtalk
Robot Util, Send text/markdown/link messages using DingTalk robot, 钉钉机器人
-
bebop-lang
LISP based Markdown preprocessor for creating html documents!
-
display_bytes
Human-readable display of byte sequences
-
rk-utils
A collection of utility functions and data structures for rust
-
html_to_epub
A command line converts .html file to .epub file
-
easy_io
Fast and dead-simple IO for competitive programming in Rust
-
markdown-it-footnotes
Creates footnotes and lists of footnotes in Markdown documents
-
token-parser
parsing texts into data structures
-
fsays
flavored replacement for the classic cowsay
-
ob
A Blog and RSS system written in Rust
-
ansi-width
Calculate the width of a string when printed to the terminal
-
r4d
Text oriented macro processor
-
twas
A text substitution application for using random look-up tables to generate text in a manner similar to the Mad Libs game
-
merge-whitespace
Procedural macros for merging whitespace in const contexts
-
skyspell_kak
skyspell - kakoune integration
-
vestalia
Vestaboard API wrapper
-
sesdiff
Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings in column A to the strings in column B. Also provides the edit distance (levenshtein).
-
butterkups-minigrep
Mini grep utility; very weak application, use grep instead
-
spezilinter
spezifisch's linter for different file formats, linting for weirdly specific stuff
-
ik-rs
chinese segment, ik-analyzer for rust
-
pandoc-ac
pandoc filter for converting acronym codes to LaTeX
-
arabic_text_utils
Arabic text processing and manipulation
-
aki-stats
output the statistics of text, like a wc of linux command
-
tantivy-tokenizer-api
Tokenizer API of tantivy
-
bcdown
Bilibili漫画下载器,written in Rust,支持epub pdf zip格式
-
fast-str
A flexible, easy-to-use, immutable, efficient
String
replacement for Rust -
erebus
A CLI message generation library
-
utf8-io
Traits and types for UTF-8 I/O
-
document_tree
reStructuredText’s DocumentTree representation
-
simple-markdown-parser
handling markdown. May include extra utilities
-
tangler
Extracts code blocks from Markdown documents
-
rust_iso15924
ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts (a "set of graphic characters used for the written form of one or more languages")…
-
split-every
Split for every n occurrences of a pattern iteratively!
-
sqdj
sqdj shortens delimited data
-
botanical-latin
Decliner / conjugator / inflector for classical / botanical Latin
-
harfbuzz
Rust bindings to the HarfBuzz text shaping engine
-
caido-convert
Encoding library for web security applications
-
markdown2pdf
Create PDF with Markdown files (a md to pdf transpiler)
-
basalt-core
core functionality for Basalt TUI application
-
mdzk
Plain text Zettelkasten based on mdBook
-
symspell
Spelling correction & Fuzzy search
-
slicestring
slicing Strings
-
gspell
Rust bindings for gspell
-
rustysozluk
Eksi Sozluk API wrapper
-
ucd
Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…
-
transportations_library
Knowledge stack for transportation
-
caseless
Unicode caseless matching
-
flatpage
file system based markdown flat page
-
xi-rope
A generic rope data structure built on top of B-Trees
-
epistle
Writing letters to both human and machine
-
string-simple
containing some simple string utilities that I use in my other projects
-
readable-regex
Regex made for humans. Wrapper to build regexes in a verbose style.
-
image-to-ascii
Converts images and gifs to ascii art
-
jawk
JSON AWK
-
buf-min
Minimal utf-8 safe buffer traits
-
ascii-img-cli
Command-line tool for using ascii-img
-
jfmt
command-line tool for formatting json files in both readable and compact formats. It supports stdin/stdout shell usage, as well as working on files directly.
-
rust-toolchain
Types to model a Rust toolchain
-
file-expert
Expert system for recognizing source code files, similar to GitHub/lingust
-
mudder
Generate lexicographically-spaced strings between two strings from pre-defined alphabets
-
translitrs
Transliteration utility for Serbian language
-
gcg-parser
Parser and Data Structures for the GCG file format
-
cosmic-text
Pure Rust multi-line text handling
-
rep-grep
wgrep/write-grep CLI
-
tengwar
Transliterate text into J.R.R. Tolkien's Tengwar.
-
cryptose
effortlessly encrypting and decrypting messages
-
less
pager utility for displaying file contents or piped input, with dynamic scrolling and search functionality
-
uulm-sopra23-group03-core
Core library for implementing Battle of the Centerländ
-
regex_generate
Use regular expressions to generate text
-
cli_app_capo
CLI application with Unix-like tools
-
ewts-cli
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (cli)
-
io-close
An extension trait for safely dropping I/O writers such as File and BufWriter
-
pdfrust
PDF parser
-
onig_sys
onig_sys
crate contains raw rust bindings to the oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma… -
clippy-to-md
cli tool to convert clippy json reports to markdown files
-
address_book
Інструмент командного рядка для парсингу телефонних номерів, ідентифікаторів, дат та неправильних…
-
re_view_text_log
A view that shows text entries in a table and scrolls with the active time
-
bureau
extracting dependency licenses, used by nertsio
-
convert_string
A trait to convert Strings to safe non-keywords and/or convert a Strings case (snake_case, PascalCase, ...)
-
unic-ucd-age
UNIC — Unicode Character Database — Age
-
pdf_composer
Create PDF documents from YAML Front Matter source documents
-
segtok
Sentence segmentation and word tokenization tools
-
stam-python
STAM is a library for dealing with standoff annotations on text, this is the python binding
-
vtext
NLP with Rust
-
pinot
Fast, high-fidelity OpenType parser
-
encoding-next
Character encoding support for Rust
-
unicode-display-width
Unicode 15.1.0 compliant utility for determining the number of columns required to display an arbitrary string
-
regex-split
split_inclusive for the regex crate
-
escape-bytes
Escapes bytes that are not printable ASCII characters
-
rust-fuzzy-search
Fuzzy Search with trigrams implemented in Rust
-
ttaw
talking to a wall, a piecemeal natural language processing library
-
pullup
Convert between markup formats
-
iregex-syntax
Common syntax for regular expressions
-
wool
Preview Github Markdown Offline
-
gem_version
Ruby's Gem::Version comparison logic in Rust
-
ipa-translate
translating between IPA and ASCII text
-
verba
working with Latin words
-
reg_match
A match style regex tool
-
libfmt
A text formating package for rust
-
censor
text profanity filter
-
str-utils
some traits to extend types which implement
AsRef<[u8]>
orAsRef<str>
-
betacode
conversion
-
qsv_vader_sentiment_analysis
Bindings for Rust from the original Python VaderSentiment analysis tool. Forked for use with qsv.
-
letter-sequence
A method to create sequence displayed as uppercase or lower letters, or digits
-
code-splitter
Split code into semantic chunks using tree-sitter
-
unveil-rs
Unveil Rs is a tool to create presentations from markdown files
-
jira-clean
clean up Jira task description that is an output of jira-cli tool
-
advanced_string_generator
A command-line tool for generating strings based on customizable regex patterns
-
enpsrlib
English Phrase Structure Rules library
-
mdbook-tocjs
A mdbook preprocessor which adds extra js and css file for ToC hydration
-
tree-sitter-pkl
A Tree-sitter grammar for Pkl
-
chars_data
Build-dependency for chars, the unicode character information CLI
-
mdast2minimad
converting markdown AST to minimad texts
-
blackboxmc_java
BlackboxMC bindings for java.util
-
textparse
declaratively implement parsers that are based on Packrat Parsing
-
loe
Very fast and yet another line ending (CRLF <-> LF) converter written in Rust
-
minigrep_baolhq
Just getting started with Rust, enjoying it so far 😇
-
slow-scan-print
Text is printed at fixed intervals by character or by line, and its name is inspired by SSTV (Slow Scan TV)
-
recvmsg
Traits for receiving datagrams reliably, without truncation
-
hyphenation_commons
Proemial code for the
hyphenation
library -
geoipsed
Inline decoration of IPv4 and IPv6 address geolocations
-
sqlite-fastrand
A SQLite extension for generating numbers and blobs very quickly
-
mdbook-embedify
based mdbook preprocessor plugin that allows you to embed apps to your book, like youtube, codepen and some other apps
-
untanglr
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies
-
sejong
Buffer is a buffer that can receive ASCII bytes different from keyboard and send out UTF-32 Hangul string. This buffer allows deletion by Jamo.
-
afrim-translator
Manage the predication system of the afrim input method
-
spf
.spf (Simple Pixel Font) file parsing, and useful api's to go alongside
-
mask-text
mask text with multiple masking options
-
message_segment_calculator
package to calculate SMS message segments
-
punycode
Functions to decode and encode Punycode
-
imperative
Check for imperative mood in text
-
promptkit_rs
LLM structured prompting library
-
oxcomm
using Google Translate on the fly
-
search-in-terminal
A terminal-based search tool
-
floem-cosmic-text
Unofficial cosmic-text fork for Floem
-
bilingual
A cmdline tool used for markdown translation via calling Chinese translation api cloud services
-
boreal-cli
CLI utility to run boreal, a YARA rules engine
-
zspell-cli
Command line interface for the ZSpell spellchecking library
-
ragtime
Easy Retrieval Augmented Generation
-
pulldown-html-ext
Extended HTML rendering capabilities for pulldown-cmark
-
computergeneration
compgen but all wrong
-
emoji
Every emoji, their metadata, and localized annotations
-
slicedisplay
Simplistic Display implementation for Vecs and slices
-
rmbs
Remove any fluff, corporate speak, or other bullshit from input text and print the TL;DR essence of what's being said, using the www.bullshitremover.com public LLM API
-
owned_chars
Owned iterators with the same output as Chars and CharIndices
-
detect-newline-style
Determine a string's preferred newline character
-
mdbook-curly-quotes
mdBook preprocessor that replaces straight quotes with curlyquotes, except within code blocks or code spans
-
udp-logger-rs
Log macro for log's kv-unstable backend and a UDP socket logger
-
dekor
styling and character repository in Rust
-
ragzilla
providing tools for RAG (Retrieval-Augmented Generation) pipelines
-
indented_text_writer
IndentedTextWriter
-
easy_random
Generate random data easily with easy_random :)
-
regex-chunker
Iterate over the data in a
Read
type in a regular-expression-delimited way -
lodestone
A website wrapper for FFXIV's lodestone
-
yara-x
A pure Rust implementation of YARA
-
mdbook-toc
mdbook preprocessor to add Table of Contents
-
jposta
A fast and intuitive Terminal User Interface (TUI) tool for searching Japanese postal codes and addresses
-
markdown-gen
generating Markdown files
-
tzgrep
grep tar.gz
-
verify_string
verifying strings
-
text-utils
Text utils for unescaping and align
-
cow-rewrite
Rewrite copy-on-write types copying only when it's neccessary
-
hexstring
handling hexadecimal string
-
simple_peg
A command-line peg parser implemented by Rust
-
ultra-nlp
A NLP library
-
ftrace
trace files and paths
-
codegenrs
Moving code-gen our of build.rs
-
minify-html-common
Common code and data for minify-html*
-
align
aligning text
-
apple-notes-exporter
CLI tool for exporting Apple Notes to Markdown
-
translit
Transliteration for cyrillic languages that uses officially approved rules
-
akiaki
A good old fashioned wiki engine with a flat-file database
-
path2regex
Express style path to RegExp utility
-
google_taxonomy
Struct with all Google Product Categories / Taxonomy
-
incredimo
just another font for your terminal
-
shopping-parser
A Rust-based parser for parsing structured product information and shopping lists, supporting multiple currencies and units
-
slugify-rs
generate slugs from strings
-
findtext_sheet
Search text in SpreadSheet
-
llmvm-codeassist
A LLM-powered code assistant that automatically retrieves context (i.e. type definitions) from a Language Server Protocol server.
-
cindex
CSV indexing library
-
stylish-stringlike
API for string-like objects that have styles applied
-
faster-chars-count
counting length of chars faster than Chars::count()
-
bottomify
Fantastic (maybe) CLI for translating between bottom and human-readable text
-
marked
Parsing, filtering, selecting and serializing HTML/XML markup
-
md-inc
Include files in Markdown docs
-
fmty
Composable
core::fmt
utilities -
lindera-tantivy
Lindera Tokenizer for Tantivy
-
mle
The markup link extractor (mle) extracts links from markup files (Markdown and HTML)
-
streamparser
Adapt simple stream parsers for different types of streams
-
text_unit
Newtypes for text offsets
-
sms_splitter
An SMS message splitter and part calculator with support for GSM and Unicode
-
rusk
a Specification Language
-
prototty_render
Traits for defining a renderer and a view
-
hydroper_source_text
Source text containing line locations
-
wtf8-rs
WTF-8 encoding
-
find_unicode
Find Unicode characters, the easy way!
-
langsan
sanitizing language model input and output
-
unicode_names2_generator
Generates the perfect-hash function used by
unicode_names2
-
angr
analyse ngrams in text files
-
lib-ruby-parser-ast
AST for lib-ruby-parser
-
ut1_blocklist
UT1 blocklist URL/domain filters
-
glyph-names
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
kvu
The simplest command line tool to manage key-value pair lines
-
ruSTLa
A reStructuredText → LarST ⊂ LaTeX transpiler
-
codetypo-cli
Source Code Spelling Correction
-
pelp
A (p)resentation h(elp)er. Makes it easy to create a revealjs presentation from a Markdown file.
-
static-key
static keys in Rust
-
mdbook-scientific
Enables inline equations for mdbook to set by $..$ signs and $$..$$
-
nu_plugin_regex
nu plugin to search text with regex
-
kspconfigtool
KSP1 ConfigNode parser and block removal tool
-
loro-internal
Loro internal library. Do not use it directly as it's not stable.
-
fancy-regex-fork-pb
A custom fork of the fancy-regex crate. You probably don't want to use this.
-
pi_ucd
unicode字符函数,获得字符的语言区间段;及根据文字排版的需要,判断字符是否为单字字符或字母字符
-
ru-html-extractor
A universal web page main content extractor based on line block density distribution
-
divvunspell-bin
Spellchecker for ZHFST/BHFST spellers, with case handling and tokenization support
-
mutf8
Basic support for mutf8 strings
-
mantra-miner
your software recite mantras while it runs
-
pillar
small tool to format lines into columns
-
testcall
companinon crate to bintest, implements test facilities
-
xml1
sane, non compliant xml parser without allocations
-
story-dl
Story web scraping
-
font-awesome
A typed manifest of constants of Font Awesome unicode code points
-
excavate
A command line tool to excavate fields from multi-column texts
-
quake_text
Utils for Quake strings and characters
-
rsmorphy
Morphological analyzer / inflection engine for Russian and Ukrainian (soon) languages (WIP)
-
mapm
A set of tools, with command line and graphical interfaces, used to build exams typeset in LaTeX
-
csvsc
Build processing chains for CSV files
-
word_search_solver
Solves a word search
-
tet_rs
A third-party implementation of Text Entry Throughput (ref. https://doi.org/10.1145/3290605.3300866) for Rust
-
deck
A command line tool to generate HTML presentations from Markdown documents
-
spacebar
An anti-plagiarism tool based on null width characters
-
synox
Program synthesis of string transformations from input-output examples
-
mdbook-quiz-schema
Schema for quizzes used in mdbook-quiz
-
soft-ascii-string
char/str/string wrappers which add a "is-ascii" soft constraint
-
text-colorizer
Transitionary package
-
term_grid
formatting strings into a grid layout
-
google-fonts
Download and cache TTF fonts from Google
-
summertail
Summarize your tail -f output, by reporting how many lines match regexes
-
scraps_libs
A static site generator that makes it easy to write internal links (Wiki links) in Markdown files
-
lsp-document
Helpers to convert between LSP documents and Rust strings
-
perm-text
curling straight/dumb quotation marks ("") and apostrophes (') into their curly/smart (“”’) equivalents
-
ranting
Linguistic formatting placeholder extensions for rust
-
text_layout
Text layout algorithms
-
chunkr
A fast and quick chunking library for rust
-
csv-groupby
execute a sql-like group-by on arbitrary text or csv files
-
omgwtf8
Optimized-Matching-Generalized Wobbly Transformation Format — 8-bit
-
texting
string helpers
-
ucfirst
Uppercase the first letter of a string
-
notegraf
Core library for building a graph-oriented notebook
-
lla_plugin_utils
Blazing Fast and highly customizable ls Replacement with Superpowers
-
lingua-german-language-model
The German language model for Lingua, an accurate natural language detection library
-
hina
:]
-
mdbook-yml-header
mdBook preprocessor for removing yml header
-
grammateus
facilitate working with Ancient Greek words
-
enso-lazy-reader
An efficient buffered reader
-
ngrammatic
Character-oriented ngram generator and fuzzy matching library
-
unicode_categories_zheyuye
Query Unicode category membership for chars
-
campfire
A tiny static site generator, greatly inspired by Zola
-
ascii-hangman-webapp
customizable Hangman game with ASCII-art rewarding for children (webapp version)
-
mdbook-davids_cooking
A preprocesor for whatever https://davidsotomarchena.gitlab.io/davids-cooking/ needs
-
opentalk-types-signaling-meeting-report
Signaling types the OpenTalk meeting-report module
-
sluggify
slug or clean url generator for rust. With default settings, you will get an hyphenized, lowercase, alphanumeric version of any string you please, with any diacritics removed, whitespace and dashes collapsed…
-
commonregex
Rust port for CommonRegex. Find all times, dates, links, phone numbers, emails, ip addresses, prices, hex colors, and credit card numbers in a string. We did the hard work so you don't have to.
-
asciimath-text-renderer
Render asciimath in terminal
-
uapi-version
Compare versions according to the UAPI Version Format Specification
-
ldd_md_parse
markdow to html simple tool
-
ucf
A universal code formatter
-
guarding
guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.
-
sortuniq
Find or count unique values in an input stream
-
haoxue-dict
Chinese dictionary and word segmenter
-
stringutils
A collection of various and (hopefully) useful String utility functions
-
mdbook-ocirun
mdbook preprocessor to run arbitrary commands and code snippets inside containers
-
emojicon
Find Emoji by using Emoticons and GitHub's, Bengali emoji names
-
strcursor
string cursor type for seeking through a string whilst respecting grapheme cluster and code point boundaries
-
nugine-rust-utils
Nugine's personal Rust utilities
-
lister-cli
Lister: Navigate Markdown Lists
-
valid_rust_char
A tiny library to check if a char it's valid in a rust file
-
bigstr
A command-line tool to make string BIG
-
cyrconv
A funny faux cyrillic character mapper
-
idna-cli
Encode/decode Unicode domain names to/from IDNA ASCII
-
c6o-obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
strizer
minimal and fast library for text tokenization
-
mdict
Rust implement mdict
-
wn-parser
parser for WordNet database files
-
qm-role-build
Role builder from markdown tables
-
qpprint
console printing/formatting
-
akai
Next-generation, functional and simple ID system
-
quocktail
Rust CLI for searching through nested directories of markdown files with frontmatter, filtering and displaying the results
-
grace-cli
CLI tool for processing files and strings
-
tabwriter
Elastic tabstops
-
character_text_splitter
splitting text into chunks with overlap, designed for handling large amounts of text efficiently. Implementation is identical to langchain's CharacterTextSplitter
-
lindera-ipadic-neologd-builder
A Japanese morphological dictionary builder for IPADIC NEologd
-
searcher_txt
A copy of grep that i made to show that im bad at rust
-
CFDTCP
Short text clustering for determining center points and correlation judgment [one-hot,Jenks Natural Breaks]
-
base16-rs
in Rust offers capabilities for encoding and decoding data in Base16 format. By utilizing the hex library, you can transform data into its hexadecimal representation and also decode…
-
rjoin
joining CSV data on command line
-
kansuji
漢数字と数字の相互変換のためのライブラリ
-
pulldown_typst
A pull parser for Typst markup
-
umlauts
text transformation of german umlauts
-
markdown-it-latex
Allows for the insertion of math in Markdown documents using LaTeX
-
dedent
Procedural macro for stripping indentation from multi-line string literals
-
repa
Peak Performance Pattern Seeker
-
extract-words
Extracts words from text without allocation
-
markdown-toc
Markdown Table of Contents generator
-
codes-iso-15924
This package contains an implementation of the ISO 15924 Codes for the representation of names of scripts specification
-
bitflip
functions to generate bitflips of binary and UTF-8 strings
-
notedown_ast
Notedown Abstract Syntax Tree
-
wordmarkov
but flexible Markov chain library, specifically for text sentences, which handles punctuation and whitespace
-
fetch_mediawiki_configuration
Generate
parse_wiki_text
configuration code for a MediaWiki wiki -
tectonic_io_base
Basic types for Tectonic's pluggable I/O backend system
-
okh-tool
A CLI tool to deal with Open Know-How (OKH) data files. Its main functionalities are: validation of and conversion between the different formats
-
quartz_commands
Generates a parser at compile-time for handling commands similar in structure to those of Minecraft
-
adc-lang
Array-oriented reimagining of dc, a terse stack-based esolang
-
rusty-x
A snippet manager in Rust
-
runiq-lib
An efficient way to filter duplicate lines from input, à la uniq
-
pdf-sign
extract signed date from pdf file
-
dtex
Better TeX
-
yinzhe9
喵喵隐者9
-
ptero-cli
A text steganography CLI tool for Social Media
-
charclass
define and modify unicode character classes
-
numbers_into_words
Command-line utility and library for writing a positive integer as English words
-
outerspace
Methods for prefixing and suffixing the non-whitespace characters in a string
-
clipboard_macos
obtain access to the macOS clipboard
-
ook
encode and decode Brainfuck or Ook
-
linetime
command line utility to add timestamps at the start of lines. The tool can either process lines from stdin or execute a command and process lines from the command's stdout and stderr.
-
text_searcher
text searcher tool
-
rust_stringz
string manipulation library for Rust that you should probably don't need. It's just for learning purposes.
-
csv-guillotine
CSV's often have metadata at top before data headers. This removes it.
-
xpanda
Unix shell-like parameter expansion/variable substitution
-
abjad
Calculate the numerical abjad value of Arabic-script text
-
formatjson
Formats JSON files
-
rl-sys
Native bindings to libreadline
-
nfa_regex
NFA regex engine for text processing
-
ascii-img
Convert images to ASCII
-
esl01-drawdag
Parse an ASCII DAG into parent relations
-
gazetta-render-ext
A static site generator framework. Extra render code.
-
ccase
Command line interface to convert strings into any case
-
xuexi
parse a sentence into words for multiple language
-
mdrss
generating RSS feeds from markdown files
-
liwe
IWE core library
-
parser-cli
Command-line interface for extracting text from various file formats
-
cautious-octo-funicular
Test: shipping an mdbook with API docs
-
lorem-rustum
lib for generating lorem-ipsum with a rusty fleur
-
solstice_glyph
A fast text renderer for solstice, powered by glyph_brush
-
halfcaps
tRaNslAtE aNy TeXt To ThIs
-
cfasttext-sys
fastText ffi binding
-
character_frequency
counting character frequencies in a string concurrently
-
interslavic
in rust
-
simple-text-editor
Basic implementation of a simple text editor, as a Rust library and CLI
-
md-designer
A CLI tool for creating design docs in Markdown
-
ellipse
Truncate and ellipse strings in a human-friendly way
-
bytescolor
A versatile Rust library for colorizing strings and byte data in terminal applications using ANSI escape codes
-
hmd
Custom Markdown Engine for my personal blog
-
ucd-raw
Uninterpreted access to the unicode UCD
-
webdog
static site generator fit for a dog
-
johalun/module
FreeBSD kernel module in Rust
-
kindle2cbz
extracting images from kindle books in MOBI format to CBZ archives
-
ctf-brute
Brute-force utilities for Rust
-
looking-glass
reflection & type-erasure library for Rust
-
valis-core
workflow management
-
mdtranslation
prepare multi-lingual Markdown documents
-
regex_parser
This project provides a parser for standard regular expressions based on a defined grammar
-
csi
processing a directory of text files. Particularly useful for building static websites.
-
hello_lib
Demonstrate Generics Function
-
tuilet
A textual user interface for Toilet, the ANSI-art text generator
-
ryaspeller
lib for searching typos in text, files and websites
-
wz
Count words, fast
-
compact_str
A memory efficient string type that transparently stores strings on the stack, when possible
-
giff
Visualizes the differences between the current HEAD and a specified branch in a git repository using a formatted table output in your terminal. The differences are displayed with color-coded…
-
jcalendar
Japanese Calendar for Rust
-
char_reader
Safely read wild streams as chars or lines
-
markdown-formatter
Flavored Markdown (ZH) content formatter
-
ngram
Iterator adaptors for n-grams and k-skip-n-grams
-
markitdown
designed to facilitate the conversion of various document formats into markdown text
-
mdbook-tools
A collection of tools for mdbook
-
utf8streamreader
lookahead iterator on an utf8 byte stream
-
strmatch
Conditionally match strings in Rust using regex without much boilerplate
-
hl2-lib
highlighting streams of text via regex
-
robotics
algorithms
-
indentasy
Indent like a party
-
twilight-mention
working with mentions in the Twilight ecosystem
-
truncate_string_at_whitespace
Truncate a &str at the closest whitespace to a specified length with unicode safety
-
markdown-composer
composing markdown documents
-
runestr-pancjkv
rune
-based Pan-CJKV support -
text-diff
text diffing and assertion library
-
xee-ir
Xee intermediate representation and compilation to bytecode
-
case-conv
Faster case conversion crate
-
ab-radix-trie
A compressed radix trie implementation supporting matching rules
-
grader
Stream-based CLI for binary sorting text files via a given shell command
-
string-sharp
traits implementing on string similar to c#
-
rustblocks
tetromino based falling blocks game
-
rupantor
A Bengali Phonetic Parser which is very flexible and supports Avro Phonetic
-
cowonverter
owo-ify text files. provides a simple function for this purpose.
-
kicad-text-injector
A tiny CLI tool that replaces variables of the style
${KEY}
within KiCad PCB (pcbnew) files -
indentsort
Structure-preserving sorting of arbitrary indented text
-
spellabet
Convert characters into spelling alphabet code words
-
xhtmlchardet
Character set detection for XML and HTML
-
my_parser_rrin
My parser
-
khat
A cat clone, nothing more nothing less
-
ufofmt
A fast, flexible UFO source file formatter based on the Norad library
-
sixbit
Small packed strings
-
dokkoo
Mokk (Macro Output Key Kit) implementation written in Rust
-
str_overlap
Methods for finding the overlap between two string slices
-
whitespace
Encode arbitrary data whitespaces and vice versa
-
macro_colors
colorful printing macros
-
koelner-phonetik
koelner_phonetik or cologne phonetics is a phonetic algorithm like soundex, but specialized for german words
-
caser
Change text between PascalCase, camelCase, and snake_case
-
moss-oxide
open-source software similarity metric, written in Rust
-
wkhtmlapp
Convert html to pdf or image
-
galm
pattern matching library
-
ewin-com
editor for Window(GUI) users.No need to remember commands
-
bge
Rust interface for BGE Small English Embedding Library
-
dr
Command-line data file processing in Rust
-
textblocks
working with text blocks
-
anon-csv-cli
anonymise CSV files, providing various options to substitute real data with plausable fake data
-
elden-ring-saver
ansi2
-
encoding_c
C API for encoding_rs
-
vape
full width aesthetics
-
rex-regextract
extracts key value pairs out of text
-
jp-deinflector
A package for deinflecting Japanese words
-
glyphana
Quickly find, inspect & collect unicode glyps
-
lindera-decompress
A morphological analysis library
-
tieliehin_css_parser
CSS parser using the Pest library for Rust
-
readput
Fast and easy stdin input parsing for competitive programming in rust
-
utils_rust
这是一个用于各种实用功能的 Rust 库
-
as3_parser
ActionScript 3 parser
-
fuzzy-string-distance
Fuzzy string distance comparisons
-
morse_code_japanese
handle Japanese Morse Code. (和文モールス符号)
-
rand-hira
CLI tool to generate random hiragana characters
-
mdbook-tabs
mdBook plugin for rendering content in tabs
-
kana-converter
converter for half-width/full-width Japanese language characters (katakana, hiragana, and ASCII)
-
gbx
GBX (Grundbuch-Exchange) Dateiformat
-
gecliht
A disparate collection of text manipulation and formatting algorithms
-
unic-ucd-segment
UNIC — Unicode Character Database — Segmentation Properties
-
jp_utils
Utils for working with Japanese text
-
unicode-box-drawing
Unicode box-drawing characters
-
repr
The regular-expression-as-linear-logic interpretation and its implementation
-
royal_road_archiver
An archival program and library for the webnovel site RoyalRoad
-
cesu8-str
CESU-8 and Java CESU-8 string validation and manipulation
-
graphannis-core
supports graph representation and generic query-functionality
-
yagenerator
Application that uses tinytemplate engine to generate text files. If you have a set of structured data, and need to generated a bunch of arbitrary types of files from it, this tool can help you to save some time.
-
hulk
An ultra simple no hassle static site generator
-
sttx
belt for transforming speech-to-text data
-
hemtt-tokens
A token library for hemtt
-
tfidf-summarizer
Basic tf-idf compute for documents
-
latinrs
Rust package that transliterates text and characters into thier latin form
-
raekna
combines the other crates in the workspace to make a functional program
-
asmfmt
A formatter designed for programs in assembly language with AT&T syntax
-
mors
converting morsecode to text and reverse
-
gimme
Pull useful data out of your clipboard
-
unindenter
unindent text
-
hl2-cli
A cli tool for colourising / highlighting stdin with regex and writing back to stdout
-
yeslogic-fontconfig-sys
Raw bindings to Fontconfig without a vendored C library
-
cli-pdf-reader
A minimalistic ebook reader
-
goodname
assist you with cool naming of your methods and software
-
confusables
around Unicode confusables/homoglyphs
-
esperanto-text
Convert Esperanto text between UTF-8, x-system and h-system transliterations
-
igo-rs
Pure Rust port of the Igo, a POS(Part-Of-Speech) tagger for Japanese (日本語 形態素解析)
-
arabic-script
An expressive API for the characters of the Arabic script
-
unicode-vo
Unicode vertical orientation detection
-
rs-wordcloud
☁ Flexible WordCloud implementation for ⚙Rust
-
whitespace_text_steganography
A steganography strategy that uses whitespace to hide text in other text
-
platina
parameterized golden testing library
-
animated-emojis-rs
Noto Animated emojis
-
trie-match
Fast match macro
-
cef
in Rust
-
taboc
A table of contents generator for markdown documents
-
jpreprocess-njd
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
lindera-cli
A morphological analysis command line interface
-
aki-unbody
output first or last n lines, like a head and tail of linux command
-
codetypo
Source Code Spelling Correction
-
CLI_Project_Scott_Coakley
CLI Project in Rust
-
easy-regex
Make long regular expressions like pseudocodes
-
uwubot
discord bot for uwuifying text
-
mdbook-presentation-preprocessor
A preprocessor for utilizing an MDBook as slides for a presentation
-
regexy
lightweight Rust library for working with regular expressions. The regexy crate provides an easy-to-use interface for matching patterns in strings using regex
-
hunspell-rs
Rust bindings to the Hunspell library
-
ripgrep
line-oriented search tool that recursively searches the current directory for a regex pattern while respecting gitignore rules. ripgrep has first class support on Windows, macOS and Linux.
-
ctrl-z
A composable reader to treat
0x1A
as an end-of-file marker -
uiuifree-normalize
uiuifree text normalize
-
marker
finding issues in CommonMark documents
-
runanum
Существительные с правильными окончаниями после чисел
-
uwu-rs
uwuifying library
-
azusa
String index transformer for Rust utf8 to JavaScript utf16
-
mdbook-preprocessor-utils
writing mdBook preprocessors
-
unicode-normalization-alignments
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
group-similar
Group similar values based on Jaro-Winkler distance
-
cabocha
Safe Rust wrapper for cabocha a japanese language dependency structure analyzer library
-
levenshtein_lite
No-frills implementation of a Levenshtein Automata and the Levenshtein Distance function
-
swot
community-driven or crowdsourced library for verifying that domain names and email addresses are tied to a legitimate university of college
-
arboard
Image and text handling for the OS clipboard
-
string_manip_rust
Demo of managing projects
-
small-bwt
BWT construction in small space
-
rex-regex
A pedestrian library for regular expressions: Slow, but easy to understand
-
cha-rs
Extract specific characters from an input
-
invisible_unicode
finding invisible unicode characters
-
playin-cards
♠ Library for French-suited playing cards
-
fancy_print
Easily print animated ASCII text to the terminal!
-
pomsky-syntax
Parser for pomsky, a new regular expression language
-
mdbook_header_footer
mdBook preprocessor to prepend header and append footer to certain chapters
-
deno_tauri
deno executable
-
unicode-casing
Titlecase helper function on characters
-
docket
markdown to HTML documentation rendering
-
unicode-width-16
Determine displayed width of
char
andstr
types according to Unicode Standard Annex #11 rules -
mdx
in Rust
-
sc_decode
Special character decoding
-
doc-search-dsl
procedural macro for creating complex regex patterns using a Domain-Specific Language (DSL)
-
termwrap
Wrap Unicode text with ANSI color codes
-
vec-string-to-static-str
providing utilities for converting vectors of
String
s into vectors of&'static str
-
basic-text-internals
Basic Text string literal implementation details
-
utf-cli
A UTF code viewer
-
veryfi
Module for communicating with the Veryfi OCR API
-
rabbit-auto
Wrappers for lapin publishers and consumers
-
mdbook-kroki-preprocessor
render kroki diagrams from files or code blocks in mdbook
-
typeline
Efficient, Type-Safe Pipeline Processor
-
b2c2-stat
デバッグ用にところどころで使ってるぽい
-
markdown2unicode
Converter from markdown notation to unicode characters
-
encoding-next-index-singlebyte
Index tables for various single-byte character encodings
-
engish
A language utility for sampling letters and building words
-
markov_strings
A simplistic Markov chain text generator
-
spdkit-surface
short text for crates.io
-
toml_to_table
pretty print TOML as a table
-
blitztext
fast keyword extraction and replacement in strings
-
dictcc
Rust API for reading and querying the dict.cc offline translation database
-
markov-text
creating a small markov model for text generation
-
redpatterns
a list of patterns for scanners 📟
-
linkcheck
extracting and validating links
-
commit-formatter
A CLI tool to help you write git commit
-
mdbook-svgbob2
Alternative mdbook preprocessor for svgbob
-
markovish
Markov chain implementation for text generation
-
lf_lint
Linter that checks that all files in <PATH> ends with a newline
-
wordshk_tools
A combination of parsers and other tools for words.hk (粵典)
-
mdbook-cat-prep
a preprocessor for mdbook which provides teacher, subject, material and tag functionality
-
thesauromatic
command-line thesaurus that returns related words when given a word. The output words are one per line, making it easy to process in shell pipelines.
-
cli_app_capo15
CLI application with Unix-like tools
-
xboot
static variables like module-level database connections can be initialized asynchronously before the program starts. / 在程序启动前通过异步函数初始化静态变量,如模块级数据库连接。
-
clippers
Cross-platform clipboard management library
-
kpathsea
Rust interface to the kpathsea TeX file management library
-
midstring
Create a string between two other strings, that is lexicographically halfway between them
-
nerd_font
Enumerated contents of the JetBrainsMono Nerd Font
-
fifthtry-mdbook
fork of mdbook, only for ft-cli
-
fast_aug
Fast data augmentation for text
-
rust-regex-dsl
Regular expression DSL
-
md-include
include any file in markdown files
-
delay_writer
Wraps a writer and delays its output after each newline
-
rustme
Automatically generate Rusty READMEs
-
indent_tokenizer
Generate tokens based on indentation
-
text_manipulation_rs
generating random placeholder text in different languages
-
genpdf
User-friendly PDF generator written in pure Rust
-
fimdoc
Firendship is Magic Document, converts Markdown into FIMFiction BBCode
-
rustrawi
Rust port of the original PHP Sastrawi
-
md-to-html
CLI tool to convert Markdown files to HTML
-
ae-renderable
handling individual text characters that can be rendered on the screen. Intended for use in game development.
-
kth-lines
Command line tool for filtering stdin lines that just work
-
nixpkgs-fmt-rnix
Nix code formatter for nixpkgs
-
shoebill
A Wadler/Leijen style pretty-printer
-
product-os-content
Product OS : Content provides a complete solution for content management for the purpose of serving content via Product OS : Server
-
collapse
Trim and collapse consecutive whitespace to a single space
-
assert-text
the testing macro tools
-
print-positions
providing string segmentation on grapheme clusters and ANSI escape sequences for accurate length arithmetic based on visible print positions
-
bukvalno
A cli tool for converting images to ascii art
-
kvarn-chute
A Markdown converter designed to use the Kvarn templating engine
-
codex
Human-friendly notation for Unicode symbols
-
unic-ucd-normal
UNIC — Unicode Character Database — Normalization Properties
-
rulet
figlet implementation
-
txt_to_md
Command converting from a txt file to a markdown file
-
remake
writing maintainable regex and managing symbol soup
-
minigrep_flict
Simplest text-in-file search engine from rust book
-
mdbook-twiki
twiki backend for mdbook
-
logseq
Handle Logseq Markdown files in Rust
-
stringmatch
Allow the use of regular expressions or strings wherever you need string comparison
-
html-to-pulldown-cmark-events
Parse HTML to pulldown-cmark's events
-
render_readme
Render Markdown or reStructuredText with syntax highlighting and image filtering similar to GitHub's
-
allwords
Generate all the words over a given alphabet
-
table_to_html
interface to convert a
tabled::Table
into a HTML table (<table>
) -
gstring
String with support for Unicode graphemes
-
markdown-linkify
Markdown preprocessor for substiting link shorthands to valid links according to configurable regexes and custom substitution implementations
-
carnation
some string operators
-
readability-text-cleanup
cleaning up text from HTML using Mozilla's Readability algorithm
-
codevis
turning your code into one large image
-
yeslogic-unicode-script
Fast lookup of the Unicode Script property
-
asimov-core
ASIMOV Software Development Kit (SDK) for Rust
-
tinytoken
tokenizing text into words, numbers, symbols, and more, with customizable parsing options
-
mdbook-svgdx
mdbook preprocessor to convert svgdx fenced code blocks into inline SVG images
-
tagalyzer
A CLI tool to gather statistics on collections of plaintext-adjacent files
-
informa
font compiler for digital typography
-
mdbook-svgbob
SvgBob mdbook preprocessor which swaps code-blocks with neat SVG
-
humnum
Human numeric sorting program — does what
sort -h
is supposed to do! -
mdtransform
program that renders Markdown as HTML for websites
-
mdbook-typst-pdf
mdbook typst pdf backend
-
forgiving-htmlescape
HTML entity encoding and decoding, with support for leaving malformed entities intact
-
name-engine
computing Markov chains to generate random names based on pronunciation
-
textocx
Tex code to Office MathML
-
just-enough-emojis
text to emoji cli
-
lazy-char-iter
Lazily iterate over Unicode characters from a u8 slice in Rust
-
md2gemtext
for converting Markdown into gemtext
-
lucide-dioxus
Dioxus port of Lucide
-
token-read
reading whitespace delimited files intended for competitive programming
-
ssml-parser
parsing speech sythnesis markup language
-
cattocol
Combine two text into one text as columns
-
uwildmat
wildmat/uwildmat matching
-
dispnet-hash
Hash Library used for Dispnet
-
word-star-puzzle-solver
A CLI tool for solving VGs word star puzzle
-
jput
puts and putc on unicode-width align for Rust
-
casaubon
A feature-poor static site generator
-
iprint
debugging utility that prints indented text based on function call depth
-
founder
The package provides a font toolbox
-
glifnames
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
amongify
A very ඞ sus ඞ program
-
uniwhat
Display the unicode characters text
-
literumilo
A spell checker and morphological analyzer for Esperanto
-
rust_nickname_generater
that generates user/nick names based on the rust language
-
charwise
This lightweight, dependency-free rust library provides a convenient way to read characters from different resources
-
mdbook-skill-tree
mdbook plugin to show roadmaps
-
trim
trim whitespaces from files
-
snake_case_converter
convert strings to snake case
-
citation
manipulate citation files
-
mdbook-all-the-markdowns
Render all markdown files in a given folder structure
-
brainfuck_interpreter
brainfuck interpreter written in Rust
-
harper-core
The language checker for developers
-
gdnative-doc
Documentation tool for gdnative
-
rustplexity
bigram-based perplexity calculator, useful for filtering out boilerplate or other abnormal text
-
todo-to-issue
CLI tool that converts forgotten TODO comments into actionable GitHub issues
-
veloci_levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
ra_rustc_lexer
lexer used by rustc, zero stability gurantees
-
codes-iana-charset
This package contains an implementation of the IANA Character Set registry
-
braille_pics
producing text-art pictures using Braille characters
-
notmecab
tokenizing text with mecab dictionaries. Not a mecab wrapper.
-
basic_lib_for_me
just a basic lib that i will use usually
-
jg
Jeff Goldblum (jg) is a command-line JSON processor. jg searches for structural patterns in json input and prints each json object that matches the pattern.
-
uwuify
fastest text uwuifier in the west
-
jlu
Command-line utilities for on-the-fly investigation of JSON Lines
-
comment-strip
Remove comments out of text files
-
strfry
randomize a string
-
mdbook-plugin-utils
mdBook plugins
-
minigrepsada
Test
-
charmap
one-to-(none/one/many) character mapping
-
parattice
Recursive paraphrase lattice generator
-
serbzip
A quasi-lossless Balkanoidal meta-lingual compressor
-
paperoni
A web article downloader
-
findtext_textfile
Search text in text file
-
latin1str
Windows-1252 string types
-
tocfl
Chinese TOCFL vocabulary by levels
-
pulldown_mdbook
A pull parser for mdBook
-
minigrep_jamhaze
minigrep tool from the Rust book chapter 12
-
libxdiff
Rust bindings for the libxdiff C library
-
cli-colors
A CLI tool for outputting text in ANSI format with features like colors, underlining, boldening, and italicizing
-
random_access_unicode
Provide random access to the unicode characters in a file
-
clippy_lints
A bunch of helpful lints to avoid common pitfalls in Rust
-
rreplace
designed to streamline string replacements. It can handle multiple unique replacements and iterates the string only once.
-
mdbook-spec
An mdBook preprocessor to help with the Rust specification
-
icu_casemapping
Unicode case mapping algorithms
-
naromat
Convert text to narou novel format
-
write16
A UTF-16 analog of the Write trait
-
transcript
A transcriber for European scripts
-
xee-xpath
XPath 3.1 library API
-
bnf_syntax_parser
The syntax parser based on BNF rules
-
g2-unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
strip-ansi-escapes
Strip ANSI escape sequences from byte streams
-
rust_baht_text
Convert number to Thai Baht text
-
arbitrator
Format text based on a set of rules and regexes
-
modeling
tools to analysis different languages by Ctags
-
strange
A static website generator
-
onepage
static site generator
-
crawdad
ChaRActer-Wise Double-Array Dictionary
-
json-peek
Amature JSON parser library designed for my specific need
-
case_convert
Converts the first letter of a Rust String to uppercase
-
cn-font-split
划时代的字体切割工具,CJK与任何字符!支持 otf、ttf、woff2 字体多线程切割,完美地细颗粒度地进行包大小控制。A revolutionary font subetter that supports CJK and any characters…
-
utfx
-
unicode_types
A mapping of all the unicode characters into convenience types (one enum per block of characters with one variant per character)
-
bpmf_py
A Bopomofo and Pinyin library
-
alphabet-encoding
A way of encoding text
-
awabi
A morphological analyzer using mecab dictionary
-
twitter_text_conformance
twitter-text in Rust
-
m2h
Convert Markdown to HTML with syntax highlighting
-
sarcasm
tExT creation and validation library
-
case_converter
Case Converter
-
bitranslit
Bi-directional transliterator for Rust. Transliterates strings according to the rules specified in the language packs.
-
ogrep
searching in indentation-structured texts
-
wattle
tools for rust
-
zummi
fun lib that produces spoonerisms
-
whitespace-conf
Key-value configuration file delimited with whitespaces
-
node-emoji
Convert
:emoji:
to Unicode using GitHub’s and EmojiDB’s emoji names -
gnu-echo-rs
A rewrite of the echo GNU core utility in rust
-
tantivy-czech-stemmer
Czech stemmer as Tantivy tokenizer
-
count-md
configurable command-line tool and Rust library for Unicode-aware, Markdown-aware, HTML-aware word counting in Markdown documents
-
mdtohtml
markdown to html renderer (with a couple of missing features)
-
regex-cache
Lazy and cached regular expressions
-
scrambler
command line tool to scramble letters
-
terminal_cli
A standalone library with no-std support for command line terminal interfaces. With autocomplete support, helpers for commands and properties and a prompt implementation.
-
mime_4
Strongly Typed Mimes
-
mdbook-github-authors
mdbook preprocessor to display Github profiles of authors of a page
-
gzip-cmp
that makes a distance measurement between binary data based on the difference of the compressed data length
-
console-traits
Basic console traits, such as setting char position and writing strings
-
polyvalid
A validator for package names, namespace, username and app name on wasmer
-
sastrawi
stemming and stopword removal for Bahasa Indonesia based on PHP sastrawi project by Andy Librian
-
ligotab
Format delimited data with lightweight markup
-
prettythanks
frontend to dtolnay/prettyplease library
-
uecho
The unicode of the echo command
-
wcounter
Give the word and count the appearance
-
merge-whitespace-utils
Procedural macros for merging whitespace in const contexts
-
ugrip
A i/o command that searches for word in a file
-
anagrambot
find anagrams of words
-
ruby-string
A string type that tracks Ruby glosses attached to parts of it
-
ra_syntax
Comment and whitespace preserving parser for the Rust langauge
-
unicode_reader
Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters
-
shapdf
Create Shapes into PDF
-
tokengeex
efficient tokenizer for code based on UnigramLM and TokenMonster
-
resrap
A parser library
-
igpay-atinlay
Translate text to Pig Latin
-
mdbook-fs-summary
Summary generator for mdbook
-
chisel-lexers
Chisel backend lexers/scanners
-
catdream
Sleeping cat dreams your text
-
git-busy
A wrapper around "git commit" that generates the commit messages for you
-
nmd-core
Official NMD compiler core
-
crypto-invert
Unicode Upside-Down Mapping
-
ucd-util
A small utility library for working with the Unicode character database
-
kl-hyphenate
Knuth-Liang hyphenation for a variety of languages
-
detect-indent
Detect the indentation of code
-
regex_static
Compile-time validated regex, with convenience functions for lazy and static regexes
-
alpha-counter
Alphabetic counter
-
changecase
A trait and implementation for changing the case of Strings and &str. It currently supports uppercase, lowercase, alternating case, and inverting case. Title case is in the works.
-
spongemock
CLI-tool for random text capitalization
-
chat-gpt-lib-rs
interacting with OpenAI's ChatGPT API, providing a simple interface to make API requests and handle responses
-
show-bytes
Display bytes as printable ascii with escape sequences as needed
-
hunspell-sys
Bindings to the hunspell C API
-
stringsort
Pathological sorting of string characters
-
rsonpath-test-codegen
Blazing fast JSONPath query engine powered by SIMD. TOML-based test codegen for rsonpath-lib.
-
kindleclip
CLI tool to export a file per book in kindle 'My Clippings.txt'
-
traduki
Integrate translated assets into your application or library
-
tgo
Heterogeneous data type transtion, it's safe, lightweight and fast
-
encoding_rs_transcode
Transcode text within writers using encoding_rs
-
csv-sanity
Sanitize and transform large CSVs with millions of records quickly and efficiently
-
yozuk-core-skillset
Set of default Yozuk skills
-
kudubot-bindings
Rust Bindings for the kudubot framework
-
merge_pdf
Merge PDF files in a directory
-
korean_regex
Regex extension for Hangeul analysis
-
wtf8
WTF-8 encoding. https://simonsapin.github.io/wtf-8/
-
base64-secret
Base64 encoder/decoder with custom alphabet. The alphabet is sorted by a given key. The sorting is always deterministic.
-
inslice
A command-line utility for filtering text input by columns and rows
-
match-pinyin-with-hanzi
Checks whether the sentence in Chinese characters (汉字) matches with the sentence in pinyin (拼音). Erhua is supported.
-
moscato
Outline scaler for OpenType glyphs
-
univstring
The Universal String(Rust String, CString, Wide(UTF-16/UTF-32) CString) trait
-
text_styled
Trait to style String and &str
-
trans-case
Transform case
-
ascii_converter
converting between different ascii representations
-
pseudolocalize
Pseudolocalization crate for Rust
-
remove-markdown-links
Turns
[example](https://example.com)
intoexample
. That’s it -
no-comment
Remove rust-style line and block comments from a char iterator
-
note-mark
A markdown parser
-
fmt-extra
Extra formaters not shipped in the standard library
-
milligrep
Custom simplified implementation of grep
-
afrim-memory
handle of sequential codes easier for an input method
-
leven-distance
Compute operational differences between two sequences using the Levenshtein algorithm
-
md_converter
My markdown to html converter
-
cli-animate
Enrich your Rust CLIs with stunning, easy-to-implement animations
-
rigrep
grep from Rust Book
-
enchant
Bindings for Enchant spell checking library
-
soup
Inspired by the python library BeautifulSoup, this is a layer on top of html5ever that adds a different API for querying and manipulating HTML
-
markdown-extract-cli
Extract sections of a markdown file with a regular expression
-
meaningsearch
package that helps you find meaningful lines of any given input. Especially useful in CTFs.
-
regex-automata
Automata construction and matching using regular expressions
-
nipdf
A PDF library written in Rust
-
chargrid_test_grid
chargrid frontend that renders to an in-memory buffer for use in tests
-
autoruby-cli
CLI to easily generate furigana for various document formats
-
mon-artist
Render ASCII art to SVG, with user-customizable rendering rules
-
rcut
replacement for GNU cut that supports UTF-8
-
skribo
low-level text layout
-
shear
trimming excess contents from things
-
futf
Handling fragments of UTF-8
-
metatensor-sys
Bindings to the metatensor C library
-
randrem
sending random remainders to be more healthy
-
lexer-generator
Lexer derived from Regex patterns with user customizeable tokens
-
blockcounter
Counts the blocks in a stream
-
stardict
Rust implement stardict
-
mdbook-variables
mdBook proprocessor for risolve variables configured from book.toml
-
emoji-printer
Replace emoji shortcodes in string with emoji unicode (":sushi:" -> 🍣)
-
yeslogic-unicode-blocks
Functions to access and search Unicode blocks
-
mago-span
Tracks exact character positions in source code, making it easier to report precise errors, warnings, and other diagnostics
-
gqlog
👾 filter your json logs with graphql 👾
-
lexmatch
lexicon matching tool that, given a lexicon of words or phrases, identifies all matches in a given target text. Uses suffix arrays.
-
mul
Bengali stemmer
-
timfmt
A small utility for formatting code as Tim likes it
-
stamd
Webservice for working with stand-off annotations on text (STAM)
-
word_filter
A Word Filter for filtering text
-
sparklet
small flashcards library
-
Raifus
view an Ascii waifu
-
genex
Text-expansion library
-
yarner-block-links
A Yarner plugin that adds to each code block a list of links to all referenced and all referencing blocks
-
lingua-spanish-language-model
The Spanish language model for Lingua, an accurate natural language detection library
-
scripter
A screenplay compiler
-
rew
A text processing CLI tool that rewrites FS paths according to a pattern
-
irssi-sys
Automatically generated bindings to irssi
-
text-template
Small template engine for use with plain text (e.g. creating text email), not intended for HTML.
-
justify
plaintext while handling Unicode gracefully
-
emojicons-2021
Parse :emoji: notation to unicode representation
-
aprilasr
High-level wrapper for the april-asr C api (libaprilasr) using aprilasr-sys
-
mdbook-docslab
mdBook preprocessor for interactive code with docslab
-
pygmentize
wrapper for syntax highlighting
-
aki-txpr-macro
the more easy to use libaki-*
-
syllable
counter for use with reading level calculations
-
unicode-canvas
creating text base drawing
-
uwu_wasm
uwuifying the web
-
fsrenamer
refactoring invalid file/dir names
-
rescue-blanket
Escape values while they are being formatted
-
framework
detector for different frameworks in one projects
-
ron_to_table
pretty print RON as a table
-
marko
Programmtically format text with Markdown syntax
-
chinese
language nlp tools
-
spare
colorful format iterable
-
kincaid
A word statistics library in Rust
-
filecount
A modern high-performance open source file analysis library for automating localization tasks
-
argot
Parse documentation from codebases into Markdown for easy doc creation
-
shutil
Shell utility helper library
-
mdbook-quiz-validate
Input validation for quizzes used in mdbook-quiz
-
string-overlap
A helper crate for "layering" ASCII art
-
spoilers
High-level Rust bindings for CTranslate2
-
jpreprocess-jpcommon
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
is_printable
Determine whether a given text-based value is printable
-
diffy-fork-filenames
Fork of https://docs.rs/diffy that allows specifiying filenames
-
lindera-ko-dic
A Japanese morphological dictionary for ko-dic
-
twitter-text
in Rust
-
marcus
An experimental Markdown parser written in Rust
-
strip_markdown
remove markdown syntax from markdown files
-
lithe
A Slim template engine by using Pest
-
zhlint
A linting tool for Chinese text content
-
html_to_markdown
Convert HTML to Markdown
-
globber
Extended glob matching library
-
ngrams
Generate n-grams from sequences
-
mdbook-unlink
A mdBook backend that validates local links
-
shallow
long text
-
spacemod
A easy to understand and powerful text search-and-replace tool
-
icu-data
International Components for Unicode (ICU) data in Rust structures
-
slack_update
app to set Slack status, emoji and photo
-
md-localizer
Localize markdown with remote links
-
translation-api-cn
Some useful structs for calling Chinese translation api cloud services. A helper tool for
bilingual
cmdline tool. -
hms
Command line utility to help with storing text (eg. code snips)
-
leven
Make your own blog!
-
split_exact
splitting strings into arrays of slices
-
noodler
A port of the python-ngram project that provides fuzzy search using N-gram
-
markdown-table
Creating markdown tables with Rust!
-
gematria_rs
Gematria, a traditional Hebrew numerology system
-
old_icelandic_zoega
Old Icelandic dictionary for Rust. From "A Concise Dictionary of Old Icelandic" by Geir Zoëga
-
mdbook-playscript
Preprocessor for mdBook, which styles stage play scripts
-
subscript-compiler
A modern LaTeX rendition
-
mdxbook
Fork of mdBook, with more customizations and flexibility for programmers
-
zalgo-text
A command line tool for generating zalgo text
-
lingua-danish-language-model
The Danish language model for Lingua, an accurate natural language detection library
-
tectonic_status_base
Basic types for reporting status messages to a user
-
b2c2-jis-x-201
UTF-8とJIS-X-201を雑に変換処理する
-
corg
A cog-like tool
-
struckdown
A structured markdown / commonmark library for Rust
-
squ
command-line utility for converting quotation marks in plaintext files to "smart quotes"
-
emojito
Find all the Emoji in a string. Supports composed emoji.
-
bazaar
formats and protocols
-
rust_lemmatizer
A lemmatizing package for use with a .csv dictionary of lemmas and their corresponding words
-
chinese-ner
A CRF based Chinese Named-entity Recognition Library written in Rust
-
boxy
Declarative builder for Unicode box-drawing characters
-
code-span
Add additional infomation to code character
-
file-search
File indexing and search
-
fuzzywuzzy
A pure-Rust clone of the incredibly useful fuzzy string matching python package, FuzzyWuzzy
-
kakasi
Romanize hiragana, katakana and kanji (Japanese text)
-
indoc
Indented document literals
-
bytepiece_rs
The Bytepiece Tokenizer Implemented in Rust
-
tectonic_engine_xetex
The XeTeX engine as a reusable crate
-
gret
command line tool to search for patterns and show matches in a tree structure
-
typeline_ext_csv
csv parsing and serialization for typeline
-
kasedenv
Read environment variables by lower, upper case or case-insensitive keys
-
adbook
Creates a book from AsciiDoc files
-
hline
a grep-like tool that highlights lines in files
-
my_mini_grep
A mini application that aims to replicate the behavior of the grep shell command
-
morse-nostd
A nostd version of the morse crate
-
mdbook-auto-gen-summary
A preprocessor and cli tool for mdbook to auto generate summary
-
crustword
Crusty Crosswords
-
mdbook-trunk
mdBook plugin which bundles packages using Trunk and includes them as iframes
-
encoding-index-singlebyte
Index tables for various single-byte character encodings
-
boringascii
Strings that can only be constructed to contain non-whitespace, non-control ASCII characters
-
am-parse
converting asciimath to other formats
-
lindera-dictionary
A morphological analysis library
-
br-pdf
PDF Invoice Processing
-
rust-cedar
efficiently-updatable double-array trie in Rust (ported from cedar)
-
gfm-autolinks
Parse GitHub Flavored Markdown autolinks
-
wiki_corpus_grammar
Extract text from Wikipedia dumps (.bz2) and convert it to JSONLines format
-
falcom-sjis
Falcom-compatibile Shift JIS implementation
-
varcon-core
Varcon-relevant data structures
-
psfparser
A PSF reader written in Rust
-
ruby-parser
A parser for the Ruby language
-
seven_seg
Seven-segment digital display for terminal
-
static_table
creates pretty tables at compiler time
-
ezemoji
Catigoryized Emoji's
-
igneous-md-viewer
The viewer component of igneous-md
-
scie
research about how to build simple code identify engine for different languages
-
mdbook-gitbook
mdBook preprocessor to properly render GitBook specific syntax
-
bbd-lib
Binary Braille Dump
-
const_format_proc_macros
detail of the
const_format
crate -
moenarchbook
Creates a book from markdown files
-
pretok
A string pre-tokenizer for C-like syntaxes
-
bibutils-sys
Rust bindings for bibutils, a program for bibliography format interconversion
-
term-rustdoc
A TUI for Rust docs
-
couleur
Add the ablity to print colored text to the console
-
owoify
Text owoification library
-
pithy
Ultra-fast, spookily accurate text summarizer that works on any language
-
repub
convert markdown documents to epub
-
cw
Count Words, a fast wc clone
-
saurus
Markdown to LaTeX Transpiler
-
mrdocument
Automatic PDF transcription and classification via OpenAI
-
zalgo-codec-common
Convert an ASCII text string into a single unicode grapheme cluster and back
-
senax-inflector
Pluralizes and singularizes English nouns. This crate is forked from https://github.com/whatisinternet/Inflector.
-
plural
pluralize English words
-
pattern-3
Needle API (née Pattern API 3.0), generalization of
std::str::pattern
-
lindera-compress
A morphological analysis library
-
markdown-it-footnote
A markdown-it plugin for parsing footnotes
-
hitori
Generic compile-time regular expressions
-
gen-epub-book
Generate an ePub book from a simple plaintext descriptor
-
token-counter
wc
for tokens: count tokens in files with HF Tokenizers -
skyspell_core
skyspell core library
-
strip-tags
Strip HTML and PHP tags from strings
-
indentation
Formatter
-
marktask
A CLI tool for parsing and manipulating Markdown tasks
-
seq2xypic
Turn a text sequence diagram into a LaTeX xypic diagram
-
rosie
Interface for the Rosie Pattern Language, for efficient and maintainable text pattern matching and search
-
pomsky-bin
Compile pomsky expressions, a new regular expression language
-
ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched
-
convert_encoding
Convert encoding of text files in batch
-
august
& program for converting HTML to plain text
-
m_lexer
extensible regular expressions based lexer
-
chinese_segmenter
Tokenize Chinese sentences using a dictionary-driven largest first matching approach
-
ghimlink
Generates a Github-Flavoured-Markdown formatted image link to an image in the same repository
-
roxy_markdown_parser
Roxy plugin for parsing Markdown
-
utf8_reader
A UTF-8 reader that read UTF-8 characters from object that implement Read trait
-
mdbookshelf
Create epubs from a list of mdbook repositories
-
korrektor
work with Uzbek language text processing
-
rosie-sys
build or link to librosie to access the Rosie Pattern Language
-
transition-table
transition table utilities for keyword parser
-
snekdown
A parser for the custom snekdown markdown syntax
-
latex_snippet
Convert even erroneous LaTeX snippets into HTML
-
finl-charsub
Character substitution module for in-line replacement of fixed strings
-
password-characters
help with the "enter the 12th, 35th, and 63rd characters from your password" situations
-
t9
Easily create port number from words and vice versa
-
wcount
CLI word counting tool
-
byte_string
Wrapper types for outputting byte strings (b"Hello") using the Debug ({:?}) format
-
text-to-json
Convert text to json in rust
-
terminal-supports-emoji
Check whether the current terminal supports emoji
-
bbcode-tagger
BBCode tree parser and tagger
-
difference
text diffing and assertion library
-
whisperer
将文本编码为简短的中文字符, 防和谐
-
pcre2
High level wrapper library for PCRE2
-
const-utf16
Utf8 to utf16 conversion functions for use in const contexts
-
stardict_wrapper
Rust implement stardict
-
mdbook-extended-markdown-table
Preprocessor for mdBook that generates tables with merged cells from ASCII text
-
mdbook-treesitter
mdBook preprocessor for html adding tree-sitter highlighting support
-
spacey
a tiny yet performant whitespace vmm/interpreter
-
hebrew_unicode_utils
Some functions for processing Hebrew unicode characters
-
ed_join
Implemtation of Ed-Join Algorithm for string similarity join
-
bullshit-rs
A BullshitGenerator implementation in Rust
-
hashtag-regex
regex matching hashtags accoding to the unicode spec: http://unicode.org/reports/tr31/#hashtag_identifiers
-
mojibake
Encode/Decode bytes as emoji base2048
-
rizzer
Fuzzy matching tool to find string similarity
-
destiny_rs
An async library to interact with Bungie's destiny 2 api
-
quagga
CLI tool that combines multiple text files into a single prompt suitable for Large Language Models
-
pomsky-macro
Macro for converting pomsky expressions to regexes
-
every-range
Iterator for interleaving included and excluded ranges
-
fontconfig-rs
Safe, higher-level wrapper around the fontconfig library
-
unicode_names
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
cur
that will hunt for your regular expression
-
flw
Process text via configurable tasks
-
patterns
no_std compatible pattern scan library using simd
-
pocky
A framework for building your own static site generator
-
spellcheck_toy
a basic spellchecking library based on edit distance
-
mdbook-compress
Compress an mdBook project into a single PDF file
-
matchable
Check if text is matching against string or regex in one API
-
mdbook-asciidoc
mdBook backend for AsciiDoc generation
-
pangu
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
termbook
behind the
termbook-cli
-
pdf_composer_definitions
PDF Composer definitions crate
-
wfst4str
Python library based on rustfst for manipulatig strings with wFSTs
-
unidok
A powerful, readable, easy-to-learn markup language
-
tectonic_xetex_layout
XeTeX's font loading and layout interface encapsulation, as a crate
-
rustex
auto-generated LaTeX files in Rust
-
readability-rs
Port of arc90's readability project to rust
-
corpus-count
Util to count words and character ngrams in a corpus
-
highlight-pulldown
Process pulldown-cmark events to apply syntax highlighting to code blocks
-
auto_correct
provide auto correct suggestions. Currently supporting EN-US.
-
bocu1
BOCU-1 compressed unicode encoding
-
jellybean
Syntax highlighting with tree-sitter. Sweet colors.
-
color-convert
Support RGB,RGBA,HEX,HSL,HSLA,HSV,CMYK to convert each other, write by rust
-
yozuk-helper-english
English NLP utilities for Yozuk
-
ripmors
encoding and decoding international Morse code and several variants
-
quoted-string-parser
Quoted string parser for grammar defined in RFC3261
-
unicode_escape
decoding escape sequences in strings
-
readable-readability
Really fast readability
-
gregex-logic
Logic for the gregex crate
-
rmw-utf8
Short text compression algorithm for utf-8 (optimized for Chinese , developed based on rust programming language). 面向utf-8的短文本压缩算法(为中文压缩优化,基于rust编程语言开发)。
-
aki-json-pick
The json pick out command
-
catr
A fully functional clone of the GNU concat tool built completely in Rust
-
aklat
create books from markdown files (like Gitbook)
-
darn
Extras for the Book on Data Analysis with Rust Notebooks
-
latexify
Shared definition for turn a rust object into latex code
-
litegrep
A basic tool for searching in files for lines of text, based on a query
-
rustfmt_emitter
Rustfmt emitter library
-
utf
UTF-8
-
tpt
Pure Rust implementation of the Unix concatenate (cat), word-count (wc) and echo command
-
encoding-next-index-tradchinese
Index tables for traditional Chinese character encodings
-
nb2fr
Convert integer number to literal French text
-
xsystem
Conversion between the Esperanto x-system and Unicode circumflexes
-
yeah
CLI tool to return the vendor name for a given MAC address
-
string-sections
Build tool for Leptos
-
e_book_sync_library
Synchonize e-book with your local e-library
-
dynfractal
A program to render any equation (in terms of z and c) into an ASCII fractal
-
pulldown-cmark-fork
A pull parser for CommonMark
-
uwu_cli
uwuifying the terminal
-
spongedown
Converts markdown to html with svgbob support
-
string-box
Create Rust string from UTF-8 string, byte string or wide string
-
wordfreq
port of wordfreq for looking up the frequencies of words in many languages
-
text_distance
A collection of approximate string matching algorithms
-
texrender
Thin wrapper around running
latexmk
to render LaTeX documents. Also supports generating Tex documents. -
encoding_c_mem
C API for encoding_rs::mem
-
exit-with-code
A CLI which returns the exit code of your choice
-
czv
performing CSV-related operations for data engineering and analysis
-
fancy_mocker
transforming text into alternating uppercase and lowercase letters, resembling the Spongebob Mock Meme
-
is_utf8
functions to determine if a sequence of bytes is valid utf-8
-
vidyut-chandas
A Sanskrit metrical classifier
-
kanjidic_types
A collection of types encompassing the variety of data about kanji available from Kanjidic
-
cmark2tex
A small utility to convert markdown files to pdf exploiting tectonic
-
encoding-index-tradchinese
Index tables for traditional Chinese character encodings
-
mdbook-rustviz
An mdbook preprocessor that allows users to embed RustViz visualizations into mdbook projects
-
vcg
manipulating verification conditions
-
uchr
translate char values to the characters themselves, and vice versa
-
demoji
Remove all emojis from a string
-
rcut-lib
rcut is a Rust replacement for GNU cut that supports UTF-8
-
re2
Wrapper for the re2 C++ regex library
-
vaporetto_tantivy
Vaporetto Tokenizer for Tantivy
-
bookkeeping
Minimal, safe and flexible in-memory bookkeeping API
-
kaomoji-ru
The full http://kaomoji.ru database, encoded and escaped as byte literals
-
unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
ascii_tree
generates ascii trees
-
romulus
a stream editor like sed
-
dequote
Remove nested quotes around text
-
markdown_to_html_parser
parses Markdown syntax into HTML
-
fum
fum finds fuzzy matches to a literal search pattern, searching recursively through all the files in the current directory and respecting gitignore rules
-
base_emoji
Convert everything to Emojis (and back)! - 🐫🔦🔥🎲🐬🍀🍟🔋🐬🎲🐬🍀🎁🍟💧👂🔥🚪🔋🍟🔦🔋🚗👣🔦🌍👂🍤🐗
-
lexicmp
comparing and sorting strings lexicographically and naturally
-
single_source
Generate code files from snippets in md tutorial files
-
web-grep
A Grep Tool for HTML or XML
-
aprilasr-sys
Low-level FFI bindings for the april-asr C api (libaprilasr)
-
bionic-ebooks
Takes an EPUB file and generate a copy with bionic like font applied
-
character-stream
Helper data structures for reading UTF-8 characters from a stream
-
conllu-utils
working with the CoNLL-U dependency format
-
base256_lib
Base256 encoder/decoder
-
kodumaro-trim
Command line tool to trim strings
-
nipah_tokenizer
A powerful yet simple text tokenizer for your everyday needs!
-
rustextile
Textile markup language parser for Rust
-
trim_lines
An extremely simple and tiny library which provides an iterator over the lines of a string, trimmed of whitespace. It is a simple wrapper around the Lines iterator in std::str which trims the whitespace from each line.
-
asciir
Print ASCII table/values
-
mdbook-ai-pocket-reference
mdbook preprocessor for the ai-pocket-reference project
-
zw
encoding and decoding text using zero-width characters
-
procedural-masquarade
Incorrect spelling for procedural-masquerade
-
llmvm-core-lib
llmvm core application
-
ende
encoding/decoding unicode/utf-8/utf-16(ucs-2) code points
-
json-event-parser-witespace
A json event parser with whitespace
-
csvre
replacing data in CSV columns with regular expressions
-
kradical_static
Ready-to-use EDRDG radical decompositions
-
porter-stemmer
Flexible and unicode friendly, Porter stemmer implementation
-
x1b
State tracker for 0x1b terminal escape codes. Also gives random characters names.
-
encoding8
various 8-bit encodings
-
jpreprocess-window
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
raekna-storage
in-memory and persistent storage functionality to the raekna project
-
lingua-slovak-language-model
The Slovak language model for Lingua, an accurate natural language detection library
-
allsorts_no_std
Font parser, shaping engine, and subsetter for OpenType, WOFF, and WOFF2
-
tendril
Compact buffer/string type for zero-copy parsing
-
tiniestsegmenter
Compact Japanese segmenter
-
roe
Unicode case conversion
-
spyglass
Search engine for documents, inspired by bioinformatics
-
mdbook-tectonic
An mdbook backend for generating LaTeX and PDF documents
-
mdbook-indexing
mdbook preprocessor for index generation
-
dhoni
converting Bengali text into their phonetic counterpart
-
datex
package to automatically process your data and make it look pretty in LaTeX
-
mdbook-check-missing-md
A backend for
mdbook
which will find Markdowns you forgot on SUMMARY.md -
mdbook-bib
mdbook plugin allowing to load and present a bibliography in BibLaTex format in your books and cite its references
-
flashtext2
The FlashText algorithm implemented in Rust
-
mocker
transforming text with uppercase and lowercase letters, resembling the Spongebob Mock Meme
-
markdown-heading-id
Filter for
pulldown-cmark
which converts headings with custom ID -
unicode-line-stacker
Tiny library for stacking Unicode line-drawing characters on top of each other
-
highlights
Convert kindle highlights into different formats
-
minigrep_macano953
A fun grep tool written as my first incursion into Rust. Not intended to be inherited and used in production code
-
mojimoji-rs
fast converter between Japanese hankaku and zenkaku characters, mojimoji
-
mdbook-latex
An mdbook backend for generating LaTeX and PDF documents
-
bos_books_codes
that handles 3-character Bible Books Codes
-
anagram
A collection of anagram utility functions
-
noneifempty
Convert T to None if empty
-
mossy
A toy library for parsing and compiling Markdown
-
read_chars
An iterator over characters read from some I/O source
-
unidades
playing with physical units
-
peppergrep
grep utility written following the 12th chapter of the Rust book. Some little modifications were made.
-
html2runes
An HTML to Text converter
-
yozuk-sdk
Types used in the Yozuk ecosystem
-
topfew
CLI to find high frequency occurrences in structured text files
-
hashlogs
Command-line utility that hashes the part before a space on each line from stdin with blake2b keyed with an ephemeral randomly-generated key and writes to stdout
-
ttf_word_wrap
Wraps text based on character width
-
mdbook-metadata
mdBook preprocessor to parse markdown metadata
-
quill_delta_pdf
Convert Quill Delta to PDF
-
textcat
detect text categories. It can be used to detect the language of a given text
-
wordbreaker
A Unicode-aware no_std crate (requires alloc) that rapidly finds all sequences of dictionary words that concatenate to a given string
-
presciidoc
Preprocessing AsciiDoc for other tools
-
szovegertesimutato-score
Calculate szovegertesimutato score for a given text and language
-
rmgrep
rust min grep text tools
-
intname
Full English name for any integer of any primitive integer type
-
latex-to-html
Latex to html converter
-
yitizi
異體字查詢 Get variant Chinese characters
-
truncrate
truncate strings economically without splitting your characters, symbols, emojis etc
-
stopwords
from popular text processing frameworks
-
thfst-tools
Support tools for DivvunSpell - convert ZHFST files to BHFST
-
simple-text-pattern
Matching simple text patterns
-
sauron-md
parsing markdown into sauron node
-
kanpyo-dict
Dictionary Library for Kanpyo
-
caribon
A repetition detector program and library
-
utf8-command
UTF-8 encoded
std::process::Command
output -
tabled
An easy to use library for pretty print tables of Rust
struct
s andenum
s -
inom
a text parser that converts code or inomscript to runnable rustlang for flow-based devgramming
-
mupdf-sys
Rust FFI binding to MuPDF
-
escaped-delimiter
Iterator of delimited slices with escape characters
-
tex
The νTeX typesetting engine
-
bookrafter
This repository contains code related to bookrafter rendering
-
markdown-meta-parser
parse markdown metadata
-
top-english-words
retrieve top words from the English language
-
rut
A small UTF-8 parsing library for applications that need to parse individual
char
s -
static_format
Format strings with no runtime overhead
-
chisel-parsers
Chisel parser front ends
-
bibliofile
A TUI epub reader inspired by DOS-era programs
-
bible-reference-rs
Extract Bible references from plain text
-
lindera-analyzer
A morphological analysis library
-
fribidi
bindings for Rust
-
pangu2
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
parser_kma_crate_beimuk
A brief description
-
esl01-renderdag
Render a graph into ASCII or Unicode text
-
rbatis_xml_parser
Push based streaming parser for xml
-
jp-location-relation
隣接する市区町村の一覧を取得
-
libharu
Rust binding for libharu(http://libharu.org/) PDF library
-
vaporetto_rules
Rule-base filters for Vaporetto
-
anystr
An abstraction over string encoding that supports ASCII, UTF-8, UTF-16 and UTF-32
-
opstr
‘Operate on strings’ command line utility
-
text2art
translate text to ascii art
-
tb_normalization
normalization utf8 string, loc dau vietnamese and some language
-
economic_indicator_finder
A finder for extracting economic indicators from paragraphs
-
retest
Command-line regular expression tester
-
mdplayscript
An extension of Markdown for play scripts
-
math-text-transform
Transform greek letters, latin letters, or decimal digits into certain variants from the mathematical alphanumeric symbols Unicode block (U+1D400–U+1D7FF). For example to bold, italic, script or double-struck.
-
mdbook-force-relative-links
An
mdbook
pre-processor to transform all local links to relative ones -
rsplitter
Splitter is a tool to split words from text that contains no spaces
-
basen
Convert binary data to ASCII with a variety of supported bases
-
lindera-assets
A helper crate to fetch assets and build dictionary for lindera
-
conv-rs
CLI converter for multi extensions
-
tpng
A small tool that prints truecolor png renderings to the terminal using unicode block characters
-
cdoc-parser
A markdown-based document parser with custom syntax
-
mdbook-numthm
An mdbook preprocessor for automatically numbering theorems, lemmas, etc
-
readwise
wrapper for the Readwise API
-
rsrpp
project for research paper pdf
-
kanabake
Turn Romaji strings into Hiragana or Katakana
-
ascii-rs
Process image into colored-ascii image
-
mdbook-iced
An mdBook preprocessor to turn iced code blocks into interactive examples
-
morc
Dead simple, minimal markdown generator library written in Rust
-
slidedeck
Create an HTML slide deck from Markdown
-
folia
High-performance library for handling the FoLiA XML format (Format for Linguistic Annotation)
-
stfu
Shut The Ferris Up - profanity filtering for Rust
-
mediawiki_parser
A strict parser for MediaWiki markdown
-
rckive-genpdf
User-friendly PDF generator written in pure Rust
-
decline-word
Choose word form based on given number
-
text_to_emoji
Convert text to emoji
-
textr
TeX-inspired plug-n-play interface for converting JSON documents into PDFs
-
moenster
mønster (n) - pattern. simple glob-style pattern matching for strings
-
tectonic_bridge_icu
Expose a subset of the ICU Unicode APIs to Rust/Cargo
-
spellcheck
A spell-checker based on the statistical algorithm described by Peter Norvig in http://norvig.com/spell-correct.html
-
japhonex
Japanese phone number checker for Rust
-
p4d-mdproof
Markdown to PDF converter
-
catmark
Console printer for CommonMark
-
typeline_ext_sqlite
sqlite integration for typeline
-
rapidquery
Rapid Query Executor
-
demoji-rs
demoji implementation for Rust
-
pra
Print Random ASCII
-
mepple
English dictionary as a library
-
my_project_parser_super_puper
A brief description
-
bitfont
Takes an ASCII string and generates a vector containing a bitmap font, for easy overlay into images
-
b2c2-common
b2c2の共通部分
-
mdtranslation-cli
Command-line tools for using mdTranslation, which can be used to prepare multi-lingual Markdown documents
-
corpus-preproc
A preprocessor for text and HTML corpora
-
csv2qr
command line tool for generating QR codes from a CSV file
-
deface
Lightweight markup to HTML converter
-
is-vowel
Heuristically test whether a character is a vowel letter
-
ewts-c
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (c lib)
-
utf8_slice
Lightweight UTF8 Slice Utilities
-
stardict_lib
Rust implement stardict
-
simple-word-count
word count function, try to get same result with Microsoft Office Word application
-
mdbook-bash-tutorial
A mdbook preprocessor that allows embedding Bash scripts as tutorials
-
text-scanner
A UTF-8 char-oriented, zero-copy, text and code scanning library
-
mdbook-najan
Preprocessor for the Najan mdBook
-
markdown-table-formatter
Markdown table formatter fully compliant with Unicode 15.1.0
-
homoglyph-core
Domain layer for the homoglyph cli and service. Compute homoglyphs for an input sentence.
-
newline-converter
Newline byte converter library
-
text-tables
A terminal/text table prettifier with no dependencies
-
regex-cli
A command line tool for debugging, ad hoc benchmarking and generating regular expressions
-
hex_table
Format a byte slice into a hex table
-
mdbook-last-changed
mdbook preprocessor to add the last modification date per page
-
rough
A very simple and opinionated static site generator
-
heckmv
A basic case-conversion renaming CLI tool
-
trevordmiller
Personal CLI
-
find-simdoc
Time- and memory-efficient all pairs similarity searches in documents
-
mdbook-embed
A preprocessor that simplifies embedded URL
-
fwuffgrep
Basic implementation of a grep command written in rust
-
df_cp437
Decoder for CP437 to UTF-8
-
spandex-hyphenation
Knuth-Liang hyphenation for a variety of languages
-
guarding_parser
Guarding is a guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.
-
indexrs
inefficient multi-language search index
-
asciis
ASCII base on RFC20. Just ord() and chr()
-
alphabet-encoder
A quick and dirty way to deal with escape characters
-
contractions
expand contractions in English
-
minigrep-yogie
A demo Rust to grep some word from rust-lang.com
-
hex_d_hex
HexDHex is a Rust Crate that encodes and decodes byte data to and from its hexidecimal representation. For instance, one may wish, on ocasion that is, to translate a utf8 or ASCII string…
-
grepy
search files
-
smoldown
Native Rust library for parsing Markdown
-
help_crafter
help message generator without hussle
-
naming_clt
Extract and convert the naming format(case|notation) of identifiers from files or stdin. Use this tool to prepare identifier name strings for further operations (matching,replacing...) on relative files
-
xsv
A high performance CSV command line toolkit
-
tablers
create tables and export them into CSV or PDF. Without external deps
-
any2utf8
Convert any enncoding to utf-8
-
basic-text
Basic Text strings and I/O streams
-
cyrla
two-way conversion between latin and cyrillic script
-
spinning_cube
Perspective projection of a spinning cube, using just ASCII characters
-
xim-ctext
compound text en/decoder
-
txtframe
Creates a frame for text
-
unic-idna-mapping
UNIC — IDNA — IDNA Mapping Table
-
unicode_converter
CLI tool to convert data between various Unicode encodings
-
hoedown
bindings for the Hoedown markdown processor
-
unicode-character-database
Unicode character database tables (Unicode Standard Annex #44) generated using
ucd-generate
-
simplearrayhash
-
simplecc
Chinese Convert library (partially) compatible with OpenCC's dictionaries
-
lithe-cli
A cli of lithe
-
japanese-ruby-filter
Japanese ruby notation parser
-
autoruby
Easily generate furigana for various document formats
-
esc
Escape characters in strings
-
hsk
Return HSK Level for Simplified Chinese Characters
-
string_py
aims to make the String type as easy to use as the str type in python
-
mime-rs
A text processing framework, inspired by Emacs lisp and keyboard macros
-
tnil
Parsing, glossing, and generating utilites for New Ithkuil
-
text-sanitizer
convert text to plain ASCII text
-
pdf_form
programatically filling out pdf forms
-
unicode_clusters
variable width unicode characters as single items, allowing for array like indexing etc
-
hebrew
alephbet primatives and parsing library for rust
-
sudachiclone
sudachiclone-rs is a Rust version of Sudachi, a Japanese morphological analyzer
-
fnew
A Unicode-aware line-oriented drop-in replacement for coreutils' fold
-
lindera-unidic
A Japanese morphological dictionary for UniDic
-
rsnltk
Rust-based Natural Language Toolkit
-
mystem
Wrapper around Yandex Mystem for Rust
-
lingua-greek-language-model
The Modern Greek language model for Lingua, an accurate natural language detection library
-
encoding-next-index-simpchinese
Index tables for simplified Chinese character encodings
-
cp437_constants
Defined constants covering the entirety of ASCII code page 437 (https://en.wikipedia.org/wiki/Code_page_437)
-
h_hangul
Korean Characters
-
lingua-portuguese-language-model
The Portuguese language model for Lingua, an accurate natural language detection library
-
tu
CLI tool to convert a natural language date/time string to UTC
-
goya
morphological analyzer for Rust and WebAssembly
-
backslash
parsing escape characters
-
mdbook-open-git-repo
mdbook preprocessor to add a open-on-git-repo link on every page
-
openlibrary-rs
A wrapper around openlibrary's Web API
-
fmtm_ytmimi_markdown_fmt
Fork of @ytmimi's Markdown formatter; powers FMTM
-
strings
String utilities, including an unbalanced Rope
-
utf-8
Incremental, zero-copy UTF-8 decoding with error handling
-
boilerpipe
text extraction from HTML documents
-
typos-dict
Source Code Spelling Correction
-
askama-filters
Extra template filters for Askama
-
ced
Dead easy csv editor
-
slack-blocks-render
Slack blocks render is a Rust library to render Slack blocks as Markdown
-
emoji_converter
Converts text to emojis
-
yozuk-model
NLP model generator for Yozuk
-
wordninja
port of the Word Ninja English word splitting library
-
jpreprocess-dictionary
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
text_alignment
Align your text in Rust in the CLI
-
quilltex
open-source Rust library designed to convert LaTeX documents into a Delta format that can be used with Quill.js and vice versa
-
simple_csv
CSV parsing implementation
-
emojicons
Parse :emoji: notation to unicode representation
-
mdbook-summary-generate
A mdbook preprocessor to generate SUMMARY.md from a directory structure
-
lyn
A scanner useful for creating recursive descent parsers
-
markdown-it-autolink
A markdown-it plugin for parsing GFM autolinks
-
trexter
Text progression tracking library
-
ascii-to-hex
A small, simple library to converting an ASCII text string into its hexadecimal equivalent
-
nutrimatic
Tools for reading Nutrimatic (https://nutrimatic.org) index files
-
encoding-index-korean
Index tables for Korean character encodings
-
kaomji-ru
The full http://kaomoji.ru database, encoded and escaped as byte literals
-
lingua-vietnamese-language-model
The Vietnamese language model for Lingua, an accurate natural language detection library
-
rusty_code_code_for_book
my book_rusty code
-
iterlower
Final-sigma-correct lowercasing iterator adapter with option for Turkish/Azeri I behavior
-
publish_demo_node
This program picks a random christmas emoji for you
-
mdbook-post
A CLI for add post to mdbook
-
poetry-book
Create a poetry book in latex, starting from plain text
-
textframe
query plain text documents by unicode offset without loading them all into memory
-
yeslogic-ucd-parse
parsing data files in the Unicode character database
-
markdown-it-tasklist
A markdown-it plugin for parsing GFM tasklists
-
sauron-markdown
parsing markdown into sauron node
-
perspicuity_formula
Calculate Flesh Reading Ease for a given text and language
-
norm-email
strip email provider defined behaviour from email addresses
-
STKLR
STKLR: pronounced 'stickler'. Is a cli tool to automatically link functions, enums, structs, traits etc in rust-doc docstrings. I couldn't find a tool like this when I needed it so... here we are.
-
dumbfuzz
dumb library for fuzzy search
-
bgrep
grep tailored to handle binary patterns and files
-
mdbook-mdinclude
Preprocessor for mdBook that better handles including markdown files
-
lindera-ipadic
A Japanese morphological dictionary for IPADIC
-
minigrep_lswarss
A very small part of Unix/Linux tool
grep
made with Rust for learning purpose while reading and studying the Rust Book -
naveengrep
command line tool similar to the grep
-
jawohl
Complete JSON documents from a stream of JSON tokens
-
raekna-common
contains common types and functionality for the raekna project
-
mdbook-multicode
Allows you to give multilanguage code examples, toggled by a spinner
-
chanoma
Characters Normalization library. 文字列正規化処理用のライブラリです。
-
aqua_parse
A parser library
-
charjpoet
Charj Poet is a API for write to
.cj
language -
dismej
convert text to discord emojis
-
harper
implement SSGs
-
mdbook-translation
prepare multi-lingual mdBook books
-
cutters
Rule based sentence segmentation library
-
slicer
that slices string slices into smaller string slices
-
encoding-index-japanese
Index tables for Japanese character encodings
-
ascii_utils
handle ASCII characters
-
mdbook-bibfile-referencing
An mdBook preprocessor to add bibfile referencing to each page
-
vividtext
change text color
-
recode_rs
Command-line tool for converting between the character encodings defined in the Encoding Standard
-
mdbook-to-github-wiki
Turns an mdbook book into a github wiki
-
uniaxe
replace Unicode letters with Ascii equivalents
-
ascii-alphabetic-char
Traits for ASCII alphabetic characters
-
grep-searcher
Fast line oriented regex searching as a library
-
github-slugger
A slugger for GitHub headings
-
mdbook-collector
A mdbook preprocessor which collects json/yaml/toml data from each post and builds a json file of it
-
textos
Texts, strings, formatting, unicode…
-
kytea-tokenizer
Wrapper of tokenization by KyTea
-
jellybean-pack-2
Sweet syntax highlighting with tree-sitter
-
pdf_composer_base
PDF Composer base functionality crate
-
asciifolding
ascii folding library
-
minigreper
Small grep style cli from the book
-
debug_canvas
Easily print complicated debug strings
-
borderrs
Add stylish borders around your text and datastructures
-
transliterate1234
UTF-8 to ASCII transliteration
-
genpdfi
User-friendly PDF generator written in pure Rust
-
scannedpdf
A fast, low memory footprint, simple images to PDF crate
-
lindera-dictionary-builder
Shared code for building Lindera dictionary files
-
conveyance
A stop-gap CLI for conveyancing
-
lingua-dutch-language-model
The Dutch language model for Lingua, an accurate natural language detection library
-
cjieba-sys
unsafe ffi to cppjieba
-
snakecase
general purpose snakecase implementation supporting both ascii and unicode
-
encoding-next-index-japanese
Index tables for Japanese character encodings
-
fst-subseq-ascii-caseless
An automaton that matches if the input contains a specific subsequence ignoring ASCII case to be used with fst
-
literate
programming tool that extracts code written in your Markdown files
-
utf8reader
wrapper around Reader that returns a stream of UTF-8 characters
-
lingua-czech-language-model
The Czech language model for Lingua, an accurate natural language detection library
-
pdf_forms
programatically filling out pdf forms
-
librsv
encoding/decoding the RSV file format (Rows of String Values)
-
markx
markdown parser
-
mdbook-shiftinclude
mdbook preprocessor for file inclusion with shift
-
cologne_phonetics
generate phonetic cologne codes for utf8 strings
-
writedown-html
Writedown HTML backend
-
mdbook-checklist
An mdBook preprocessor for generating checklists and indexes
-
mdbook-chapter-zero
A mdBook preprocessor that allows 0th (sub-)chapter
-
names-changer
Convert a names of sql schemes from camelcase to snake case
-
publish_demo_node_1
This program picks a random christmas emoji for you
-
unic-ucd-block
UNIC — Unicode Character Database — Unicode Blocks
-
libgrep-rs
searching through text
-
pikchr-cli
PIC-like diagramming language to SVG converter
-
findtext_pdf
Search text in PDF
-
equt-md-ext
Extend event iterator
-
varcon
Source Code Spelling Correction
-
encoding-next-index-korean
Index tables for Korean character encodings
-
clipcount
Counting words from the clipboard content
-
mdlynx
Small, fast utility to find broken file links in Markdown documents
-
mdbook-image-size
A mdbook preprocessor which support image size syntax
-
typeline_ext_http
http(s) tooling for typeline
-
pdftotext
High-level library that binds to Poppler to extract text from a PDF
-
lindera-cc-cedict
A Japanese morphological dictionary for CC-CEDICT
-
epubparse
Parse epub and convert to text-only Book structure
-
unicode-utf8
that converts utf-8 bytes to a unicode scalar value, and vice versa
-
nlprule-build
Build tools for a fast, low-resource Natural Language Processing and Error Correction library
-
mdlint-cli
tbd
-
ddvm
Document to Document Virtual Machine
-
html_to_pdf_lib
converting HTML to PDF
-
mdoc
Modern PDF creation through Markdown and LaTeX
-
ascii_set
Fast membership of ASCII character classes
-
forming
lightweight architecture as code language. 架构描述语言
-
markdown-it-deflist
A markdown-it plugin for parsing definition lists
-
base1112031
way to represent numbers using all Unicode characters except ASCII control characters
-
grep-table-converter
A cli utility to convert grep result to table (csv, markdown, textile)
-
mqtt-analyzer
Help analyzing MQTT messages
-
writedown
format parser
-
base_u256
base-u256 is to utf-8 as base-64 is to ascii
-
xmldecl
Extracts an encoding from an ASCII-based bogo-XML declaration in text/html in a Web-compatible way
-
djot
Djot parser written in pure Rust
-
jellybean-pack-0
Sweet syntax highlighting with tree-sitter
-
yeslogic-fontconfig
RENAMED: use the fontconfig crate instead
-
kanpyo
Japanese Morphological Analyzer
-
mdbook-hide
A preprocessor for mdbook that adds support for hidden chapters
-
mdbook-footnote
mdbook preprocessor for footnotes
-
combos
Print all permutations of a word list
-
techlead
CLI is a command-line interface that enables developers to chat with an AI assistant powered by the OpenAI GPT language model, designed specifically to help with your Rust project
-
nib
static site generator
-
font-map-core
Core font-parsing capabilities for font-map
-
chardetng_c
C bindings for chardetng
-
lindera-tokenizer
A morphological analysis library
-
hxgm30-noise
Noise-generation for use in hexagram30 projects
-
unicode_skeleton
detects unicode strings that look nearly identical once rendered, but do not compare as equal. It defines "confusable" and "skeleton" based on Unicode Standard Annex #39
-
unidecode
pure ASCII transliterations of Unicode strings
-
morsels_lang_ascii
Basic ascii tokenizer for morsels
-
smart-date
Parses relative and absolute dates in natural language text
-
jpreprocess-dictionary-builder
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
rnltk
Natural Language Toolkit for Rust
-
luaparse
A Lua 5.3 parser
-
mr-regex
Minimalist ASCII Regex-engine with 300 lines of Rust
-
tradukisto
Kinda useful natural language translation library and utility
-
swappy
An anagram generator
-
the-book-tui
A TUI application to load, read, and search The Rust Book
-
llmvm-outsource-lib
outsource backend for llmvm
-
mdbook-to-example
Turns an mdbook book into a Rust example
-
flag_by_code
Easiest way to get an emoji flag from BCP-47 language tags
-
code-tour
Enhanced example-based learning, i.e. awesome examples user experience
-
wz-conf
Configuration options for wz
-
deepfrog
A deep learning NLP suite (PoS,lemmatiser,NER) with FoLiA XML support
-
panduck-latex
Use panduck to generate XeLaTeX
-
asciify
converting images to a readable format on the command line
-
ascii-engine
A fast ascii engine written in rust!
-
json_to_table
pretty print JSON as a table
-
syllarust
quickly counting syllables
-
xgrepx
xgrep is a rust implementation of grep. This is a follow up from the rust book
-
try_publish_for_demo
This program picks a random christmas emoji for you
-
fzy-tmux
A rewrite of fzy's fzy-tmux script in Rust
-
morsels_lang_chinese
Chinese tokenizer for morsels
-
rure
A C API for Rust's regular expression library
-
jieba-macros
jieba-rs proc-macro
-
mdbook-typst-math
An mdbook preprocessor to use typst to render math
-
pdf-annotations-converter
Converts annotations found in PDF files to different formats
-
nib-cli
A cli for a yet another static site generator Nib
-
encoding-index-simpchinese
Index tables for simplified Chinese character encodings
-
encoding
Character encoding support for Rust
-
mdbook-fix-cjk-spacing
mdbook preprocess that fixes CJK line breaks
-
unic-idna
UNIC — Unicode IDNA Compatibility Processing
-
mdbook-webinclude
Preprocessor for mdBook that includes content from URLs
-
mdbook-snips
Markers for hidden lines in rust blocks within an mdbook
-
ascii-read
BufRead
-like methods for reading into anAsciiString
-
fvoid
Void file generator, supports many formats
-
mdbook-obsidian
mdBook preprocessor to render Obsidian specific syntax
-
mdbook-nix-eval
mdbook preprocessor for evaluating nix expressions
-
rustascii
Display Rust in ASCII
-
mdbook-numeq
An mdbook preprocessor for automatically numbering centered equations
-
typos-vars
Source Code Spelling Correction
-
mdbook-files
Preprocessor for mdbook which renders files from a directory as an interactive widget
-
tectonic_engine_xdvipdfmx
The
xdvipdfmx
program as a reusable crate -
typeline_ext_python
python integration for typeline
-
minigrep_crate
grep like console application
-
tpaint
Terminal based unicode and ascii diagrams and drawings
-
mdbook-boilerplate
Avoid the mdbook preprocessor boilerplate
-
owlz
"Owlz" ascii emoojis, created randomly or by design
-
rustyword
An anagram finder
-
mdbook-tagger
A mdbook preprocessor to collect and build tag data in Json format