-
regex
regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
similar
A diff library for Rust
-
textwrap
word wrapping, indenting, and dedenting strings. Has optional support for Unicode and emojis as well as machine hyphenation.
-
const_format
Compile-time string formatting
-
fancy-regex
regexes, supporting a relatively rich set of features, including backreferences and look-around
-
unicode-segmentation
Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules
-
unicode-normalization
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
pulldown-cmark
A pull parser for CommonMark
-
ropey
A fast and robust text rope for Rust
-
lazy-regex
lazy static regular expressions checked at compile time
-
widestring
wide string Rust library for converting to and from wide strings, such as those often used in Windows API or other FFI libaries. Both
u16
andu32
string types are provided, including support for UTF-16 and UTF-32… -
unicode-bidi
Unicode Bidirectional Algorithm
-
deunicode
Convert Unicode strings to pure ASCII by intelligently transliterating them. Suppors Emoji and Chinese.
-
tokenizers
today's most used tokenizers, with a focus on performances and versatility
-
html2text
Render HTML as plain text
-
unicase
A case-insensitive wrapper around strings
-
termimad
Markdown Renderer for the Terminal
-
rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
convert_case
Convert strings into any case
-
emojis
✨ Lookup emoji in *O(1)* time, access metadata and GitHub shortcodes, iterate over all emoji, and more!
-
unicode-width
Determine displayed width of
char
andstr
types according to Unicode Standard Annex #11 rules -
diff
An LCS based slice and string diffing implementation
-
Inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
lopdf
PDF document manipulation
-
prettydiff
Side-by-side diff for two files
-
onig
Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.
-
chardetng
A character encoding detector for legacy Web content
-
fuzzy-matcher
Fuzzy Matching Library
-
stringprep
algorithm
-
text-splitter
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
-
finl_unicode
handling Unicode functionality for finl (categories and grapheme segmentation)
-
roff
ROFF (man page format) generation library
-
pulldown-cmark-to-cmark
Convert pulldown-cmark Events back to the string they were parsed from
-
indenter
A formatter wrapper that indents the text, designed for error display impls
-
unicode-script
exposes the Unicode
Script
andScript_Extension
properties from UAX #24 -
unicode-general-category
Fast lookup of the Unicode General Category property for char
-
const-str
compile-time string operations
-
shell-escape
Escape characters that may have a special meaning in a shell
-
lngcnv
linguistics: display pronunciation, translate between dialects, convert between orthographies; support for multiple languages: English, Latin, Polish, Quechua, Spanish, Tikuna
-
lindera-cli
A morphological analysis command line interface
-
charabia
detect the language, tokenize the text and normalize the tokens
-
titlecase
Capitalize text according to a style defined by John Gruber for Daring Fireball
-
ascii
ASCII-only equivalents to
char
,str
andString
-
text_io
really simple to use panicking input functions
-
regex-syntax
A regular expression parser
-
slug
Convert a unicode string to a slug
-
google-dialogflow3
A complete library to interact with Dialogflow (protocol v3)
-
mdbook
Creates a book from markdown files
-
synoptic
low-level, syntax highlighting library with unicode support
-
tailspin
A log file highlighter
-
diffy
Tools for finding and manipulating differences between files
-
netidx
Secure, fast, pub/sub messaging
-
regress
A regular expression engine targeting EcmaScript syntax
-
unicode_names2
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
printpdf
writing PDF files
-
minimizer
Minimize files to find minimal test case
-
text-size
Newtypes for text offsets
-
linkify
Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.
-
asciisavers
A small collection of ascii screensavers
-
hyperlink
Very fast link checker for CI
-
str_indices
Count and convert between indexing schemes on string slices
-
ferris-says
flavored replacement for the classic cowsay
-
matchers
Regex matching on character and byte streams
-
line-index
Maps flat
TextSize
offsets to/from(line, column)
representation -
unescaper
Unescape strings with escape sequences written out as literal characters
-
lipsum
lorem ipsum text generation library. It generates pseudo-random Latin text. Use this if you need filler or dummy text for your application. The text is generated using a simple Markov chain…
-
epub-builder
generating EPUB files
-
stringzilla
Faster SIMD-accelerated string search, sorting, fingerprints, and edit distances
-
wana_kana
checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji
-
mdbook-katex
mdBook preprocessor rendering LaTeX equations to HTML
-
unindent
Remove a column of leading whitespace from a string
-
mdcat
cat for markdown: Show markdown documents in terminals
-
rphonetic
Rust port of phonetic Apache commons-codec algorithms
-
focaccia
no_std implementation of Unicode case folding comparisons
-
wildcard
matching
-
mdbook-pdf
A backend for mdBook written in Rust for generating PDF based on headless chrome and Chrome DevTools Protocol
-
mdbook-admonish
A preprocessor for mdbook to add Material Design admonishments
-
font-types
Scalar types used in fonts
-
cruet
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
uncased
Case-preserving, ASCII case-insensitive, no_std string types
-
stfu8
Sorta Text Format in UTF-8
-
prefix
A customizable pretty printer for FIX messages
-
bfom
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
decancer
that removes common unicode confusables/homoglyphs from strings
-
yeslogic-ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched with support for additional tables
-
unicode-reverse
Unicode-aware in-place string reversal
-
unicode-truncate
Unicode-aware algorithm to pad or truncate
str
in terms of displayed width -
autocorrect
A linter and formatter for help you improve copywriting, to correct spaces, words, punctuations between CJK (Chinese, Japanese, Korean)
-
lindera
A morphological analysis library
-
vesti
A preprocessor that compiles into LaTeX
-
scraps
A static site generator that builds a wiki from a set of markdown files
-
sd
An intuitive find & replace CLI
-
xi-unicode
Unicode utilities useful for text editing, including a line breaking iterator
-
hck
A sharp cut(1) clone
-
unicode-id
Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31
-
unicode-blocks
contains a list of all unicode blocks and provides some functions to search across them
-
aho-corasick-unsafe
Fast multiple substring searching
-
pad
padding strings at runtime
-
shell-toy
A "fortune | cowsay" implementation in Rust, i.e. a nice little toy to liven up your shell.
-
fortune-rs
classic BSD fortune program
-
subplot
tools for specifying, documenting, and implementing automated acceptance tests for systems and software
-
jieba-rs
The Jieba Chinese Word Segmentation Implemented in Rust
-
mlc
The markup link checker (mlc) checks for broken links in markup files
-
skyspell
Fast and handy spell checker for the command line
-
llmvm-core
The core application for llmvm
-
pdf-extract
extract content from pdfs
-
stam
powerful library for dealing with stand-off annotations on text. This is the Rust library.
-
chewing
(酷音) intelligent Zhuyin input method
-
languagetool-rust
LanguageTool API bindings in Rust
-
rustyline-async
A minimal readline with multiline and async support
-
omekasy
Decorate alphanumeric characters in your input with various font; special characters in Unicode
-
boreal
evaluate YARA rules, used to scan bytes for textual and binary pattern
-
tibco_ems
A high level API for the Tibco EMS
-
mdbook-catppuccin
🎊 Soothing pastel theme for mdBook
-
clipboard-rs
Cross-platform clipboard API (text | image | rich text | html | files | monitoring changes) | 跨平台剪贴板 API(文本|图片|富文本|html|文件|监听变化) Windows,MacOS,Linux
-
cow-utils
Copy-on-write string utilities for Rust
-
fontfor
find fonts which can show a specified character and preview them in terminal or browser
-
near-facsimile
Find similar or identical text files in a directory
-
utf16_iter
Iterator by char over potentially-invalid UTF-16 in &[u16]
-
hgrep
grep tool with human-friendly search output. This is similar to
-C
option ofgrep
command, but its output is enhanced with syntax highlighting focusing on human readable outputs. -
tossicat
입력된 단어에 맞게 같이 입력된 토시(조사)를 적절하게 변환하는 라이브러리
-
fm
Non-backtracking fuzzy text matcher
-
file-organiser
Command line file manager to list, move or delete large numbers of files in nested folders filtered by age, file extension, file name pattern and/or size range
-
hypher
separates words into syllables
-
mdbook-graphviz
mdbook preprocessor to add graphviz support
-
aki-resort
sort lines of text. You can use regex to specify the KEY.
-
wezterm-bidi
The Unicode Bidi Algorithm (UBA)
-
any_ascii
Unicode to ASCII transliteration
-
aneubeck-daachorse
Daachorse: Double-Array Aho-Corasick
-
difflib
Port of Python's difflib library to Rust
-
unicode-bidi-mirroring
Unicode Bidi Mirroring property detection
-
none-shall-pass
Artifact for GitHub Action to validate hyperlinks in all markdown files
-
console_static_text
Logging for text that should stay in the same place in a console
-
newdoc
Generate pre-populated module files formatted with AsciiDoc that are used in Red Hat and Fedora documentation
-
slice-command
slice is a command-line tool that allows you to slice the contents of a file using syntax similar to Python's slice notation
-
affinidi-messaging-text-client
Affinidi Messaging SDK
-
fiat-lux
Offline terminal-accessible Bible
-
mdbook-combiner
combine mdbook summaries from multiple source into one mdbook
-
vmks-exam-generator
CLI program for pseudo-randomly generating different variants of an embedded programming exam
-
ncount
A word count tool intended to derive useful stats from markdown
-
igrepper
The interactive grepper
-
regex-cursor
regex fork that can search discontiguous haystacks
-
sile
Simon’s Improved Layout Engine
-
stylin
Convert markdown to pandoc markdown with custom styles
-
pulldown-html-ext-cli
CLI tool for extended HTML rendering of Markdown with pulldown-cmark
-
rapidfuzz
rapid fuzzy string matching library
-
quixote
Quizzes and tests in Markdown
-
qpdf
Rust bindings to QPDF C++ library
-
tantivy-analysis-contrib
A set of analysis components for Tantivy
-
dvd-term
A bouncing ASCII art DVD logo (or custom text) for the terminal
-
probly-search
A lightweight full-text search engine with a fully customizable scoring function
-
mdbook-quiz
Interactive quizzes for your mdBook
-
molybdenum
Recursive search and replace CLI application
-
regex-literal
delimited regular expression literals
-
tree-sitter-stack-graphs-typescript
Stack graphs definition for TypeScript & TSX using tree-sitter-typescript
-
savvy
R extension interface
-
tantivy-stemmers
A collection of Tantivy stemmer tokenizers
-
nvl-cli
A program to download webnovels
-
mdbook-epub
An EPUB renderer for mdbook
-
fasta-cleaner
Transform fasta files by upper-casing all sequence characters and removing non-ACGT sequence characters
-
hyphenation
Knuth-Liang hyphenation for a variety of languages
-
fuzzt
Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.
-
unicode-case-mapping
Fast lowercase, uppercase, and titlecase mapping for characters
-
mdbook-yapp
A mdBook preprocessor for simple text replacements
-
chamkho
Khmer, Lao, Myanmar, and Thai word segmentation/breaking library and command line
-
sf-api
API to send commands to the Shakes & Fidget servers and parse their responses into characters
-
obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
vi
An input method library for vietnamese IME
-
collclean
Clean up collaboration commands in LaTeX files
-
pager
pipe your output through an external pager
-
qwt
Quad Wavelet Tree
-
htmd-cli
The command line tool for htmd
-
unicode_categories
Query Unicode category membership for chars
-
wchar
Procedural macros for compile time UTF-16 and UTF-32 wide strings
-
prop-check-rs
A Property-based testing Library in Rust
-
unicode-security
Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules
-
minix
straightforward minifier for JavaScript and CSS files, developed using Rust. This tool efficiently reduces the size of JS and CSS files by removing unnecessary characters, whitespace…
-
spellbook
A spellchecking library compatible with Hunspell dictionaries
-
rsrpp-cli
project for research paper pdf
-
minimo
terminal ui library combining alot of things from here and there and making it slightly easier to play with
-
entities
raw data needed to convert to and from HTML entities
-
vaporetto
pointwise prediction based tokenizer
-
bashtestmd
Compiles shell commands in .md files into Bash scripts for testing
-
string-patterns
Makes it easier to work with common string patterns and regular expressions in Rust, adding convenient regex match and replace methods (pattern_match and pattern_replace) to the standard…
-
jetscii
A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes
-
fax
Decoder and Encoder for CCITT Group 3 and 4 bi-level image encodings used by fax machines TIFF and PDF
-
gh-emoji
Convert
:emoji:
to Unicode using GitHub’s emoji names -
fontkit
font loading and indexing
-
stringcase
Converts string cases between camelCase, COBOL-CASE, kebab-case, and so on
-
fasttext
binding
-
ident_case
applying case rules to Rust identifiers
-
graphannis
new backend implementation of the ANNIS linguistic search and visualization system
-
uuhelp_parser
A collection of functions to parse the markdown code of help files
-
unicode_titlecase
add Unicode titlecase and Turkish and Azeri locale upper/lowercase utilities to chars and strings
-
uwc
Counts things in unicode text files
-
b2c2-parser
b2c2におけるBASICコードのパーサー?
-
zalgo-codec
Convert an ASCII text string into a single unicode grapheme cluster and back. Provides a macro for embedding Rust source code that has been encoded in this way.
-
figlet-comment
quickly create banner to use as comments
-
bbd
Binary Braille Dump
-
percent-encoding
Percent encoding and decoding
-
lindera-ko-dic
A Japanese morphological dictionary for ko-dic
-
svgbob
Transform your ascii diagrams into happy little SVG
-
yffi
Bindings for the Yrs native C foreign function interface
-
autotex
Continuously compile TeX and LaTeX
-
sgrep
grep util for those lazy to remember many command line options
-
vidyut-prakriya
A Sanskrit word generator
-
rustyink
Blazing fast static site generator
-
zhconv
Traditional/Simplified and regional Chinese variants converter based on MediaWiki & OpenCC rulesets and powered by AC automata 轉換简体、繁體及兩岸、新馬中文地區詞,基於MediaWiki和OpenCC之字詞轉…
-
basic-text
Basic Text strings and I/O streams
-
timug
It has been created for personal blog creation purpose. Timus has its limits, but it fulfills the purposes for which it was created.
-
mut-str
A toolkit for working with mutable string slices (&mut str)
-
text_analysis
Analyze text stored as *.txt in provided file or directory. Doesn't read files in subdirectories. Counting all words and then searching for every unique word in the vicinity (+-5 words)…
-
nu_plugin_emoji
a nushell plugin called emoji
-
frida-build
Rust bindings for Frida
-
case_insensitive_hashmap
A HashMap that uses case-insensitive strings as keys
-
instant-segment
Fast English word segmentation
-
vader-sentimental
A faster Rust version from the original Python VaderSentiment analysis tool
-
fontconfig
Safe, higher-level wrapper around the Fontconfig library
-
pretty-xmlish
Pretty print XML-ish data with unicode art
-
inkjet
A batteries-included syntax highlighting library for Rust, based on tree-sitter
-
pandoc
API that wraps calls to the pandoc 2.x executable
-
mdbook-mermaid
mdbook preprocessor to add mermaid support
-
mupdf
Safe Rust wrapper to MuPDF
-
rustkorean
processing Korean characters. It provides functionalities to check if a character is Korean, classify Korean characters, verify if a character is a leading consonant (choseong), a medial vowel (jungseong)…
-
float-pretty-print
Format f64 for showing to user, not for serialisation
-
mago-docblock
Analyzes PHP docblocks to extract annotations, tags, and documentation comments, aiding tools that rely on inline documentation
-
norad
Read and write Unified Font Object files
-
levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
frawk
an efficient Awk-like language
-
heart-strings
Quickly get random heart emojis to copy!
-
nanohtml2text
A zero-dependency library to convert HTML to plain text
-
lexical-sort
Sort Unicode strings lexically
-
arrow-row
Arrow row format
-
bpetok
CLI for tokenizing text input using Byte Pair Encoding (BPE)
-
unicode-ccc
Unicode Canonical Combining Class detection
-
nucleo-matcher
plug and play high performance fuzzy matcher
-
analiticcl
approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation
-
harfbuzz_rs
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
mdbook-theme
A preprocessor and a backend to config theme for mdbook, especially creating a pagetoc on the right and setting full color themes from the offical ace editor
-
grok
popular java & ruby grok library which allows easy text and log file processing with composable patterns
-
mdbook-pandoc
A mdbook backend that outsources most of the rendering process to pandoc
-
svgc
svgc
(previously SvgCompress) is a tool for compressing SVG files by removing unnecessary whitespace, comments, metadata and some other data. It also supports optimization with SVGO and compression into SVGZ. -
egui-dropdown
An actual dropdown list for egui
-
asciimath-unicode
Convert asciimath to unicode
-
precis-tools
Tools and parsers to generate PRECIS tables from the Unicode Character Database (UCD)
-
pulldown-cmark-toc
Generate a table of contents from a Markdown document
-
notion2html
Convert Notion pages to HTML
-
tantivy-jieba
that bridges between tantivy and jieba-rs
-
glu
Unpacks a Markdown document full of code snippets into a temp directory
-
mdbook-alerts
mdBook preprocessor to add GitHub Flavored Markdown's Alerts to your book
-
aki-xcat
concatenate files that are plain, gzip, xz and zstd
-
pinyin-parser
Parses a string of pinyin syllables. Covers marginal cases such as
ẑ
,ŋ
andê
. -
fish-printf
printf implementation, based on musl
-
simpleml
parser/writer for the Simple Markup Language format, as defined by Stenway. See https://dev.stenway.com/SML/. SML builds on top of WSV but allows hierarchical structuring of data.
-
shell2batch
Coverts simple basic shell scripts to windows batch scripts
-
cskk
C ABIから使う事を目的とした SKK(Simple Kana Kanji henkan)方式のかな漢字変換ライブラリ
-
gst-plugin-textahead
GStreamer Plugin for displaying upcoming text buffers ahead of time
-
rust-persian-tools
Official Rust implementation of Persian Tools
-
trans-epub
Translate EPUB with CLI
-
bogrep
Full-text search for bookmarks from multiple browsers
-
mktoc
Generate Table of Contents from Markdown files
-
libreofficekit
in Rust
-
emojic
Emoji constants
-
textra
type less, write more
-
nlpo3
Thai natural language processing library, with Python and Node bindings
-
numeric-sort
A zero-allocation, human-readable sorting library
-
serpl
terminal UI for search and replace, ala VS Code
-
minspan
a package for determining the minimum span of one vector within another
-
typed-oid
Typed Object IDs
-
lll_toon
convert JPG images to ASCII art with colors
-
termfactor
is your terminal sick, mid or sus? test your terminal features & find out
-
mdbook-pagebreaks
A mdbook preprocessor to insert page breaks when rendering to HTML
-
filenamify
Convert a string to a valid filename
-
chocodye
changing the color of the chocobos' plumage in Final Fantasy XIV
-
dcsv
Dyanmic csv reader,writer,editor
-
unicode-xid
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
asciidork-parser
Asciidork parser
-
latex-thebib
Clean and sort legacy TeX bibliographies written using ‘thebibliography’ via the
refactor
sub-command. Compile BibTeX files to legacythebibliography
TeX code using thecompile
sub-command… -
gen-mdbook-summary
generate SUMMARY.md for mdbook project
-
matcher_rs
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
substring
method for string types
-
zet
zet finds the union, intersection, set difference, etc of files considered as sets of lines
-
sk-skimmer
Fuzzy Finder in rust!
-
pluralizer
Rust package to pluralize or singularize any word based on a count inspired on pluralize NPM package
-
supply-chain-trust-example-crate-000022
Fast multiple substring searching
-
unicode-joining-type
Fast lookup of the Unicode Joining Type and Joining Group properties
-
roe
Unicode case conversion
-
retryable-result
repeatedly try on some recoverable errors until too many or a fatal error
-
search-in-terminal
A terminal-based search tool
-
text2num
Parse and convert numbers written in English, Dutch, Spanish, German, Italian or French into their digit representation
-
extract_anchors
Утилита для извлечения из исходных кодов всех помеченных отрывков
-
git-blamediff
A program to automatically annotate changes to a file in git(1)
-
wit-bindgen-markdown
Markdown generator for WIT and the component model, typically used through the
wit-bindgen-cli
crate -
jayce
tokenizer 🌌
-
advreader
Split text file into text sequences, strings and (line) comments
-
zspell
Native Rust library for spellchecking
-
hebrew_unicode_script
A lightweight library to check if a hebrew character belongs to certain collections
-
composefs
filesystem
-
array_tool
Helper methods for processing collections
-
chinese_dictionary
A searchable Chinese / English dictionary with helpful utilities
-
mdbook-typst
An mdBook backend to output Typst markup, pdf, png, or svg
-
termdiff
Write a diff with color codes to a string
-
text-to-ascii-art
program to convert text to ASCII art
-
utf8_iter
Iterator by char over potentially-invalid UTF-8 in &[u8]
-
mdbook-toc
mdbook preprocessor to add Table of Contents
-
asciigraph
Ascii Graph
-
seshat-unicode
A Unicode Library for Rust. Unicode 16.0.0 ready. XID_Start and XID_Continue are also available.
-
iepub
epub、mobi电子书读写
-
inflections
High performance inflection transformation library for changing properties of words like the case
-
mdbook-private
An mdbook preprocessor that controls visibility of private chapters and sections within them
-
arf-strings
Encoding and decoding for ARF strings
-
json_to_table
pretty print JSON as a table
-
hh-highlight
hh: The text highlighter. Read from stdin, and highlight selected words printed to stdout
-
textpod
Local, web-based notetaking app inspired by 'One Big Text File' idea
-
bwrap
A fast, lightweight, embedded systems-friendly library for wrapping text
-
kathoey
text feminization using open corpus linguistics data
-
rutie
The tie between Ruby and Rust
-
unidown
Convert Markdown to Unicode
-
oxford_join
Join string slices with Oxford Commas!
-
sublime_fuzzy
Fuzzy matching algorithm based on Sublime Text's string search
-
colonnade
format tabular data for display
-
mdbook-cmdrun
mdbook preprocessor to run arbitrary commands
-
mdbook-angular
mdbook renderer to run angular code samples
-
ib-pinyin
一个高性能拼音匹配库
-
in_definite
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
-
kbremap
Custom keyboard layouts for windows
-
treegrep
A pattern matcher frontend or backend which displays results in a tree
-
line-straddler
Determine how lines through text (underlines, strikethroughs, etc) should be rendered
-
repvar
A tiny CLI tool that replaces variables of the style
${KEY}
in text with their respective value. It can also be used as a rust library -
lindera-dictionary
A morphological analysis library
-
uclanr
A random word picker that gives you actually useful words
-
slack-blocks-render
Slack blocks render is a Rust library to render Slack blocks as Markdown
-
shwild
(for Rust)
-
linebreak
breaking a given text into lines within a specified width
-
oxifetch
program that displays key system information, such as OS details, uptime, CPU specs, memory usage, and more. The output includes an ASCII art logo and a quick overview of your machine's current status.
-
rust-ai
A collection of 3rd-party AI APIs for Rust
-
ewts
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (lib)
-
secular
No Diacr!
-
clima
A minimal Markdown reader in the terminal
-
mdbook-open-on-gh
mdbook preprocessor to add a open-on-github link on every page
-
mini_markdown
Small markdown renderer
-
mdmodels
generate models, code and schemas from markdown files
-
easy_reader
easily navigating forward, backward or randomly through the lines of huge files
-
turn-uppercase
Small command to uppercase text in command line and copy to clipboard
-
iirs
Inverted repeats finder
-
detone
Decompose Vietnamese tone marks
-
dialogi
A dialog parser
-
wordcut-engine
Word segmentation/breaking library
-
charname
Incredibly simple library that just gives you the Unicode name for a character
-
supply-chain-trust-example-crate-000021
regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
rsonpath-lib
Blazing fast JSONPath query engine powered by SIMD. Core library of
rsonpath
. -
nu-utils
Nushell utility functions
-
text-editing
string with utilities for editing
-
yamd
Markdown Document (flavour)
-
whitespace-sifter
Sift duplicate whitespaces away!
-
crowbook-text-processing
some utilities functions for escaping text (HTML/LaTeX) and formatting it according to typographic rules (smart quotes, ellipsis, french typograhic rules)
-
santoka
Translations of 668 of Taneda Santoka's free-verse haiku
-
regex-charclass
Manipulate and convert regex character classes
-
kelp
A convert tool for Japanese
-
unicode-canonical-combining-class
Fast lookup of the Canonical Combining Class property
-
pprint
Flexible and lightweight pretty printing library for Rust
-
words-count
Count the words and characters, with or without whitespaces
-
mkwebsite
build websites using markdown
-
crlify
A std::io::Write wrapper that replaces with on Windows
-
reggy
friendly, resumable regular expressions for text analytics
-
rs-tool
A command-line tool to perform reservoir sampling on a file or a stream
-
uwurs
UwUify your strings with uwurs!
-
deliminator
Universal code documentation generator
-
dprint-plugin-markdown
Markdown formatter for dprint
-
mandown
Markdown to groff (man page) converter
-
sqdj
sqdj shortens delimited data
-
mdbook-codeblocks
A mdbook preprocessor to prepend customizable vignette to code blocks
-
avt
asciinema virtual terminal
-
indent
Functions for indenting multiline strings
-
what-rs
Identify what something is! A pyWhat reimplementation in Rust
-
armnod
random string generator
-
tectonic_bridge_core
Exposing core backend APIs to the Tectonic C/C++ code
-
common-words-all
Most common words sorted by ngram frequency
-
indefinite
Prefix a noun with an indefinite article - a or an - based on whether it begins with a vowel
-
iregex
Intermediate representation for Regular Expressions
-
casile
The command line interface to the CaSILE toolkit, a book publishing workflow employing SILE and other wizardry
-
mdbook-chess
An mdbook preprocessing plugin to generate chess boards
-
ColorShell
A small crate for coloring text for rust
-
pdf-sign
extract signed date from pdf file
-
mdbook-linkcheck2
A backend for
mdbook
which will check your links for you -
mantra-lang-tracing
Contains functionality needed to collect requirement traces from code or plain text files for the
mantra
framework -
hidden_watermark
Hidden Watermark in Rust
-
textcode
Text encoding/decoding library. Supports: UTF-8, ISO6937, ISO8859, GB2312
-
utilprint
easily using colors or unicode in your terminal printing
-
dmos
Djot HTML renderer with advanced features
-
date_time_parser
Rust NLP library for parsing English natural language into dates and times
-
arabic_reshaper
Reconstruct Arabic sentences to be used in applications that doesn't support Arabic script
-
diary-cli
A powerful cli for documenting and keeping a diary
-
inlet_manifold
A general purpose highlighting library
-
catalog-of-markdown
Generate the catalog of markdown file
-
cbfr
A buffer that run on stack, focusing on performance and speed
-
stop-words
Common stop words in many languages
-
rust_string_utils
String utilities for rust based on org.apache.commons.lang3
-
zipcodes
Query US zipcodes without SQLite
-
roman-numerals-rs
Manipulate well-formed Roman numerals
-
poppler-sys-rs
Low-level (FFI) bindings for poppler-glib
-
semsimian
Sematic similarity calculations for ontologies implemented in Rust
-
orly
Download O'Reilly books as EPUB
-
fmtm
A diff-friendly Markdown formatter that breaks lines on sensible punctuations and words to fit a line width
-
nibi
static site generator. (WIP, implementing)
-
mdbook-ocirun
mdbook preprocessor to run arbitrary commands and code snippets inside containers
-
diacritics
Remove diacritics from letters, for example when standardizing input for a search
-
smol-symbol
compile-time globally unique, 25-character, 128-bit symbols similar to the Symbol type in Crystal Lang
-
natural
Pure rust library for natural language processing
-
gh_page_tool
A github gh-pages tool for static blog site
-
ethan-rs-wc
The ethan-rs-ws(erwc) is word, line, character, and byte count. Like wc command but not just wc command, more accurate and faster. Text can also be read from standard input for statistics.
-
hr-id
A human-readable unique ID which supports Unicode, safe for URLs and file paths
-
dptran
run DeepL translations on command line written by Rust
-
sre-engine
A low-level implementation of Python's SRE regex engine
-
as-what
Provide a bunch of
as
traits -
tokenizers-enfer
today's most used tokenizers, with a focus on performances and versatility
-
kas-text
Text layout and font management
-
fiberplane-markdown
convert Fiberplane Notebooks to and from Markdown
-
mdbook-spec
An mdBook preprocessor to help with the Rust specification
-
yara-x
A pure Rust implementation of YARA
-
gosyn
Golang syntax parser in Rust
-
swimos_model
SwimOS Generic Representation of the SwimOS Serialization Model
-
lookbook
Component preview framework for Dioxus
-
simdutf
Unicode validation and transcoding at billions of characters per second
-
uast
Unicode Aware Saṃskṛta Transliteration in Rust 🦀
-
str_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
rewrite
Safely rewrite file contents from stdin, even when file is open as an input
-
mini-openai
An OpenAI API client with minimal dependencies
-
rmbs
Remove any fluff, corporate speak, or other bullshit from input text and print the TL;DR essence of what's being said, using the www.bullshitremover.com public LLM API
-
semchunk-rs
A fast and lightweight Rust library for splitting text into semantically meaningful chunks
-
ipset_lookup
ipset is a command-line tool that takes networks or IPs and searches through a lot of different threat feeds quickly. It can also download the feed data necessary to perform the queries…
-
ascii_help
help you quickly convert ASCII codes
-
mdbook-embedify
based mdbook preprocessor plugin that allows you to embed apps to your book, like youtube, codepen and some other apps
-
dnd-character
A Dungeons and Dragons character generator
-
ragtime
Easy Retrieval Augmented Generation
-
url_parser
URL Parser is a Rust parser developed to parse URLs into structured components such as scheme, domain, path, query and fragment
-
html_parser_tarasenko
Базовий HTML-парсер на Rust з використанням Pest
-
safe-string
safe interface for interacting with multi-byte strings in Rust, namely IndexedStr, IndexedString, and IndexedSlice
-
supply-chain-trust-example-crate-000098
Determine displayed width of
char
andstr
types according to Unicode Standard Annex #11 rules -
picodiff
Tiny GUI app to compare text easily
-
swift-check
High-performance, robust, and expressive searching and validation (uses SIMD on x86_64, aarch64, and WASM)
-
kbo
Spectral Burrows-Wheeler transform accelerated local alignment search
-
codepack
A *lightning-fast* tool that transforms a directory into a single, LLM-friendly text file, perfect for easy processing and analysis
-
uklatn
Ukrainian Cyrillic transliteration to Latin script
-
obmrs
As a participant, you will create a structure to receive and hold the exchange-distributed order book. This structure will be called the OrderBoard, and will hold the order book's bids and asks as a price-sorted map…
-
soundex-rs
that calculates soundex value
-
md-tui
A terminal markdown viewer
-
mdopen
Preview markdown files in a browser
-
character_converter
Turn Traditional Chinese script ot Simplified Chinese script and vice-versa and tokenize
-
presenterm
A terminal slideshow presentation tool
-
capitalize
Change first character to upper case and the rest to lower case, and other common alternatives
-
nu_plugin_regex
nu plugin to search text with regex
-
CLI_Project_Scott_Coakley
CLI Project in Rust
-
tremor-kv
A logstash inspured key value extractor
-
pandoc_types
Rust port of pandoc-types
-
rins_markdown_parser
markdown parser written on Rust
-
bump-bin
Increments version with semver specification
-
b2c2-tokenizer
b2c2のBASICコードのトーカナイザー?
-
asimov-sdk
ASIMOV Software Development Kit (SDK) for Rust
-
data-streams
Extension traits for reading and writing data with streams
-
jpreprocess
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
mdbook-pikchr
A mdbook preprocessor to render pikchr code blocks as images in your book
-
sourceannot
render snippets of source code with annotations
-
hangeul
Korean alphabet manipulation library
-
rust-beam
A LaTeX slide generator you can write in faster than beamer
-
trpl
A support crate for The Rust Programming Language book
-
mdbook-d2
D2 diagram generator plugin for MdBook
-
harfbuzz_rs_now
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
unicount
Alphabetic counter supporting unicode
-
unic-ucd-ident
UNIC — Unicode Character Database — Identifier Properties
-
cloc
Count, or compute differences of, lines of source code and comments
-
lorem-ipsum
Quickly generate placeholder test
-
fetch-catnip
fetch displaying system information and a cute cat
-
dodo-zh
operate on pinyin and cedict.u8 file
-
bin-rs
Binary Reader from multi source
-
choco
markup language for dialogue systems
-
mago-fixer
Applies automated fixes and transformations to text
-
mdbook-presentation-preprocessor
A preprocessor for utilizing an MDBook as slides for a presentation
-
dbxcase
Dropbox-compatible case-folding algorithm
-
mdbook-dtmo
Creates a book from markdown files with added plugins
-
mdsh
Markdown shell pre-processor
-
broken-md-links
A command-line tool and library to detect broken links in Markdown files
-
human_regex
A regex library for humans
-
mdtrans
Markdown parser and transformer using
pest.rs
, focused on flexibility to a project’s needs -
aki-gsub
substitude text command, replace via regex
-
jawk
JSON AWK
-
dom-content-extraction
Content extraction via text density paper
-
harper-core
The language checker for developers
-
tre-regex
Rust safe bindings to the TRE regex module
-
string-simple
containing some simple string utilities that I use in my other projects
-
tfon
Bitmap font parsing / conversion
-
repr
The regular-expression-as-linear-logic interpretation and its implementation
-
rustsay
CLI tool in Rust that mimics the classic cowsay program, allowing a cow to speak your text in the terminal
-
stylish-ansi
stylish
helpers for writing styles as ANSI escape codes -
bce
-
mdbook-variables
mdBook proprocessor for risolve variables configured from book.toml
-
wikipedia_prosesize
Count Wikipedia prose size
-
tesseract-rs
Rust bindings for Tesseract OCR with optional built-in compilation
-
quranize
Encoding transliterations into Quran forms
-
paltoquet
rule-based general-purpose tokenizers
-
rustblocks
tetromino based falling blocks game
-
uwl
A management stream for bytes and characters
-
truncate_string_at_whitespace
Truncate a &str at the closest whitespace to a specified length with unicode safety
-
char-ranges
Iterate chars and their start and end byte positions
-
mdi
markdown include
-
pink_accents
Replacement of patterns in string to simulate speech accents
-
stam-python
STAM is a library for dealing with standoff annotations on text, this is the python binding
-
csv_to_table
pretty print CSV as a table
-
typst-ansi-hl
highlights your Typst code using ANSI escape sequences
-
terminal-clipboard
a minimal cross-platform clipboard
-
utf8path
String-friendly path type
-
pdf-rename
This script reads a list of PDF files from a specified directory and renames each file based on its content. The renaming logic uses the content of the PDF to generate a more descriptive and meaningful filename.
-
message_segment_calculator
package to calculate SMS message segments
-
iconv-native
A lightweight text encoding converter based on platform native API or libiconv
-
betacode
conversion
-
tbll
tbll outputs data in tabular format
-
grammalecte_client
Grammalecte HTTP client
-
chord3
Create pdf songbooks from chopro source
-
cglue-bindgen
cleanup cbindgen headers for CGlue
-
rust-regex-dsl-creator
Regular expression DSL derive macros
-
mtc-token-healing
Token healing implementation
-
regexy
lightweight Rust library for working with regular expressions. The regexy crate provides an easy-to-use interface for matching patterns in strings using regex
-
text-tokenizer
Custom text tokenizer
-
tree-sitter-stack-graphs
Create stack graphs using tree-sitter parsers
-
loc
Count lines of code (cloc) fast
-
mdbook-plantuml
A preprocessor for mdbook which will convert plantuml code blocks into inline SVG diagrams
-
ruby_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
oxcomm
using Google Translate on the fly
-
wiki_corpus
Extract text from Wikipedia dumps (.bz2) and convert it to JSONLines format
-
syllabize-es
Syllabize Spanish text, and much more
-
diff-man
diff utility lib
-
indicator-extractor
Extract indicators (IP, domain, email, hashes, etc.) from a string or a PDF file
-
lowcharts
draw low-resolution graphs in terminal
-
html-auto-p
function like
wpautop
in Wordpress. It uses a group of regex replaces used to identify text formatted with newlines and replace double line-breaks with HTML paragraph tags. -
lcat
lolcat in rust! Full unicode support, escapes for ANSI escape sequences, hue shift in Cubehelix
-
lindera-ipadic
A Japanese morphological dictionary for IPADIC
-
zh_num
Convert ASCII numbers and zh words
-
rst_parser
a reStructuredText parser
-
tracery
Text-expansion library
-
clafrica
This application allows you to type most of the characters in the african alphabet in any text field
-
pragmatic-segmenter
Rust port of pySBD v3.1.0
-
surt-rs
Sort-friendly URI Reordering Transform (SURT)
-
chisel-decoders
Chisel UTF-8 and ASCII byte stream decoder implementations
-
mdbook-kroki-preprocessor
render kroki diagrams from files or code blocks in mdbook
-
tergo-formatter
Formatter for tergo
-
clarifai_grpc
The official Clarifai gRPC Rust client
-
md-ulb-pwrap
Markdown paragraph wrapper using Unicode Line Breaking Algorithm
-
transportations_library
Knowledge stack for transportation
-
rslint_errors
Pretty error reporting library based on codespan-reporting built for the RSLint project
-
indent_write
Write adapters to add line indentation
-
llmvm-codeassist
A LLM-powered code assistant that automatically retrieves context (i.e. type definitions) from a Language Server Protocol server.
-
ferret
A trigram-based tool for detecting similarity in groups of text documents or program code
-
abbreviation_extractor
extracting abbreviations from text
-
ucf
A universal code formatter
-
unicode-intervals
Search for Unicode code points intervals by including/excluding categories, ranges, and custom characters sets
-
unicode-display-width
Unicode 15.1.0 compliant utility for determining the number of columns required to display an arbitrary string
-
dicexp
A Dice Expression Interpreter program and library for parsing (and rolling) role-playing game style dice notations (e.g. "2d8+5")
-
speech-synthesis
Common traits for speech synthesis
-
cesu8
Convert to and from CESU-8 encoding (similar to UTF-8)
-
datatroll
a robust and user-friendly Rust library for efficiently loading, manipulating, and exporting data stored in CSV files
-
url_encor
A lightweight library to encode and decode special characters in urls
-
mdbook_rash
Binary to create doc from
rash
code -
markov
A generic markov chain implementation in Rust
-
tfidf-text-summarizer
extractive text summarization system which uses TF-IDF scores of words present in the text to rank sentences and generate a summary
-
svgbob_cli
Transform your ascii diagrams into happy little SVG
-
poriborton
Interconversion between Unicode and various Bengali ANSI encodings
-
mdbook-cat-prep
a preprocessor for mdbook which provides teacher, subject, material and tag functionality
-
mdbook-curly-quotes
mdBook preprocessor that replaces straight quotes with curlyquotes, except within code blocks or code spans
-
streampager
pager for command output or large files
-
mdbook-pagetoc
A mdbook plugin that provides a table of contents for each page
-
whichlicense_detection
detect licenses used by the WhichLicense project
-
scanlex
lexical scanner for parsing text into tokens
-
palmdoc-compression
Fast & safe implementation of PalmDoc/MOBI/AZW/Kindle flavored LZ77
-
as3_parser
ActionScript 3 parser
-
portmanteau
create portmanteaux
-
simstring_rust
A native Rust implementation of the SimString algorithm
-
metatron
core library
-
eternaltwin_constants
Hammerfest store implementation
-
mdbook-hints
mdBook preprocessor to add hover hints to your book
-
utf16_lit
macro_rules to make utf-16 literals
-
kataru
YAML-based dialogue engine for game development
-
pulldown-cmark-mdcat
Render pulldown-cmark events to TTY
-
yozuk
Chatbot for Programmers
-
advanced_string_generator
A command-line tool for generating strings based on customizable regex patterns
-
egg-mode-text
Text parsing for Twitter: character counting, hashtag/mention extraction
-
sapling-renderdag
Render a DAG into ASCII or Unicode text
-
pulldown-html-ext
Extended HTML rendering capabilities for pulldown-cmark
-
uo_rst_parser
fork of rst_parser with fixes for upstream-ontologist
-
lemmeknow
Identify any mysterious text or analyze strings from a file
-
sbert
Sentence Bert (SBert)
-
csvtool
deal with CSV files
-
tower_allowed_hosts
tower allowed hosts layer
-
mdbook_header_footer
mdBook preprocessor to prepend header and append footer to certain chapters
-
jposta
A fast and intuitive Terminal User Interface (TUI) tool for searching Japanese postal codes and addresses
-
malachi
A domain specific pattern matching language made for defining bot commands
-
escrit
learning languages by reading texts
-
address_book
Інструмент командного рядка для парсингу телефонних номерів, ідентифікаторів, дат та неправильних…
-
text_lines
Information about lines of text in a string
-
b2c2-casl2
b2c2におけるCASL2の雑なパーサー?
-
substring-replace
developer-friendly methods to manipulate strings with character indices
-
slicestring
slicing Strings
-
repub-rs
binary for converting mhtml webpages into remarkable-style summarized epubs
-
google-fonts
Download and cache TTF fonts from Google
-
slugify
Macro for flexible slug generation
-
vectorscan-rs
Ergonomic bindings to the Vectorscan high-performance regex library
-
csml_interpreter
The CSML Interpreter is the official interpreter for the CSML programming language, a DSL designed to make it extremely easy to create rich and powerful chatbots
-
easy_random
Generate random data easily with easy_random :)
-
midstring
Create a string between two other strings, that is lexicographically halfway between them
-
mdbook-yml-header
mdBook preprocessor for removing yml header
-
mdbook-keeper
An improved testing experience for mdbook
-
aki-mcolor
mark up text with color
-
dekor
styling and character repository in Rust
-
roman_numerals_fn
A function to convert integers to their roman numeral representation as strings. Values from 1 to 3999 are possible, otherwise it returns an OutOfRangeError. Zero has no representation in roman numerals.
-
simple-string-patterns
Makes it easier to match, split and extract strings in Rust without regular expressions. The parallel string-patterns crate provides extensions to work with regular expressions via the Regex library
-
srake
RAKE Algoritm
-
crowbook
Render a Markdown book in HTML, PDF or Epub
-
poppler
Wrapper for the GPL-licensed Poppler PDF rendering library
-
casespaetzle
This package introduces common identifier case traits and a macro for case trait generation
-
harfbuzz
Rust bindings to the HarfBuzz text shaping engine
-
unicode-width-16
Determine displayed width of
char
andstr
types according to Unicode Standard Annex #11 rules -
ik-rs
chinese segment, ik-analyzer for rust
-
sixbit
Small packed strings
-
matcher_py
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
plagiarismbasic_lib
Basic plagiarism checker written in Rust
-
mdbook-preprocessor-utils
writing mdBook preprocessors
-
fluxcap
time expression parsing inspired on duckling
-
image-to-ascii
Converts images and gifs to ascii art
-
man
Generate structured man pages
-
hydroper_source_text
Source text containing line locations
-
bobo_html_parser
parser of html markdown
-
ultra-nlp
A NLP library
-
asciidork-eval
Asciidork eval
-
mitex-parser
Parser for MiTeX
-
text_trees
textual output for tree-like structures
-
mdbook-linkcheck
A backend for
mdbook
which will check your links for you -
htmd
A turndown.js inspired HTML to Markdown converter
-
typo_checker
Searches for typos of English words and returns matches or similar words.(英単語のタイポを検索し、合致もしくは類似する単語を返却します)
-
tzgrep
grep tar.gz
-
tiny_pretty
Tiny implementation of Wadler-style pretty printer
-
pdf_composer
Create PDF documents from YAML Front Matter source documents
-
term_grid
formatting strings into a grid layout
-
angr
analyse ngrams in text files
-
markov-text
creating a small markov model for text generation
-
convert_string
A trait to convert Strings to safe non-keywords and/or convert a Strings case (snake_case, PascalCase, ...)
-
kindle2cbz
extracting images from kindle books in MOBI format to CBZ archives
-
libmistry
management system written in Rust
-
reason-shell
Reason: A Shell for Research Papers
-
inflector-plus
Adds String based inflections for Rust. Snake, kebab, camel, word, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
encoding-next
Character encoding support for Rust
-
readability
Port of arc90's readability project to rust
-
supply-chain-trust-example-crate-000066
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
cow-rewrite
Rewrite copy-on-write types copying only when it's neccessary
-
mdbook-rustviz
An mdbook preprocessor that allows users to embed RustViz visualizations into mdbook projects
-
markdown2pdf
Create PDF with Markdown files (a md to pdf transpiler)
-
gspell
Rust bindings for gspell
-
verify_string
verifying strings
-
pray
A tui tool for preparing a prompt to the llms
-
libcoerced
serde_json and generic text content matching, verification and coercion
-
hl2-lib
highlighting streams of text via regex
-
thesaurus
An offline thesaurus library for Rust
-
rust_iso15924
ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts (a "set of graphic characters used for the written form of one or more languages")…
-
minigrep_xiaoai
一个简单的命令行工具,用于在文件中搜索字符串。
-
glk
Bindings for the Glk I/O interface for hosting interactive fiction interpreters
-
gaze
small utility library with the goal of making it easier to scan/lex text and collections
-
wikidot-normalize
provide Wikidot-compatible string normalization
-
promptkit_rs
LLM structured prompting library
-
simple_peg
A command-line peg parser implemented by Rust
-
tu
CLI tool to convert a natural language date/time string to UTC
-
cogni
Unix native interface for LLMs
-
snailquote
Escape and unescape strings with shell-inspired quoting
-
owoify_rs
port of mohan-cao's owoify-js is released under MIT license, which is also the same license of owoify-js. This simple library will turn any string into nonsensical babyspeak similar…
-
rust-tfidf
calculate TF-IDF (Term Frequency - Inverse Document Frequency) for generic documents
-
mathemascii
AsciiMath parsing and rendering
-
textwrap-macros
procedural macros to use textwrap utilities at compile time
-
quickner-core
A fast and simple NER tool
-
libanubhav
management system written in Rust
-
mdbook-tabs
mdBook plugin for rendering content in tabs
-
stego_wps
text-based steganography using the Words Per Sentence (WPS) method. It allows encoding and decoding hidden messages within the structure of a given text.
-
bubble-bath
Small and quick HTML sanitizer
-
fuzzy-string-distance
Fuzzy string distance comparisons
-
regex_generate
Use regular expressions to generate text
-
rust_readability
A package to assess the complexity of texts using a variety of readability formulas
-
ragegun
Performs lexica based analysis on text (i.e. age, gender, PERMA, OCEAN personality traits, ..)
-
text-parsing
Hierarchical text processing preserving char position info
-
linurgy
Manipulate the output of multiple newlines. Replace/Insert/Append newlines with text. Input and output from stdio/files/buffers
-
env2toml
Convert env vars to toml text
-
rofi-mode
High-level library for creating Rofi plugins and custom modes
-
ewts-cli
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (cli)
-
tabprinter
creating and printing formatted tables in the terminal. It supports various table styles and offers both color and non-color output options.
-
easy-regex
Make long regular expressions like pseudocodes
-
case
A set of letter case string helpers
-
escape_string
Efficiently parse backslash-escaped strings
-
csmlinterpreter
The CSML (Conversational Standard Meta Language) is a Domain-Specific Language developed for creating conversational experiences easily
-
auto-regex
Automagically finds a regex that best matches an example and a sample list
-
vndb_tags_get
convert VNDB tag list (JSON to markdown)
-
giff
Visualizes the differences between the current HEAD and a specified branch in a git repository using a formatted table output in your terminal. The differences are displayed with color-coded…
-
quake_text
Utils for Quake strings and characters
-
r-matrix
Rust port of cmatrix
-
latinrs
Rust package that transliterates text and characters into thier latin form
-
notalawyer
display license notices
-
ut1_blocklist
UT1 blocklist URL/domain filters
-
huski-auxies
Auxiliary methods for huski implementation
-
jawohl
Complete JSON documents from a stream of JSON tokens
-
opentalk-types-signaling-whiteboard
Signaling types the OpenTalk whiteboard module
-
formatjson
Formats JSON files
-
shopping-parser
A Rust-based parser for parsing structured product information and shopping lists, supporting multiple currencies and units
-
justcsv
CSV-file reader/writer
-
merge_pdf
Merge PDF files in a directory
-
str-utils
some traits to extend types which implement
AsRef<[u8]>
orAsRef<str>
-
argentum_user_business
Business layer of user component
-
minigrep_baolhq
Just getting started with Rust, enjoying it so far 😇
-
anslatortray
translate from English to Pig Latin!
-
hmd
Custom Markdown Engine for my personal blog
-
sanitize-filename-reader-friendly
A filename sanitizer aiming to produce reader friendly filenames
-
kanbanrs
A basic kanban task manager written with egui
-
strinject
Inject text from somewhere else into given text
-
lisho
personal link shortener with no external dependencies in under 200 lines of Rust
-
worcher
full-text search for static websites
-
tnipv-lint
lints for tnipv, the Telcoin Network Improvement Proposal validator
-
mdbook-fs-summary
Summary generator for mdbook
-
libopenlipc-sys
Wrapper around liblipc to interact with Kindle dbus-based LIPC events
-
rasciigraph
function to plot ascii graphs
-
brzozowski
Work with regular expressions using Brzozowski derivatives. "
-
pandoc-ac
pandoc filter for converting acronym codes to LaTeX
-
film_parser
application for parsing detailed film data from raw text, including information such as title, release year, director, screenwriter, cast, genre, and runtime, into a structured Film…
-
just-enough-emojis
text to emoji cli
-
aho-corasick
Fast multiple substring searching
-
marker
finding issues in CommonMark documents
-
flowistry_ide
Information Flow in the IDE for Rust
-
cindex
CSV indexing library
-
mecab
Safe Rust wrapper for mecab a japanese language part-of-speech and morphological analyzer library
-
gregex
Regex solver utilizing NFA
-
loki_text
advanced string manipulation with pattern searching and replacement capabilities
-
textspan
Text span utility
-
lindera-ko-dic-builder
A Korean morphological dictionary builder for ko-dic
-
merge-whitespace
Procedural macros for merging whitespace in const contexts
-
character-set
High performance
set.contains(char)
-
regex-chunker
Iterate over the data in a
Read
type in a regular-expression-delimited way -
bytescolor
A versatile Rust library for colorizing strings and byte data in terminal applications using ANSI escape codes
-
cli-colors
A CLI tool for outputting text in ANSI format with features like colors, underlining, boldening, and italicizing
-
charx
A replacement for char::is_ascii*
-
mdbook-treesitter
mdBook preprocessor for html adding tree-sitter highlighting support
-
uapi-version
Compare versions according to the UAPI Version Format Specification
-
mdbook-tools
A collection of tools for mdbook
-
linkcheck2
extracting and validating links
-
codespan_preprocessed
Beautiful diagnostic reporting for M4 (or cpp) preprocessed text files
-
lodestone
A website wrapper for FFXIV's lodestone
-
litua
Read a text document, receive its tree in Lua and manipulate it before representing it as string
-
mdbook-pdf-headless_chrome
A fork from headless_chrome for publishing mdbook-pdf
-
hyperscan
bindings for Rust with Multiple Pattern and Streaming Scan
-
tengwar
Transliterate text into J.R.R. Tolkien's Tengwar.
-
gstring
String with support for Unicode graphemes
-
rpgtools
Collection of utilities to help run tabletop role-playing games
-
utf58
High-tech encoding of the Unicode space in one quibble and up to 3 bytes
-
screen_printer
displaying rectangular blocks of text to a terminal
-
dedent
Procedural macro for stripping indentation from multi-line string literals
-
antex
Styled text and tree in terminal
-
owned_chars
Owned iterators with the same output as Chars and CharIndices
-
somedoc
A very simple document model and markup generator
-
avatarsay
Beautiful quotes from Avatar: The Last Airbender
-
docket
markdown to HTML documentation rendering
-
hns
Human numeric sorting program — does what
sort -h
is supposed to do! -
wimbd
A CLI for inspecting and analyzing large text datasets
-
fimdoc
Firendship is Magic Document, converts Markdown into FIMFiction BBCode
-
selmr
Package to create and use Simple Explainable Language Multiset Representations
-
ultron
A web base text-editor with strong focus on displaying multi-width characters correctly
-
rsnltk
Rust-based Natural Language Toolkit
-
mudder
Generate lexicographically-spaced strings between two strings from pre-defined alphabets
-
twas
A text substitution application for using random look-up tables to generate text in a manner similar to the Mad Libs game
-
botanical-latin
Decliner / conjugator / inflector for classical / botanical Latin
-
unflow
DSL to convert design to code
-
visa
(Virtual Instrument Software Architecture) Wrapper which allows concurrent and dynamic switching between visa implementations from different vendors
-
markdown-extract-cli
Extract sections of a markdown file with a regular expression
-
markov_str
Markov Chain implementation optimized for text generation
-
utf8-io
Traits and types for UTF-8 I/O
-
wildcard_ex
extended wildcards that allows VB-like specifications
-
notedown_ast
Notedown Abstract Syntax Tree
-
zhconv-cli
Convert Traditional/Simplified Chinese and regional words of Taiwan/Hong Kong/mainland China/Singapore based on Wikipedia and OpenCC rulesets 轉換中文簡體、繁體及兩岸、新馬地區詞,基於維基百科和…
-
stringmatch
Allow the use of regular expressions or strings wherever you need string comparison
-
tgrep
Toy grep that honors .gitignore
-
readable-regex
Regex made for humans. Wrapper to build regexes in a verbose style.
-
unic-emoji-char
UNIC — Unicode Emoji — Emoji Character Properties
-
emojicon
Find Emoji by using Emoticons and GitHub's, Bengali emoji names
-
runiq
An efficient way to filter duplicate lines from input, à la uniq
-
console-utilities
handling console input and output utilities
-
indented_text_writer
IndentedTextWriter
-
bcdown
Bilibili漫画下载器,written in Rust,支持epub pdf zip格式
-
forbidden-bands
8-bit string handling library
-
royal_road_archiver
An archival program and library for the webnovel site RoyalRoad
-
indentsort
Structure-preserving sorting of arbitrary indented text
-
mdrss
generating RSS feeds from markdown files
-
mdbook-gitbook
mdBook preprocessor to properly render GitBook specific syntax
-
pdfgen
PDF rendering library
-
punycode
Functions to decode and encode Punycode
-
sesdiff
Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings in column A to the strings in column B. Also provides the edit distance (levenshtein).
-
slow-scan-print
Text is printed at fixed intervals by character or by line, and its name is inspired by SSTV (Slow Scan TV)
-
gdnative-doc
Documentation tool for gdnative
-
pullup
Convert between markup formats
-
recvmsg
Traits for receiving datagrams reliably, without truncation
-
isixhosa
help process text in isiXhosa
-
supply-chain-trust-example-crate-000037
Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.
-
ttaw
talking to a wall, a piecemeal natural language processing library
-
dnd_spellbook_maker
making pdf documents of spells that ressemble 5th edition D&D official source book spell descriptions
-
xi-rope
A generic rope data structure built on top of B-Trees
-
cheetah
An extremely fast and flexible static site generator
-
ultra
Cryptanalysis of the Engima
-
chunkr
A fast and quick chunking library for rust
-
rust-regex-dsl
Regular expression DSL
-
ansi-width
Calculate the width of a string when printed to the terminal
-
java_string
Java strings, tolerant of invalid UTF-16 encoding
-
my_parser_rrin
My parser
-
testing_table
An easy to use library for pretty print tables of Rust
struct
s andenum
s -
rs-line-count
counts number of lines(' ')
-
unicode-casing
Titlecase helper function on characters
-
squidge
shortens delimited data
-
wool
Preview Github Markdown Offline
-
uniart
A CLI tool to convert images and gifs to terminal characters
-
text_unit
Newtypes for text offsets
-
markdown_converter
html to markdown converter and flavored markdown to discord markdown converter
-
yeslogic-fontconfig-sys
Raw bindings to Fontconfig without a vendored C library
-
reword
some utility functions for human-readable formatting of words
-
text-colorizer
Transitionary package
-
make_it_braille
A CLI tool and library to manipulate unicode braille images
-
mdbook-metadata
mdBook preprocessor to parse markdown metadata
-
pukram2html
converting Pukram-formatted text to HTML
-
unveil-rs
Unveil Rs is a tool to create presentations from markdown files
-
minify-html-common
Common code and data for minify-html*
-
unicode_names2_generator
Generates the perfect-hash function used by
unicode_names2
-
utils_rust
这是一个用于各种实用功能的 Rust 库
-
dispnet-hash
Hash Library used for Dispnet
-
pho
PHOnetic alphabet renderer
-
mdbook-callouts
mdBook preprocessor to add Obsidian Flavored Markdown's Callouts to your book
-
coloredpp
Lighweight crate for coloring texts
-
nih_log
An opinionated yet flexible logger catering to the needs of the NIH-plug plugin framework
-
mdbook_fork4ls
Fork of mdBook for mdBook_LS
-
cyrconv
A funny faux cyrillic character mapper
-
kbnf-regex-automata
A forked version of regex-automata for kbnf
-
dmos-cli
Djot HTML renderer with advanced features - CLI
-
code-splitter
Split code into semantic chunks using tree-sitter
-
jira-clean
clean up Jira task description that is an output of jira-cli tool
-
loe
Very fast and yet another line ending (CRLF <-> LF) converter written in Rust
-
texcore
Create LaTeX documents using native Rust types
-
abbreviator
abbreviating long words
-
easy_io
Fast and dead-simple IO for competitive programming in Rust
-
markdown-formatter
Flavored Markdown (ZH) content formatter
-
hexstring
handling hexadecimal string
-
ungoliant
The pipeline for the OSCAR corpus
-
sms_splitter
An SMS message splitter and part calculator with support for GSM and Unicode
-
line-numbers
Find line numbers in strings by byte offsets, quickly
-
mago-span
Tracks exact character positions in source code, making it easier to report precise errors, warnings, and other diagnostics
-
document_tree
reStructuredText’s DocumentTree representation
-
align
aligning text
-
tagsearch
Filter plaintext files based on @keyword tags
-
cellumina
configure and run cellular automata
-
minigrep_elijahkx
MiniGrep is a Rust-based command-line tool, with a (current) size of 588KB that lets users search files for a given query string and shows matching lines with their line numbers
-
bigstr
A command-line tool to make string BIG
-
mrdocument
Automatic PDF transcription and classification via OpenAI
-
usage-lib
working with usage specs
-
qsv_vader_sentiment_analysis
Bindings for Rust from the original Python VaderSentiment analysis tool. Forked for use with qsv.
-
lister-cli
Lister: Navigate Markdown Lists
-
recase
Changes the convention case of input text
-
cosmic-text
Pure Rust multi-line text handling
-
mdbook-svgbob2
Alternative mdbook preprocessor for svgbob
-
galm
pattern matching library
-
imperative
Check for imperative mood in text
-
markdown-gen
generating Markdown files
-
p4d-mdproof
Markdown to PDF converter
-
quickmd
Quickly preview a markdown file
-
fast_aug
Fast data augmentation for text
-
cargo-markdown
Local crates.io readme development server with ultra-fast hot reloading goodness
-
rusty-x
A snippet manager in Rust
-
bfom-lib
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
adobe-cmap-parser
parse Adobe CMap files
-
marktask
A CLI tool for parsing and manipulating Markdown tasks
-
loading
Show loading in terminal
-
mdbook-davids_cooking
A preprocesor for whatever https://davidsotomarchena.gitlab.io/davids-cooking/ needs
-
tokengrams
Compute n-gram statistics and model language over pre-tokenized text corpora used to train large language models
-
tectonic_engine_xetex
The XeTeX engine as a reusable crate
-
kashida
Insert Kashidas/Tatweel into Arabic text, e.g. for justification purposes.
-
untanglr
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies
-
prototty_render
Traits for defining a renderer and a view
-
lucide-yew
Yew port of Lucide
-
twitter-text
in Rust
-
c6o-obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
ucfirst
Uppercase the first letter of a string
-
pulldown_mdbook
A pull parser for mdBook
-
ascii-hangman-backend
customizable Hangman game with ASCII-art rewarding for children (backend)
-
udp-logger-rs
Log macro for log's kv-unstable backend and a UDP socket logger
-
advent-ocr
Converts ASCII-art representations of letters generated by Advent of Code puzzles into a String containing those letters
-
bk-tree
A Rust BK-tree implementation
-
yinzhe9
喵喵隐者9
-
rep-grep
wgrep/write-grep CLI
-
roxy_markdown_parser
Roxy plugin for parsing Markdown
-
mdbook-force-relative-links
An
mdbook
pre-processor to transform all local links to relative ones -
markdown-toc
Markdown Table of Contents generator
-
scatternotes
A cli application to manage unstructured notes
-
flashtext2
The FlashText algorithm implemented in Rust
-
dtex
Better TeX
-
godotdoc
documentation generator for GdScript
-
zspell-cli
Command line interface for the ZSpell spellchecking library
-
blitztext
fast keyword extraction and replacement in strings
-
cesu8-str
CESU-8 and Java CESU-8 string validation and manipulation
-
spacey
a tiny yet performant whitespace vmm/interpreter
-
neuralassimilator
fine-tuning Language Learning Models (LLMs) from unstructured text
-
subject-classifier
classifying a commit by it's subject
-
sapling-streampager
streampager is a pager for command output or large files
-
bstr
A string type that is not required to be valid UTF-8
-
onig_sys
onig_sys
crate contains raw rust bindings to the oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma… -
to-binary
Converts From Hex, UTF-8, Vector of Bytes, or a Byte Slice to a Binary String And Implements Many Useful Functions For Binary Strings
-
prolangkit
A set of language-agnostic utilities for langdev
-
Person_Parser
Rust parser
-
match-pinyin-with-hanzi
Checks whether the sentence in Chinese characters (汉字) matches with the sentence in pinyin (拼音). Erhua is supported.
-
tangler
Extracts code blocks from Markdown documents
-
vec-string-to-static-str
providing utilities for converting vectors of
String
s into vectors of&'static str
-
path2regex
Express style path to RegExp utility
-
qm-role-build
Role builder from markdown tables
-
detect-newline-style
Determine a string's preferred newline character
-
text-transliterate
transliterate texts using the SO iconv from POSIX
-
hebrew
alephbet primatives and parsing library for rust
-
levenshtein_lite
No-frills implementation of a Levenshtein Automata and the Levenshtein Distance function
-
text-scanner
A UTF-8 char-oriented, zero-copy, text and code scanning library
-
slugify-rs
generate slugs from strings
-
bilingual
A cmdline tool used for markdown translation via calling Chinese translation api cloud services
-
hebrew_unicode_utils
Some functions for processing Hebrew unicode characters
-
rust-texas
generate latex documents
-
etch
Not just a text formatter, don't mark it down, etch it
-
haoxue-dict
Chinese dictionary and word segmenter
-
campfire
A tiny static site generator, greatly inspired by Zola
-
story-dl
Story web scraping
-
founder
The package provides a font toolbox
-
character_frequency
counting character frequencies in a string concurrently
-
pdfcr
render a codebase to a pdf
-
rsmorphy
Morphological analyzer / inflection engine for Russian and Ukrainian (soon) languages (WIP)
-
pomsky-bin
Compile pomsky expressions, a new regular expression language
-
find_unicode
Find Unicode characters, the easy way!
-
lingua-english-language-model
The English language model for Lingua, an accurate natural language detection library
-
elden-ring-saver
ansi2
-
wordmarkov
but flexible Markov chain library, specifically for text sentences, which handles punctuation and whitespace
-
hyphenation_commons
Proemial code for the
hyphenation
library -
neo-mime
Strongly Typed Mimes
-
wantora
wantora工具
-
markovish
Markov chain implementation for text generation
-
mle
The markup link extractor (mle) extracts links from markup files (Markdown and HTML)
-
analyse-json
CLI tool for inspecting (Newline Delimited) NDJSON or JSON to understand the contents
-
lindera-unidic
A Japanese morphological dictionary for UniDic
-
kspconfigtool
KSP1 ConfigNode parser and block removal tool
-
harfbuzz-sys
Rust bindings to the HarfBuzz text shaping engine
-
pi_ucd
unicode字符函数,获得字符的语言区间段;及根据文字排版的需要,判断字符是否为单字字符或字母字符
-
todo-to-issue
CLI tool that converts forgotten TODO comments into actionable GitHub issues
-
searcher_txt
A copy of grep that i mate to train my rust skills
-
sttx
belt for transforming speech-to-text data
-
sayit
String replacements using regex
-
ngrammatic
Character-oriented ngram generator and fuzzy matching library
-
khat
A cat clone, nothing more nothing less
-
crate_file_dyzs_16
program to read a CSV file and write a JSON file
-
letter-sequence
A method to create sequence displayed as uppercase or lower letters, or digits
-
faster-chars-count
counting length of chars faster than Chars::count()
-
censor
text profanity filter
-
wcount
CLI word counting tool
-
regex-split
split_inclusive for the regex crate
-
reddish
A Rust utility library
-
opentalk-types-signaling-meeting-notes
Signaling types the OpenTalk meeting-notes module
-
ufofmt
A fast, flexible UFO source file formatter based on the Norad library
-
ngram
Iterator adaptors for n-grams and k-skip-n-grams
-
mathematica-notebook-filter
mathematica-notebook-filter
parses Mathematica notebook files and strips them of superfluous information so that they can be committed into version control systems more easily -
ru-html-extractor
A universal web page main content extractor based on line block density distribution
-
tet_rs
A third-party implementation of Text Entry Throughput (ref. https://doi.org/10.1145/3290605.3300866) for Rust
-
mdbook-najan
Preprocessor for the Najan mdBook
-
djotters
Djot (Markdown) parser that runs at hyper speeds!
-
synox
Program synthesis of string transformations from input-output examples
-
lexmatch
lexicon matching tool that, given a lexicon of words or phrases, identifies all matches in a given target text. Uses suffix arrays.
-
runestr-pancjkv
rune
-based Pan-CJKV support -
notegraf
Core library for building a graph-oriented notebook
-
ranting
Linguistic formatting placeholder extensions for rust
-
ucd
Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…
-
CFDTCP
Short text clustering for determining center points and correlation judgment [one-hot,Jenks Natural Breaks]
-
mdbook-scientific
Enables inline equations for mdbook to set by $..$ signs and $$..$$
-
unescape
Unescapes strings with escape sequences written out as literal characters
-
mdbook-svgdx
mdbook preprocessor to convert svgdx fenced code blocks into inline SVG images
-
jp_utils
Utils for working with Japanese text
-
display_bytes
Human-readable display of byte sequences
-
alass-ffi
FFI bindings for subtitle synchronization with alass
-
rascii_art
Advanced ASCII Art Generator
-
bge
Rust interface for BGE Small English Embedding Library
-
uulm-sopra23-group03-core
Core library for implementing Battle of the Centerländ
-
akai
Next-generation, functional and simple ID system
-
ryaspeller
lib for searching typos in text, files and websites
-
abjad
Calculate the numerical abjad value of Arabic-script text
-
gematria_rs
Gematria, a traditional Hebrew numerology system
-
boss
Baseball Open Source Software
-
grammateus
facilitate working with Ancient Greek words
-
testcall
companinon crate to bintest, implements test facilities
-
aqkanji2koe
wrapper for Rust
-
pulldown_typst
A pull parser for Typst markup
-
jpreprocess-njd
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
guarding
guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.
-
fast-str
A flexible, easy-to-use, immutable, efficient
String
replacement for Rust -
xpanda
Unix shell-like parameter expansion/variable substitution
-
sluggify
slug or clean url generator for rust. With default settings, you will get an hyphenized, lowercase, alphanumeric version of any string you please, with any diacritics removed, whitespace and dashes collapsed…
-
kana-converter
converter for half-width/full-width Japanese language characters (katakana, hiragana, and ASCII)
-
aki-mline
match line, regex text filter like a grep of linux command
-
unicode_categories_zheyuye
Query Unicode category membership for chars
-
unicode-vo
Unicode vertical orientation detection
-
deepl-rustls
DeepL API - RusTLS Fork
-
ob
A Blog and RSS system written in Rust
-
commonregex
Rust port for CommonRegex. Find all times, dates, links, phone numbers, emails, ip addresses, prices, hex colors, and credit card numbers in a string. We did the hard work so you don't have to.
-
langsan
sanitizing language model input and output
-
rtss
A command-line tool to annotate stdout/stderr with elapsed times
-
mitex
TeX2Typst converter
-
notmecab
tokenizing text with mecab dictionaries. Not a mecab wrapper.
-
char_reader
Safely read wild streams as chars or lines
-
valid_rust_char
A tiny library to check if a char it's valid in a rust file
-
castle_shared_parser
Castle Shared Parser
-
mdbook-check-missing-md
A backend for
mdbook
which will find Markdowns you forgot on SUMMARY.md -
emoji
Every emoji, their metadata, and localized annotations
-
gbx
GBX (Grundbuch-Exchange) Dateiformat
-
strizer
minimal and fast library for text tokenization
-
csvsc
Build processing chains for CSV files
-
mdbook-tocjs
A mdbook preprocessor which adds extra js and css file for ToC hydration
-
regexgrep
ripgrep tool that suports regular expressions
-
refac
Transform some text given a generic natural language prompt
-
ftrace
trace files and paths
-
strcursor
string cursor type for seeking through a string whilst respecting grapheme cluster and code point boundaries
-
plsfix
Text cleaner upper
-
dequote
Remove nested quotes around text
-
mdzk
Plain text Zettelkasten based on mdBook
-
iver
-
emoji_converter
Converts text to emojis
-
matcher_c
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
bzzz
Bee-Speak Translator
-
rl-sys
Native bindings to libreadline
-
explore
terminal file explorer
-
character_text_splitter
splitting text into chunks with overlap, designed for handling large amounts of text efficiently. Implementation is identical to langchain's CharacterTextSplitter
-
sortuniq
Find or count unique values in an input stream
-
lang-unicodes
Effortlessly convert and validate text across various Unicode ranges, supporting multiple languages for seamless internationalization in your applications
-
umlauts
text transformation of german umlauts
-
spellabet
Convert characters into spelling alphabet code words
-
cfasttext-sys
fastText ffi binding
-
grep-reader
short text for crates.io
-
pulldown-cmark-escape
An escape library for HTML created in the pulldown-cmark project
-
mdbook-external-links
Open external links inside your mdBooks in a different tab
-
mdbook-mdinclude
Preprocessor for mdBook that better handles including markdown files
-
sedregex
Sed-like regex library
-
invisible_unicode
finding invisible unicode characters
-
lingua-french-language-model
The French language model for Lingua, an accurate natural language detection library
-
swot
community-driven or crowdsourced library for verifying that domain names and email addresses are tied to a legitimate university of college
-
xconv
A high-performance batch file encoding conversion tool
-
veg
Flexible tables
-
rustrings
Strings manipulation for Rust
-
hfs_nfd
Handle Apple's unique NFD-like Unicode normalization, which is used in HFS+, in Rust
-
bitflip
functions to generate bitflips of binary and UTF-8 strings
-
mdbook-indexing
mdbook preprocessor for index generation
-
split-every
Split for every n occurrences of a pattern iteratively!
-
html_to_epub
A command line converts .html file to .epub file
-
fast_symspell
Spelling correction & Fuzzy search
-
slicedisplay
Simplistic Display implementation for Vecs and slices
-
codes-iana-charset
This package contains an implementation of the IANA Character Set registry
-
streamparser
Adapt simple stream parsers for different types of streams
-
texting
string helpers
-
bmatcher
flexible and efficient binary pattern matching library designed to help you search and match binary data
-
enum-ts
TypeScript Enum pattern matcher codegen
-
common_substrings
Finding all common strings
-
charclass
define and modify unicode character classes
-
redpatterns
a list of patterns for scanners 📟
-
uwu-rs
uwuifying library
-
normalize-hebrew-rs
package that normalizes special symbols within Hebrew string used in the Qumran-Digital project
-
idna-cli
Encode/decode Unicode domain names to/from IDNA ASCII
-
solstice_glyph
A fast text renderer for solstice, powered by glyph_brush
-
divvunspell-bin
Spellchecker for ZHFST/BHFST spellers, with case handling and tokenization support
-
stardict
Rust implement stardict
-
in_rainbows_printer
Prints some In Rainbows-style (the Radiohead album) text to your termial
-
ellipse
Truncate and ellipse strings in a human-friendly way
-
encoded-words
Encoded Words for usage in MIME headers
-
caser
Change text between PascalCase, camelCase, and snake_case
-
mdbook-typst-math
An mdbook preprocessor to use typst to render math
-
irg-kvariants
wrapper around kvariant from hfhchan/irg
-
mdbook-image-size
A mdbook preprocessor which support image size syntax
-
ncase
Enforce a case style
-
gimme
Pull useful data out of your clipboard
-
mdbook-open-git-repo
mdbook preprocessor to add a open-on-git-repo link on every page
-
sauropod
Stellular Shared Resources
-
mdbook-quiz-schema
Schema for quizzes used in mdbook-quiz
-
ucd-raw
Uninterpreted access to the unicode UCD
-
html-to-pulldown-cmark-events
Parse HTML to pulldown-cmark's events
-
ferrissay
cowsay
-
pdf2pwg
Single purpose A4 page renderer rendering PDF using pdfium to PWG/URF
-
compute-file-server-cli
Uploads files to Fastly for serving directly from within Fastly Compute applications. Upload any type of file: images, text, video etc and serve directly from Fastly. It is ideal for…
-
newslookout
A web scraping platform built for news scanning, using LLMs for text processing, powered by Rust
-
runiq-lib
An efficient way to filter duplicate lines from input, à la uniq
-
unindenter
unindent text
-
asimov-core
ASIMOV Software Development Kit (SDK) for Rust
-
hello_lib
Demonstrate Generics Function
-
snake_case_converter
convert strings to snake case
-
zhlint
A linting tool for Chinese text content
-
cautious-octo-funicular
Test: shipping an mdbook with API docs
-
strfry
randomize a string
-
jcalendar
Japanese Calendar for Rust
-
ruSTLa
A reStructuredText → LarST ⊂ LaTeX transpiler
-
anon-csv-cli
anonymise CSV files, providing various options to substitute real data with plausable fake data
-
indentasy
Indent like a party
-
tantivy-czech-stemmer
Czech stemmer as Tantivy tokenizer
-
asciimath-text-renderer
Render asciimath in terminal
-
spongebob
convert text to spongebob case a.k.a tHe MoCkInG sPoNgEbOb MeMe
-
bitranslit
Bi-directional transliterator for Rust. Transliterates strings according to the rules specified in the language packs.
-
nmd-core
Official NMD compiler core
-
ab-radix-trie
A compressed radix trie implementation supporting matching rules
-
yeslogic-unicode-script
Fast lookup of the Unicode Script property
-
unicode-normalization-alignments
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
mdbook-preprocessor-boilerplate
Boilerplate code for mdbook preprocessors
-
br-pdf
PDF Invoice Processing
-
qpprint
console printing/formatting
-
repgrep
An interactive command line replacer for
ripgrep
-
goodname
assist you with cool naming of your methods and software
-
mdtranslation
prepare multi-lingual Markdown documents
-
boreal-cli
CLI utility to run boreal, a YARA rules engine
-
uwubot
discord bot for uwuifying text
-
mask-text
mask text with multiple masking options
-
rust_stringz
string manipulation library for Rust that you should probably don't need. It's just for learning purposes.
-
naromat
Convert text to narou novel format
-
rupantor
A Bengali Phonetic Parser which is very flexible and supports Avro Phonetic
-
cowonverter
owo-ify text files. provides a simple function for this purpose.
-
markdown-extract
Extract sections of a markdown file
-
mdbook_incremental_preview
Incrementally preview mdBook, patching changes instead of making full rebuilds
-
g2-unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
rulet
figlet implementation
-
trie-match
Fast match macro
-
remove-markdown-links
Turns
[example](https://example.com)
intoexample
. That’s it -
mdbook-skill-tree
mdbook plugin to show roadmaps
-
text_layout
Text layout algorithms
-
json-peek
Amature JSON parser library designed for my specific need
-
stylish-core
The semver-stable subset of
stylish
-
wiki_corpus_grammar
Extract text from Wikipedia dumps (.bz2) and convert it to JSONLines format
-
no-comment
Remove rust-style line and block comments from a char iterator
-
tpt
Pure Rust implementation of the Unix concatenate (cat), word-count (wc) and echo command
-
rust_baht_text
Convert number to Thai Baht text
-
mdbook-last-changed
mdbook preprocessor to add the last modification date per page
-
wordpieces
Split tokens into word pieces
-
glifnames
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
tiniestsegmenter
Compact Japanese segmenter
-
text-diff
text diffing and assertion library
-
spf
.spf (Simple Pixel Font) file parsing, and useful api's to go alongside
-
yara-x-parser
A parsing library for YARA rules
-
tabwriter
Elastic tabstops
-
aki-unbody
output first or last n lines, like a head and tail of linux command
-
clippers
Cross-platform clipboard management library
-
rs-wordcloud
☁ Flexible WordCloud implementation for ⚙Rust
-
sejong
Buffer is a buffer that can receive ASCII bytes different from keyboard and send out UTF-32 Hangul string. This buffer allows deletion by Jamo.
-
markov_strings
A simplistic Markov chain text generator
-
uwu_cli
uwuifying the terminal
-
azusa
String index transformer for Rust utf8 to JavaScript utf16
-
mdbook-typst-pdf
mdbook typst pdf backend
-
vape
full width aesthetics
-
wtf8-rs
WTF-8 encoding
-
gecliht
A disparate collection of text manipulation and formatting algorithms
-
minigrepsada
Test
-
charwise
This lightweight, dependency-free rust library provides a convenient way to read characters from different resources
-
event_parser
Rust NLP library for parsing English natural language into icalendar events
-
skyspell_kak
skyspell - kakoune integration
-
bureau
extracting dependency licenses, used by nertsio
-
rcut
replacement for GNU cut that supports UTF-8
-
wordshk_tools
A combination of parsers and other tools for words.hk (粵典)
-
nfa_regex
NFA regex engine for text processing
-
confusables
around Unicode confusables/homoglyphs
-
asciidork-opts
Asciidork opts
-
macro_colors
colorful printing macros
-
strip-tags
Strip HTML and PHP tags from strings
-
encoding_c
C API for encoding_rs
-
enso-lazy-reader
An efficient buffered reader
-
mdbook-collector
A mdbook preprocessor which collects json/yaml/toml data from each post and builds a json file of it
-
mdbook-svgbob
SvgBob mdbook preprocessor which swaps code-blocks with neat SVG
-
whitespace-conf
Key-value configuration file delimited with whitespaces
-
kirjat-rs
prices for finnish textbooks from multiple stores
-
snekdown
A parser for the custom snekdown markdown syntax
-
b2c2-flag
b2c2のコマンドライン引数を保持するだけのやつ
-
grace-cli
CLI tool for processing files and strings
-
looking-glass
reflection & type-erasure library for Rust
-
monkey-printer
infinite nr of monkeys you could write Shakespeare
-
aki-mcycle
mark up text with cycling color
-
help_crafter
help message generator without hussle
-
lexicmp
comparing and sorting strings lexicographically and naturally
-
uwuify
fastest text uwuifier in the west
-
parattice
Recursive paraphrase lattice generator
-
whitespace_text_steganography
A steganography strategy that uses whitespace to hide text in other text
-
bocu1
BOCU-1 compressed unicode encoding
-
txt_to_md
Command converting from a txt file to a markdown file
-
translitrs
Transliteration utility for Serbian language
-
fsays
flavored replacement for the classic cowsay
-
rustic_print
A versatile Rust library for enhancing console output. It offers a range of features to create a more engaging and informative command-line interface.
-
ps-str
String transcoding library
-
techlead
CLI is a command-line interface that enables developers to chat with an AI assistant powered by the OpenAI GPT language model, designed specifically to help with your Rust project
-
interslavic
in rust
-
igpay-atinlay
Translate text to Pig Latin
-
hi-doc
Diagnostics formatting library
-
ra_rustc_lexer
lexer used by rustc, zero stability gurantees
-
readability-rs
Port of arc90's readability project to rust
-
veryfi
Module for communicating with the Veryfi OCR API
-
swrite
Infallible alternatives to write! and writeln! for Strings
-
dictcc
Rust API for reading and querying the dict.cc offline translation database
-
lf_lint
Linter that checks that all files in <PATH> ends with a newline
-
ctrl-z
A composable reader to treat
0x1A
as an end-of-file marker -
linkcheck
extracting and validating links
-
uiuifree-normalize
uiuifree text normalize
-
boringascii
Strings that can only be constructed to contain non-whitespace, non-control ASCII characters
-
text_maid
Text maid
-
sm-search
way of searching through text - for people who are too lazy to use Regex
-
text_searcher
text searcher tool
-
wattle
tools for rust
-
charmap
one-to-(none/one/many) character mapping
-
wcrs
GNU wc in Rust
-
ripgrep
line-oriented search tool that recursively searches the current directory for a regex pattern while respecting gitignore rules. ripgrep has first class support on Windows, macOS and Linux.
-
mdbook-bibfile-referencing
An mdBook preprocessor to add bibfile referencing to each page
-
playin-cards
♠ Library for French-suited playing cards
-
string_manip_rust
Demo of managing projects
-
bnf_sampler
that uses recursive descent algorithm to ensure tokens produced by a large language model follow a Backus Naur Form schema
-
terminal_cli
A standalone library with no-std support for command line terminal interfaces. With autocomplete support, helpers for commands and properties and a prompt implementation.
-
group-similar
Group similar values based on Jaro-Winkler distance
-
mdx
in Rust
-
utf-cli
A UTF code viewer
-
slack_update
app to set Slack status, emoji and photo
-
utfx
-
unic-ucd-age
UNIC — Unicode Character Database — Age
-
esperanto-text
Convert Esperanto text between UTF-8, x-system and h-system transliterations
-
merge-whitespace-utils
Procedural macros for merging whitespace in const contexts
-
literumilo
A spell checker and morphological analyzer for Esperanto
-
stylish-stringlike
API for string-like objects that have styles applied
-
timeblok
A language for event scheduling in plain text
-
twjsrs
(De)serialize TiddlyWiki tiddlers to and from their JSON format
-
mdbook-nix-eval
mdbook preprocessor for evaluating nix expressions
-
shear
trimming excess contents from things
-
collapse
Trim and collapse consecutive whitespace to a single space
-
mutf8
Basic support for mutf8 strings
-
assert-text
the testing macro tools
-
kanjidic_types
A collection of types encompassing the variety of data about kanji available from Kanjidic
-
spongemock
CLI-tool for random text capitalization
-
afrim-memory
handle of sequential codes easier for an input method
-
buss-protocol
Bussin binary protocol
-
thesauromatic
command-line thesaurus that returns related words when given a word. The output words are one per line, making it easy to process in shell pipelines.
-
compact_str
A memory efficient string type that transparently stores strings on the stack, when possible
-
genex
Text-expansion library
-
yeslogic-unicode-blocks
Functions to access and search Unicode blocks
-
lindera-tantivy
Lindera Tokenizer for Tantivy
-
pinot
Fast, high-fidelity OpenType parser
-
rustplexity
bigram-based perplexity calculator, useful for filtering out boilerplate or other abnormal text
-
mdbook-numthm
An mdbook preprocessor for automatically numbering theorems, lemmas, etc
-
rscx-mdx
MDX support for RSCx
-
mail-internals-ng
[mail-api] _internal_ parts for the mail-api crates
-
unfurl
expanding links in text
-
peppergrep
grep utility written following the 12th chapter of the Rust book. Some little modifications were made.
-
glyphana
Quickly find, inspect & collect unicode glyps
-
md2gemtext
for converting Markdown into gemtext
-
chisel-lexers
Chisel backend lexers/scanners
-
minigre_base
text file search tool
-
argot
Parse documentation from codebases into Markdown for easy doc creation
-
ripsecrets
A command-line tool to prevent committing secret keys into your source code
-
str_overlap
Methods for finding the overlap between two string slices
-
lindera-unidic-builder
A Japanese morphological dictionary builder for UniDic
-
word-star-puzzle-solver
A CLI tool for solving VGs word star puzzle
-
weggli-ruleset
Ruleset managment/matching/reporting for weggli patterns
-
char_trie
Text segmentation based on trie tree, High performance, support for custom dictionary
-
strange
A static website generator
-
mdbook-footnote
mdbook preprocessor for footnotes
-
unicode-character-database
Unicode character database tables (Unicode Standard Annex #44) generated using
ucd-generate
-
asciir
Print ASCII table/values
-
neardup
near-duplicate matching
-
polyhorn-ios
Polyhorn for iOS
-
pdfrust
PDF parser
-
yozuk-core-skillset
Set of default Yozuk skills
-
crawdad
ChaRActer-Wise Double-Array Dictionary
-
mdict
Rust implement mdict
-
mdbook-plugin-utils
mdBook plugins
-
clippy_lints
A bunch of helpful lints to avoid common pitfalls in Rust
-
mdbook-twiki
twiki backend for mdbook
-
delay_writer
Wraps a writer and delays its output after each newline
-
runanum
Существительные с правильными окончаниями после чисел
-
text-utils
Text utils for unescaping and align
-
aprilasr-sys
Low-level FFI bindings for the april-asr C api (libaprilasr)
-
node-emoji
Convert
:emoji:
to Unicode using GitHub’s and EmojiDB’s emoji names -
pink-runtime
A very minimal replecement based DSL, intended for math
-
basic_lib_for_me
just a basic lib that i will use usually
-
lithe
A Slim template engine by using Pest
-
console-traits
Basic console traits, such as setting char position and writing strings
-
emojicons-2021
Parse :emoji: notation to unicode representation
-
product-os-content
Product OS : Content provides a complete solution for content management for the purpose of serving content via Product OS : Server
-
sapling-drawdag
Parse an ASCII DAG into graph edges
-
noodler
A port of the python-ngram project that provides fuzzy search using N-gram
-
spacemod
A easy to understand and powerful text search-and-replace tool
-
lingua-german-language-model
The German language model for Lingua, an accurate natural language detection library
-
sauron-markdown
parsing markdown into sauron node
-
humnum
Human numeric sorting program — does what
sort -h
is supposed to do! -
small-bwt
BWT construction in small space
-
uniwhat
Display the unicode characters text
-
glyph_brush_layout
Text layout for ab_glyph
-
cyrla
two-way conversion between latin and cyrillic script
-
fastchr
Faster memchr using SIMD intrinsics
-
indentation
Formatter
-
veloci_levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
arbitrator
Format text based on a set of rules and regexes
-
bibleref
Structures and functions for managing Bible references
-
strmatch
Conditionally match strings in Rust using regex without much boilerplate
-
opt2doc
generate documentation from Rust source code
-
wfst4str
Python library based on rustfst for manipulatig strings with wFSTs
-
escape-bytes
Escapes bytes that are not printable ASCII characters
-
transcript
A transcriber for European scripts
-
fancy-regex-fork-pb
A custom fork of the fancy-regex crate. You probably don't want to use this.
-
fum
fum finds fuzzy matches to a literal search pattern, searching recursively through all the files in the current directory and respecting gitignore rules
-
univstring
The Universal String(Rust String, CString, Wide(UTF-16/UTF-32) CString) trait
-
rex-regex
A pedestrian library for regular expressions: Slow, but easy to understand
-
lazy-char-iter
Lazily iterate over Unicode characters from a u8 slice in Rust
-
tectonic_bridge_flate
Exposing flate compression to the Tectonic C code
-
learning_fromdocs_minigrep
Project made on the path toward learning Rust from its docs. Completely personal and to test how to upload to crates.io.
-
ferris-finder
Unicode character finder with fuzzy search
-
recursive-file-loader
recursively load files via references in the files
-
changecase
A trait and implementation for changing the case of Strings and &str. It currently supports uppercase, lowercase, alternating case, and inverting case. Title case is in the works.
-
arabic-script
An expressive API for the characters of the Arabic script
-
zw
encoding and decoding text using zero-width characters
-
cabocha
Safe Rust wrapper for cabocha a japanese language dependency structure analyzer library
-
polyvalid
A validator for package names, namespace, username and app name on wasmer
-
pithy
Ultra-fast, spookily accurate text summarizer that works on any language
-
wcounter
Give the word and count the appearance
-
afrim-translator
Manage the predication system of the afrim input method
-
hina
:]
-
czv
performing CSV-related operations for data engineering and analysis
-
mdbook-shiftinclude
mdbook preprocessor for file inclusion with shift
-
mdbook-bash-tutorial
A mdbook preprocessor that allows embedding Bash scripts as tutorials
-
mdbook-fix-cjk-spacing
mdbook preprocess that fixes CJK line breaks
-
tinytoken
tokenizing text into words, numbers, symbols, and more, with customizable parsing options
-
korean_regex
Regex extension for Hangeul analysis
-
datadriven
Rewritable table-driven testing
-
ucd-util
A small utility library for working with the Unicode character database
-
readability-text-cleanup
cleaning up text from HTML using Mozilla's Readability algorithm
-
tectonic_xetex_layout
XeTeX's font loading and layout interface encapsulation, as a crate
-
chardet
rust version of chardet
-
kanpyo-dict
Dictionary Library for Kanpyo
-
mdbook-chapter-list
mdbook preprocessor for listing subchapters
-
unicode-ellipsis
truncate Unicode strings to a certain width, automatically adding an ellipsis if the string is too long
-
vader_sentiment
Bindings for Rust from the original Python VaderSentiment analysis tool
-
pathmut
Command line utility for extracting, removing, and replacing components from path strings
-
uchr
translate char values to the characters themselves, and vice versa
-
shutil
Shell utility helper library
-
syntastica-themes
A collection of themes for syntastica
-
shoebill
A Wadler/Leijen style pretty-printer
-
strip_markdown
remove markdown syntax from markdown files
-
mdbook-files
Preprocessor for mdbook which renders files from a directory as an interactive widget
-
amongify
A very ඞ sus ඞ program
-
rammer
A Baysian Text Classifier for Spam/Ham
-
verba
working with Latin words
-
password-characters
help with the "enter the 12th, 35th, and 63rd characters from your password" situations
-
senax-inflector
Pluralizes and singularizes English nouns. This crate is forked from https://github.com/whatisinternet/Inflector.
-
unicode_types
A mapping of all the unicode characters into convenience types (one enum per block of characters with one variant per character)
-
readwise
wrapper for the Readwise API
-
matchpick
Find and replace multi-lines using a match-case
-
grepox
Minimalist's grep written in Rust
-
mdbook-iced
An mdBook preprocessor to turn iced code blocks into interactive examples
-
utf8_reader
A UTF-8 reader that read UTF-8 characters from object that implement Read trait
-
matrix_display
display a matrix in command line with pretty unicode separators
-
mdbook-infisearch
InfiSearch plugin for Mdbook
-
any2utf8
Convert any enncoding to utf-8
-
genpdf
User-friendly PDF generator written in pure Rust
-
wz
Count words, fast
-
csv-sanity
Sanitize and transform large CSVs with millions of records quickly and efficiently
-
spacebar
An anti-plagiarism tool based on null width characters
-
skribo
low-level text layout
-
whitespace
Encode arbitrary data whitespaces and vice versa
-
extract-words
Extracts words from text without allocation
-
remake
writing maintainable regex and managing symbol soup
-
ascii-hangman-webapp
customizable Hangman game with ASCII-art rewarding for children (webapp version)
-
sc_decode
Special character decoding
-
mdbook-unlink
A mdBook backend that validates local links
-
gpt4ocr
Extract structured text from PDFs using OpenAI's GPT4o
-
whisperer
将文本编码为简短的中文字符, 防和谐
-
lindera-cc-cedict
A Japanese morphological dictionary for CC-CEDICT
-
kanpyo
Japanese Morphological Analyzer
-
boxy
Declarative builder for Unicode box-drawing characters
-
html-compare
compare html files
-
csv-groupby
execute a sql-like group-by on arbitrary text or csv files
-
autoruby-cli
CLI to easily generate furigana for various document formats
-
hunspell-rs
Rust bindings to the Hunspell library
-
pdf_composer_definitions
PDF Composer definitions crate
-
gregex-logic
Logic for the gregex crate
-
ruby-string
A string type that tracks Ruby glosses attached to parts of it
-
readput
Fast and easy stdin input parsing for competitive programming in rust
-
floem-cosmic-text
Unofficial cosmic-text fork for Floem
-
jpreprocess-jpcommon
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
yarner-block-links
A Yarner plugin that adds to each code block a list of links to all referenced and all referencing blocks
-
alpha-counter
Alphabetic counter
-
wordfreq
port of wordfreq for looking up the frequencies of words in many languages
-
ewts-c
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (c lib)
-
rex-regextract
extracts key value pairs out of text
-
unicode_escape
decoding escape sequences in strings
-
presciidoc
Preprocessing AsciiDoc for other tools
-
markdown2unicode
Converter from markdown notation to unicode characters
-
gfm-autolinks
Parse GitHub Flavored Markdown autolinks
-
word_search_solver
Solves a word search
-
scalar-value-string
Scalar Value String
-
unidok
A powerful, readable, easy-to-learn markup language
-
encoding-index-singlebyte
Index tables for various single-byte character encodings
-
matchable
Check if text is matching against string or regex in one API
-
mdbook-trunk
mdBook plugin which bundles packages using Trunk and includes them as iframes
-
meaningsearch
package that helps you find meaningful lines of any given input. Especially useful in CTFs.
-
llmvm-outsource-lib
outsource backend for llmvm
-
hxgm30-noise
Noise-generation for use in hexagram30 projects
-
inslice
A command-line utility for filtering text input by columns and rows
-
minigrepns
A mini version of famous grep application that searches texts on files
-
diffy-fork-filenames
Fork of https://docs.rs/diffy that allows specifiying filenames
-
pangu2
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
trim
trim whitespaces from files
-
fmt-extra
Extra formaters not shipped in the standard library
-
libxdiff
Rust bindings for the libxdiff C library
-
crypto-invert
Unicode Upside-Down Mapping
-
struckdown
A structured markdown / commonmark library for Rust
-
ron_to_table
pretty print RON as a table
-
mime_4
Strongly Typed Mimes
-
kincaid
A word statistics library in Rust
-
bytepiece_rs
The Bytepiece Tokenizer Implemented in Rust
-
zummi
fun lib that produces spoonerisms
-
spoilers
High-level Rust bindings for CTranslate2
-
pulldown-cmark-fork
A pull parser for CommonMark
-
marcus
An experimental Markdown parser written in Rust
-
ocrmypdf-rs
A sdk for the ocrmypdf command line tool
-
genere
randomization of text respecting grammatical gender of sentences
-
timfmt
A small utility for formatting code as Tim likes it
-
tcalc-rustyline
A fork of Rustyline for use specifically with tcalc
-
spellcheck_toy
a basic spellchecking library based on edit distance
-
termwrap
Wrap Unicode text with ANSI color codes
-
braille_pics
producing text-art pictures using Braille characters
-
trim_lines
An extremely simple and tiny library which provides an iterator over the lines of a string, trimmed of whitespace. It is a simple wrapper around the Lines iterator in std::str which trims the whitespace from each line.
-
moscato
Outline scaler for OpenType glyphs
-
indoc
Indented document literals
-
ipa-translate
translating between IPA and ASCII text
-
stringsext
find multi-byte-encoded strings in binary data
-
bottomify
Fantastic (maybe) CLI for translating between bottom and human-readable text
-
casaubon
A feature-poor static site generator
-
genkit
A common generator kit for static site generator
-
leven-distance
Compute operational differences between two sequences using the Levenshtein algorithm
-
vndb_rs
Bare-bones VNDB api
-
marko
Programmtically format text with Markdown syntax
-
google-book-scraper
downloading the contents of books hosted on books.google.com for offline viewing
-
uwildmat
wildmat/uwildmat matching
-
zawk
An efficient Awk-like language implementation by Rust with stdlib
-
textos
Texts, strings, formatting, unicode…
-
ende
encoding/decoding unicode/utf-8/utf-16(ucs-2) code points
-
ngrams
Generate n-grams from sequences
-
my_project_parser_super_puper
A brief description
-
basic-text-internals
Basic Text string literal implementation details
-
render_readme
Render Markdown or reStructuredText with syntax highlighting and image filtering similar to GitHub's
-
icu_casemapping
Unicode case mapping algorithms
-
wtf8
WTF-8 encoding. https://simonsapin.github.io/wtf-8/
-
syllable
counter for use with reading level calculations
-
serbzip
A quasi-lossless Balkanoidal meta-lingual compressor
-
owoify
Text owoification library
-
gzip-cmp
that makes a distance measurement between binary data based on the difference of the compressed data length
-
sparklet
small flashcards library
-
text_styled
Trait to style String and &str
-
sastrawi
stemming and stopword removal for Bahasa Indonesia based on PHP sastrawi project by Andy Librian
-
gen3-charset
Pokemon Generation 3 Character Set Support (GBA)
-
imatree
command line tool to generate aesthetically pleasing (and sometimes not so pleasing) images of whatever string you want
-
lindera-cc-cedict-builder
A Chinese morphological dictionary builder for CC-CEDICT
-
summertail
Summarize your tail -f output, by reporting how many lines match regexes
-
unicode_reader
Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters
-
fuzzywuzzy
A pure-Rust clone of the incredibly useful fuzzy string matching python package, FuzzyWuzzy
-
chinese-ner
A CRF based Chinese Named-entity Recognition Library written in Rust
-
uecho
The unicode of the echo command
-
mul
Bengali stemmer
-
text-template
Small template engine for use with plain text (e.g. creating text email), not intended for HTML.
-
mdbook-embed
A preprocessor that simplifies embedded URL
-
resrap
A parser library
-
pra
Print Random ASCII
-
bpmf_py
A Bopomofo and Pinyin library
-
lsp-document
Helpers to convert between LSP documents and Rust strings
-
ctf-brute
Brute-force utilities for Rust
-
irssi-sys
Automatically generated bindings to irssi
-
pomsky-macro
Macro for converting pomsky expressions to regexes
-
character-stream
Helper data structures for reading UTF-8 characters from a stream
-
bookgrep
Basic grep equivalent, minor mods to Chapter 12
-
text_to_emoji
Convert text to emoji
-
write16
A UTF-16 analog of the Write trait
-
quilltex
open-source Rust library designed to convert LaTeX documents into a Delta format that can be used with Quill.js and vice versa
-
subscript-compiler
A modern LaTeX rendition
-
highlight-pulldown
Process pulldown-cmark events to apply syntax highlighting to code blocks
-
kakasi
Romanize hiragana, katakana and kanji (Japanese text)
-
fmty
Composable
core::fmt
utilities -
b2c2-jis-x-201
UTF-8とJIS-X-201を雑に変換処理する
-
mdxbook
Fork of mdBook, with more customizations and flexibility for programmers
-
trans-case
Transform case
-
utf8-command
UTF-8 encoded
std::process::Command
output -
outerspace
Methods for prefixing and suffixing the non-whitespace characters in a string
-
fmtm_ytmimi_markdown_fmt
Fork of @ytmimi's Markdown formatter; powers FMTM
-
morc
Dead simple, minimal markdown generator library written in Rust
-
mdbook-rust
Enhanced Rust support for MDBook
-
scripter
A screenplay compiler
-
chars_data
Build-dependency for chars, the unicode character information CLI
-
askama-markdown-cmark
Askama filter for markdown, using pulldown-cmark
-
jput
puts and putc on unicode-width align for Rust
-
rahat3062_minigrep
A light-weight & minimal implementation of the grep cli app
-
base256_lib
Base256 encoder/decoder
-
hitori
Generic compile-time regular expressions
-
md-include
include any file in markdown files
-
varcon-core
Varcon-relevant data structures
-
vibrato
viterbi-based accelerated tokenizer
-
destiny_rs
An async library to interact with Bungie's destiny 2 api
-
pretok
A string pre-tokenizer for C-like syntaxes
-
md-localizer
Localize markdown with remote links
-
cur
that will hunt for your regular expression
-
halfcaps
tRaNslAtE aNy TeXt To ThIs
-
ogrep
searching in indentation-structured texts
-
byte_string
Wrapper types for outputting byte strings (b"Hello") using the Debug ({:?}) format
-
justify
plaintext while handling Unicode gracefully
-
detect-indent
Detect the indentation of code
-
kl-hyphenate
Knuth-Liang hyphenation for a variety of languages
-
translation-api-cn
Some useful structs for calling Chinese translation api cloud services. A helper tool for
bilingual
cmdline tool. -
darn
Extras for the Book on Data Analysis with Rust Notebooks
-
unic-ucd-normal
UNIC — Unicode Character Database — Normalization Properties
-
shallow
long text
-
bashdoc
generating documentation/help menu for user defined bash functions
-
vidyut-lipi
A Sanskrit transliterator
-
rsonpath-test-codegen
Blazing fast JSONPath query engine powered by SIMD. TOML-based test codegen for rsonpath-lib.
-
stardict_wrapper
Rust implement stardict
-
mdplayscript
An extension of Markdown for play scripts
-
modeling
tools to analysis different languages by Ctags
-
framework
detector for different frameworks in one projects
-
omgwtf8
Optimized-Matching-Generalized Wobbly Transformation Format — 8-bit
-
kasedenv
Read environment variables by lower, upper case or case-insensitive keys
-
caseless
Unicode caseless matching
-
aprilasr
High-level wrapper for the april-asr C api (libaprilasr) using aprilasr-sys
-
random_access_unicode
Provide random access to the unicode characters in a file
-
aki-xtee
copy standard input to each files and standard output
-
fifthtry-mdbook
fork of mdbook, only for ft-cli
-
aki-stats
output the statistics of text, like a wc of linux command
-
lindera-ipadic-builder
A Japanese morphological dictionary builder for IPADIC
-
unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
spdkit-surface
short text for crates.io
-
m_lexer
extensible regular expressions based lexer
-
cowsay
program that generates ASCII art pictures of a cow with a message
-
xhtmlchardet
Character set detection for XML and HTML
-
ezemoji
Catigoryized Emoji's
-
mpdf
writing PDF files
-
llmvm-core-lib
llmvm core application
-
crustword
Crusty Crosswords
-
is_printable
Determine whether a given text-based value is printable
-
regex-automata
Automata construction and matching using regular expressions
-
chinese_segmenter
Tokenize Chinese sentences using a dictionary-driven largest first matching approach
-
code-span
Add additional infomation to code character
-
text-to-json
Convert text to json in rust
-
mdbook-playscript
Preprocessor for mdBook, which styles stage play scripts
-
tradukisto
Kinda useful natural language translation library and utility
-
yozuk-model
NLP model generator for Yozuk
-
shq
sh quoting tool and library
-
rreplace
designed to streamline string replacements. It can handle multiple unique replacements and iterates the string only once.
-
libflagup
Display a country's flag as an emoji
-
unicode-canvas
creating text base drawing
-
text2art
translate text to ascii art
-
str-block
Remove the initial newline and common indentation from string literals
-
wildflower
Wildcard matching against strings
-
korrektor
work with Uzbek language text processing
-
gqlog
👾 filter your json logs with graphql 👾
-
catmark
Console printer for CommonMark
-
like
A SQL like style pattern matching
-
encoding-next-index-tradchinese
Index tables for traditional Chinese character encodings
-
supply-chain-trust-example-crate-000068
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
tectonic_bridge_icu
Expose a subset of the ICU Unicode APIs to Rust/Cargo
-
untex
Understand and manipulate TeX files with ease
-
allsorts_no_std
Font parser, shaping engine, and subsetter for OpenType, WOFF, and WOFF2
-
rmw-utf8
Short text compression algorithm for utf-8 (optimized for Chinese , developed based on rust programming language). 面向utf-8的短文本压缩算法(为中文压缩优化,基于rust编程语言开发)。
-
hunspell-sys
Bindings to the hunspell C API
-
mqtt-analyzer
Help analyzing MQTT messages
-
finl-charsub
Character substitution module for in-line replacement of fixed strings
-
szovegertesimutato-score
Calculate szovegertesimutato score for a given text and language
-
word_filter
A Word Filter for filtering text
-
forgiving-htmlescape
HTML entity encoding and decoding, with support for leaving malformed entities intact
-
single_source
Generate code files from snippets in md tutorial files
-
ed_join
Implemtation of Ed-Join Algorithm for string similarity join
-
litegrep
A basic tool for searching in files for lines of text, based on a query
-
moenarchbook
Creates a book from markdown files
-
mdbook-auto-gen-summary
A preprocessor and cli tool for mdbook to auto generate summary
-
pillar
small tool to format lines into columns
-
mdbook-translation
prepare multi-lingual mdBook books
-
ascii_converter
converting between different ascii representations
-
mdbook-quiz-validate
Input validation for quizzes used in mdbook-quiz
-
ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched
-
summary
Extract the sentences which best summarize a document
-
cutters
Rule based sentence segmentation library
-
cmdcjones_minigrep
A minimal grep clone from the Rust Book
-
bible-reference-rs
Extract Bible references from plain text
-
const-utf16
Utf8 to utf16 conversion functions for use in const contexts
-
mocker
transforming text with uppercase and lowercase letters, resembling the Spongebob Mock Meme
-
tergo-tokenizer
R language tokenizer
-
awabi
A morphological analyzer using mecab dictionary
-
tuilet
A textual user interface for Toilet, the ANSI-art text generator
-
case_style
Converts from and to different case styles
-
string-overlap
A helper crate for "layering" ASCII art
-
hulk
An ultra simple no hassle static site generator
-
jellybean
Syntax highlighting with tree-sitter. Sweet colors.
-
bullshit-rs
A BullshitGenerator implementation in Rust
-
topfew
CLI to find high frequency occurrences in structured text files
-
flw
Process text via configurable tasks
-
mdbookshelf
Create epubs from a list of mdbook repositories
-
lindera-ipadic-neologd-builder
A Japanese morphological dictionary builder for IPADIC NEologd
-
manchu-converter
Converts transcripted Manchu text to Manchu script with Manchu alphabet
-
gst-plugin-textwrap
GStreamer Text Wrap Plugin
-
mdbook-open-gh-issue
mdbook preprocessor to add a open-on-github link on every page
-
clparse
A command line tool for parsing CHANGELOG.md files that use the Keep A Changelog format
-
spinning_cube
Perspective projection of a spinning cube, using just ASCII characters
-
leven
Make your own blog!
-
utf
UTF-8
-
latin1str
Windows-1252 string types
-
read_chars
An iterator over characters read from some I/O source
-
mdbook-fishextract
A mdbook preprocessor which handles mermaid graphs, offline, requires
mmdc
-
slidedeck
Create an HTML slide deck from Markdown
-
emoji-printer
Replace emoji shortcodes in string with emoji unicode (":sushi:" -> 🍣)
-
trigram
Trigram-based string similarity for fuzzy matching
-
pencil_case
but effective library for writing GUI in bevy
-
sydney
Vim-like, Command-line Gemini Client
-
pygmentize
wrapper for syntax highlighting
-
markdown-linkify
Markdown preprocessor for substiting link shorthands to valid links according to configurable regexes and custom substitution implementations
-
skyspell_core
skyspell core library
-
jpreprocess-window
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
stringutils
A collection of various and (hopefully) useful String utility functions
-
cermic
Print .png images in your terminal as ascii art!
-
koelner-phonetik
koelner_phonetik or cologne phonetics is a phonetic algorithm like soundex, but specialized for german words
-
mdbook-chapter-zero
A mdBook preprocessor that allows 0th (sub-)chapter
-
mdbook-checklist
An mdBook preprocessor for generating checklists and indexes
-
furigana
Map furigana to a word given its reading
-
vaporetto_rules
Rule-base filters for Vaporetto
-
rust_lemmatizer
A lemmatizing package for use with a .csv dictionary of lemmas and their corresponding words
-
saurus
Markdown to LaTeX Transpiler
-
seq2xypic
Turn a text sequence diagram into a LaTeX xypic diagram
-
print-positions
providing string segmentation on grapheme clusters and ANSI escape sequences for accurate length arithmetic based on visible print positions
-
platina
parameterized golden testing library
-
simple-word-count
word count function, try to get same result with Microsoft Office Word application
-
igo-rs
Pure Rust port of the Igo, a POS(Part-Of-Speech) tagger for Japanese (日本語 形態素解析)
-
yitizi
異體字查詢 Get variant Chinese characters
-
typos-dict
Source Code Spelling Correction
-
grader
Stream-based CLI for binary sorting text files via a given shell command
-
corg
A cog-like tool
-
file-search
File indexing and search
-
is_utf8
functions to determine if a sequence of bytes is valid utf-8
-
quill_delta_pdf
Convert Quill Delta to PDF
-
seven_seg
Seven-segment digital display for terminal
-
difference
text diffing and assertion library
-
tweak
when/then clauses to run
-
pdf-min
Very minimal crate for writing PDFs
-
utf8_slice
Lightweight UTF8 Slice Utilities
-
cha-rs
Extract specific characters from an input
-
encoding_rs_transcode
Transcode text within writers using encoding_rs
-
bbcode-tagger
BBCode tree parser and tagger
-
ocr_b_checksum
Generates OCR B Checksums
-
hashtag-regex
regex matching hashtags accoding to the unicode spec: http://unicode.org/reports/tr31/#hashtag_identifiers
-
scoundrel
An engine for working with ASCII roguelikes
-
github-slugger
A slugger for GitHub headings
-
string-box
Create Rust string from UTF-8 string, byte string or wide string
-
rescue-blanket
Escape values while they are being formatted
-
literate
programming tool that extracts code written in your Markdown files
-
mdbook-numeq
An mdbook preprocessor for automatically numbering centered equations
-
tagalyzer
A CLI tool to gather statistics on collections of plaintext-adjacent files
-
romulus
a stream editor like sed
-
jg
Jeff Goldblum (jg) is a command-line JSON processor. jg searches for structural patterns in json input and prints each json object that matches the pattern.
-
trevordmiller
Personal CLI
-
kvu
The simplest command line tool to manage key-value pair lines
-
glyph-names
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
lingua-spanish-language-model
The Spanish language model for Lingua, an accurate natural language detection library
-
ayda
Ask your Documents Anything. A tool for querying your documents with a large language model.
-
static_format
Format strings with no runtime overhead
-
gen-epub-book
Generate an ePub book from a simple plaintext descriptor
-
caribon
A repetition detector program and library
-
rustex
auto-generated LaTeX files in Rust
-
ripmors
encoding and decoding international Morse code and several variants
-
intname
Full English name for any integer of any primitive integer type
-
const_format_proc_macros
detail of the
const_format
crate -
yarner-lib
build plugins for the Literate Programming tool Yarner
-
terminal-emoji
safely displaying emoji inside of terminals
-
scie
research about how to build simple code identify engine for different languages
-
bibliofile
A TUI epub reader inspired by DOS-era programs
-
konbaato
Conversions between different Japanese scripts
-
vidyut-kosha
A Sanskrit key-value store
-
cw
Count Words, a fast wc clone
-
lingua-danish-language-model
The Danish language model for Lingua, an accurate natural language detection library
-
ansi-cut
cutting a string while preserving colors
-
encoding-index-korean
Index tables for Korean character encodings
-
pcre2
High level wrapper library for PCRE2
-
correct_word
A No brainer 'did you mean' library for Rust
-
rcut-lib
rcut is a Rust replacement for GNU cut that supports UTF-8
-
uniquewords-rs
Count the frequencies of words in text file(s)
-
ccase
Command line interface to convert strings into any case
-
blockcounter
Counts the blocks in a stream
-
bionic-ebooks
Takes an EPUB file and generate a copy with bionic like font applied
-
ced
Dead easy csv editor
-
encoding_c_mem
C API for encoding_rs::mem
-
mojibake
Encode/Decode bytes as emoji base2048
-
harfbuzz-traits
Rust Traits for the HarfBuzz text shaping engine
-
didaskalia
over-complicated language to store text entries and sequences across different languages
-
patterns
no_std compatible pattern scan library using simd
-
mdbook-morsels
Morsels plugin for Mdbook
-
comment-strip
Remove comments out of text files
-
informa
font compiler for digital typography
-
textalyzer
Analyze key metrics like number of words, readability, and complexity of any kind of text
-
dr
Command-line data file processing in Rust
-
rnltk
Natural Language Toolkit for Rust
-
bebop-lang
LISP based Markdown preprocessor for creating html documents!
-
ddvm
Document to Document Virtual Machine
-
kaomoji-ru
The full http://kaomoji.ru database, encoded and escaped as byte literals
-
chisel-parsers
Chisel parser front ends
-
fast2s
A fast Traditional Chinese to Simplified Chinese conversion library. Built with FST, faster than most of other libraries.
-
ewin-com
editor for Window(GUI) users.No need to remember commands
-
deepfrog
A deep learning NLP suite (PoS,lemmatiser,NER) with FoLiA XML support
-
chinese2digits
The Best Tool of Chinese Number to Digits. A useful tool in NLP and robot project.
-
august
& program for converting HTML to plain text
-
ptero-cli
A text steganography CLI tool for Social Media
-
b2c2-stat
デバッグ用にところどころで使ってるぽい
-
anagrambot
find anagrams of words
-
mdbook-newday
Command line utility to add a new line for today to the top of an mdbook SUMMARY.md book
-
bibutils-sys
Rust bindings for bibutils, a program for bibliography format interconversion
-
zp
Copy the contents of the source file or the std output buffer to the clipboard, allowing users to easily paste the contents into another file or program
-
old_icelandic_zoega
Old Icelandic dictionary for Rust. From "A Concise Dictionary of Old Icelandic" by Geir Zoëga
-
demoji
Remove all emojis from a string
-
color-convert
Support RGB,RGBA,HEX,HSL,HSLA,HSV,CMYK to convert each other, write by rust
-
t9
Easily create port number from words and vice versa
-
bookkeeping
Minimal, safe and flexible in-memory bookkeeping API
-
text_distance
A collection of approximate string matching algorithms
-
mdbook-post
A CLI for add post to mdbook
-
vaporetto_tantivy
Vaporetto Tokenizer for Tantivy
-
mdbook-reference-table
mdBook preprocessor to create reference tables
-
csvre
replacing data in CSV columns with regular expressions
-
yozuk-sdk
Types used in the Yozuk ecosystem
-
markdown-table
Creating markdown tables with Rust!
-
am-parse
converting asciimath to other formats
-
kansuji
漢数字と数字の相互変換のためのライブラリ
-
repub
convert markdown documents to epub
-
markdown-composer
composing markdown documents
-
bos_books_codes
that handles 3-character Bible Books Codes
-
cattocol
Combine two text into one text as columns
-
vcg
manipulating verification conditions
-
esl01-drawdag
Parse an ASCII DAG into parent relations
-
cmark2tex
A small utility to convert markdown files to pdf exploiting tectonic
-
regexnight
Command-line tool to print syntax-highlighted versions of regular expressions and spot errors
-
mdbook-asciidoc
mdBook backend for AsciiDoc generation
-
lingua-dutch-language-model
The Dutch language model for Lingua, an accurate natural language detection library
-
mediawiki_parser
A strict parser for MediaWiki markdown
-
ae-renderable
handling individual text characters that can be rendered on the screen. Intended for use in game development.
-
rew
A text processing CLI tool that rewrites FS paths according to a pattern
-
falcom-sjis
Falcom-compatibile Shift JIS implementation
-
termbook
behind the
termbook-cli
-
terminal-supports-emoji
Check whether the current terminal supports emoji
-
mdtransform
program that renders Markdown as HTML for websites
-
eliza
natural language processing program developed by Joseph Weizenbaum in 1966
-
ssml-parser
parsing speech sythnesis markup language
-
bbd-lib
Binary Braille Dump
-
mdbook-snips
Markers for hidden lines in rust blocks within an mdbook
-
lindera-filter
Character and token filters for Lindera
-
toml_document
Document-style TOML parser. Allows editing TOML document in a way which preserves whitespace and comments.
-
morse-nostd
A nostd version of the morse crate
-
mdbook-tagger
A mdbook preprocessor to collect and build tag data in Json format
-
deliminator-core
Universal code documentation generator
-
prettify-markdown
Format Markdown at the speed of Rust
-
datex
package to automatically process your data and make it look pretty in LaTeX
-
mdbook-hide
A preprocessor for mdbook that adds support for hidden chapters
-
lines_lossy
extension to
BufRead
with a functionlines_lossy
that works likeBufRead::lines
but with lossy UTF-8 decoding -
aklat
create books from markdown files (like Gitbook)
-
ansi-to-tui-forked
convert ansi color coded text into ratatui::text::Text type from ratatui library
-
dhoni
converting Bengali text into their phonetic counterpart
-
conllu-utils
working with the CoNLL-U dependency format
-
latex_snippet
Convert even erroneous LaTeX snippets into HTML
-
anystr
An abstraction over string encoding that supports ASCII, UTF-8, UTF-16 and UTF-32
-
every-range
Iterator for interleaving included and excluded ranges
-
gret
command line tool to search for patterns and show matches in a tree structure
-
token-read
reading whitespace delimited files intended for competitive programming
-
economic_indicator_finder
A finder for extracting economic indicators from paragraphs
-
xsystem
Conversion between the Esperanto x-system and Unicode circumflexes
-
bareun_rs
Bareun is a Korean Morphological analyzer for Rust
-
morph-rs
Dictionary Morphologizer for Russian language
-
fontconfig-rs
Safe, higher-level wrapper around the fontconfig library
-
darts
A double array trie, A Forward Maximum Matching Searcher
-
my_mini_grep
A mini application that aims to replicate the behavior of the grep shell command
-
rustextile
Textile markup language parser for Rust
-
corpus-preproc
A preprocessor for text and HTML corpora
-
stringsort
Pathological sorting of string characters
-
vidyut-chandas
A Sanskrit metrical classifier
-
charisma
Generate ASCII Art using AI 🦾🧠
-
react-html
react html components for frender
-
trunc8
Truncate text to a specific line length, based on a number of parameters
-
kradical_static
Ready-to-use EDRDG radical decompositions
-
ascii-rs
Process image into colored-ascii image
-
syntaxdot-encoders
Encoders for linguistic features
-
LatexImgGen-lib
generating Latex images of equations with an edition number to avoid cache flushing and timestamp attributes
-
perspicuity_formula
Calculate Flesh Reading Ease for a given text and language
-
local-encoding
encoding/decoding string with local charset. It usefull for work with ANSI strings on Windows.
-
highlights
Convert kindle highlights into different formats
-
quoted-string-parser
Quoted string parser for grammar defined in RFC3261
-
nb2fr
Convert integer number to literal French text
-
lingua-italian-language-model
The Italian language model for Lingua, an accurate natural language detection library
-
varcon
Source Code Spelling Correction
-
moenster
mønster (n) - pattern. simple glob-style pattern matching for strings
-
porter-stemmer
Flexible and unicode friendly, Porter stemmer implementation
-
regex-cli
A command line tool for debugging, ad hoc benchmarking and generating regular expressions
-
nipah_tokenizer
A powerful yet simple text tokenizer for your everyday needs!
-
hemtt-tokens
A token library for hemtt
-
ascii_tree
generates ascii trees
-
minigrep_bakedspacetime
Minimal Rust implementation of grep based on The Book
-
mdbook-webinclude
Preprocessor for mdBook that includes content from URLs
-
lindera-ipadic-neologd
A Japanese morphological dictionary for IPADIC NEologd
-
great-tokenizer
A tokenizer
-
bookrafter
This repository contains code related to bookrafter rendering
-
ergrep
grep strings within a line from a text file
-
norm-email
strip email provider defined behaviour from email addresses
-
vl-convert-pdf
convert SVG to PDF with embedded text
-
wkhtmltopdf
High-level bindings to wkhtmltopdf
-
cli-animate
Enrich your Rust CLIs with stunning, easy-to-implement animations
-
math-text-transform
Transform greek letters, latin letters, or decimal digits into certain variants from the mathematical alphanumeric symbols Unicode block (U+1D400–U+1D7FF). For example to bold, italic, script or double-struck.
-
charasay
The future of cowsay 🐮! Colorful characters saying something 🗨️
-
base_emoji
Convert everything to Emojis (and back)! - 🐫🔦🔥🎲🐬🍀🍟🔋🐬🎲🐬🍀🎁🍟💧👂🔥🚪🔋🍟🔦🔋🚗👣🔦🌍👂🍤🐗
-
webreg
A CLI tool for testing regexes against web pages
-
md_converter
My markdown to html converter
-
yozuk-helper-english
English NLP utilities for Yozuk
-
aki-txpr-macro
the more easy to use libaki-*
-
table_to_html
interface to convert a
tabled::Table
into a HTML table (<table>
) -
dynfractal
A program to render any equation (in terms of z and c) into an ASCII fractal
-
minbpe
Port of Andrej Karpathy's minbpe to Rust
-
oneparse
A one in all simple parser framework
-
xim-ctext
compound text en/decoder
-
hline
a grep-like tool that highlights lines in files
-
pdf_seekers
parser and information extractor from PDF documents based on keyword search functionality (powered by Rust)
-
text_alignment
Align your text in Rust in the CLI
-
pangu
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
transcoding_rs
Converts text encoding the easy and efficient way
-
wordbreaker
A Unicode-aware no_std crate (requires alloc) that rapidly finds all sequences of dictionary words that concatenate to a given string
-
bitfont
Takes an ASCII string and generates a vector containing a bitmap font, for easy overlay into images
-
sola
A 16-bit stack machine and forth language
-
syllarust
quickly counting syllables
-
spandex-hyphenation
Knuth-Liang hyphenation for a variety of languages
-
tb_normalization
normalization utf8 string, loc dau vietnamese and some language
-
jp-location-relation
隣接する市区町村の一覧を取得
-
lingua-chinese-language-model
The Chinese language model for Lingua, an accurate natural language detection library
-
textblocks
working with text blocks
-
strings
String utilities, including an unbalanced Rope
-
os_display
Display strings in a safe platform-appropriate way
-
noneifempty
Convert T to None if empty
-
stardict_lib
Rust implement stardict
-
minigrep_macano953
A fun grep tool written as my first incursion into Rust. Not intended to be inherited and used in production code
-
mdbook-latex
An mdbook backend for generating LaTeX and PDF documents
-
trexter
Text progression tracking library
-
mdbook-obsidian
mdBook preprocessor to render Obsidian specific syntax
-
roxy_syntect
Roxy plugin for providing syntax highlighting via Syntect
-
h_hangul
Korean Characters
-
textr
TeX-inspired plug-n-play interface for converting JSON documents into PDFs
-
ucd-parse
parsing data files in the Unicode character database
-
rosie
Interface for the Rosie Pattern Language, for efficient and maintainable text pattern matching and search
-
mepple
English dictionary as a library
-
paperoni
A web article downloader
-
charless_minigrep
A program for searching for phrases in a text and printing the lines containing it
-
groop
Outline-aware grep with syntax highlighting
-
textcat
detect text categories. It can be used to detect the language of a given text
-
toml_to_table
pretty print TOML as a table
-
encoding-next-index-simpchinese
Index tables for simplified Chinese character encodings
-
web-grep
A Grep Tool for HTML or XML
-
simple_csv
CSV parsing implementation
-
ascii-engine
A fast ascii engine written in rust!
-
jpreprocess-dictionary
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
nib
static site generator
-
xsv
A high performance CSV command line toolkit
-
crudite
A JSON CRDT
-
mdtohtml
markdown to html renderer (with a couple of missing features)
-
retest
Command-line regular expression tester
-
mdtranslation-cli
Command-line tools for using mdTranslation, which can be used to prepare multi-lingual Markdown documents
-
mdbook-tectonic
An mdbook backend for generating LaTeX and PDF documents
-
ammonia
HTML Sanitization
-
beary
augment Bear, the note-taking app
-
fancy_print
Easily print animated ASCII text to the terminal!
-
x1b
State tracker for 0x1b terminal escape codes. Also gives random characters names.
-
jellybean-pack-1
Sweet syntax highlighting with tree-sitter
-
yeslogic-ucd-parse
parsing data files in the Unicode character database
-
publish_demo_node
This program picks a random christmas emoji for you
-
cn-font-split
划时代的字体切割工具,CJK与任何字符!支持 otf、ttf、woff2 字体多线程切割,完美地细颗粒度地进行包大小控制。A revolutionary font subetter that supports CJK and any characters…
-
tfidf-summarizer
Basic tf-idf compute for documents
-
rsrpp
project for research paper pdf
-
dumbfuzz
dumb library for fuzzy search
-
aki-json-pick
The json pick out command
-
minigrep_desonglll
grep implementation from The Rust Programing Book
-
japhonex
Japanese phone number checker for Rust
-
pdf_composer_base
PDF Composer base functionality crate
-
rust-cedar
efficiently-updatable double-array trie in Rust (ported from cedar)
-
hsk
Return HSK Level for Simplified Chinese Characters
-
re2
Wrapper for the re2 C++ regex library
-
alpino-tokenizer
Wrapper around the Alpino tokenizer for Dutch
-
mdbook-extended-markdown-table
Preprocessor for mdBook that generates tables with merged cells from ASCII text
-
iconv-compat-win-sys
iconv bindings for Rust
-
unicode_clusters
variable width unicode characters as single items, allowing for array like indexing etc
-
ttf_word_wrap
Wraps text based on character width
-
spyglass
Search engine for documents, inspired by bioinformatics
-
codes-iso-15924
This package contains an implementation of the ISO 15924 Codes for the representation of names of scripts specification
-
markdown_to_html_parser
parses Markdown syntax into HTML
-
readable-readability
Really fast readability
-
mojimoji-rs
fast converter between Japanese hankaku and zenkaku characters, mojimoji
-
zalgo-text
A command line tool for generating zalgo text
-
atm_parser_helper_common_syntax
Parsing utilities for the common syntax specification
-
grep-rezza
An app for search text in file
-
iterlower
Final-sigma-correct lowercasing iterator adapter with option for Turkish/Azeri I behavior
-
unicode_names
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
mdbook-to-github-wiki
Turns an mdbook book into a github wiki
-
tex
The νTeX typesetting engine
-
anagram
A collection of anagram utility functions
-
unicode_skeleton
detects unicode strings that look nearly identical once rendered, but do not compare as equal. It defines "confusable" and "skeleton" based on Unicode Standard Annex #39
-
html2runes
An HTML to Text converter
-
top-english-words
retrieve top words from the English language
-
mdbook-mathpunc
An mdbook preprocessor that prevents line breaks between inline math blocks and punctuation marks when using katex
-
strainer
Command-line tool for finding duplicate lines in text files
-
adauguet_minigrep
First program from the Rust Book
-
diff_match_patch
Diff Match Patch is a high-performance library in Rust-lang that manipulates plain text
-
route-pattern
A parser and matcher for route patterns
-
guarding_parser
Guarding is a guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.
-
genpdfi
User-friendly PDF generator written in pure Rust
-
is-vowel
Heuristically test whether a character is a vowel letter
-
emojito
Find all the Emoji in a string. Supports composed emoji.
-
lindera-decompress
A morphological analysis library
-
rmgrep
rust min grep text tools
-
tpng
A small tool that prints truecolor png renderings to the terminal using unicode block characters
-
lexer-generator
Lexer derived from Regex patterns with user customizeable tokens
-
hashlogs
Command-line utility that hashes the part before a space on each line from stdin with blake2b keyed with an ephemeral randomly-generated key and writes to stdout
-
simplecc
Chinese Convert library (partially) compatible with OpenCC's dictionaries
-
kryptor
command-line cryptography tool
-
rusty_code_code_for_book
my book_rusty code
-
dismej
convert text to discord emojis
-
contractions
expand contractions in English
-
jp_inflections
A small crate to conjugate japanese words
-
sudachiclone
sudachiclone-rs is a Rust version of Sudachi, a Japanese morphological analyzer
-
grepy
search files
-
exit-with-code
A CLI which returns the exit code of your choice
-
indexrs
inefficient multi-language search index
-
rsplitter
Splitter is a tool to split words from text that contains no spaces
-
pest_ascii_tree
Helper crates converting the parsing result of any pest grammar into an ascii tree
-
static_table
creates pretty tables at compiler time
-
kaomji-ru
The full http://kaomoji.ru database, encoded and escaped as byte literals
-
fwuffgrep
Basic implementation of a grep command written in rust
-
fingers
a finger client library
-
traktorhtml2txt
Does what it says, convert an Traktor set list HTML export to text
-
hex_table
Format a byte slice into a hex table
-
math_test
Generates math questions for elementary school kids
-
smoldown
Native Rust library for parsing Markdown
-
mdbook-compress
Compress an mdBook project into a single PDF file
-
unicode_converter
CLI tool to convert data between various Unicode encodings
-
smart-date
Parses relative and absolute dates in natural language text
-
hoedown
bindings for the Hoedown markdown processor
-
geml
Generator-orientated ML parser
-
minigrep_vielenkz
grep using rust
-
hema_minigrep
grep clone
-
markdown-table-formatter
Markdown table formatter fully compliant with Unicode 15.1.0
-
text-sanitizer
convert text to plain ASCII text
-
naming_clt
Extract and convert the naming format(case|notation) of identifiers from files or stdin. Use this tool to prepare identifier name strings for further operations (matching,replacing...) on relative files
-
unic-bidi
UNIC — Unicode Bidirectional Algorithm
-
gulpeaseindex
Calculate Gulpease index for a given text and language
-
encoding8
various 8-bit encodings
-
cologne_phonetics
generate phonetic cologne codes for utf8 strings
-
kanabake
Turn Romaji strings into Hiragana or Katakana
-
rexpaint
functionality for reading and writing .xp files of the Grid Sage Games REXPaint ASCII art editor
-
utf8reader
wrapper around Reader that returns a stream of UTF-8 characters
-
esc
Escape characters in strings
-
font-index
Static index for system fonts
-
asciis
ASCII base on RFC20. Just ord() and chr()
-
texrender
Thin wrapper around running
latexmk
to render LaTeX documents. Also supports generating Tex documents. -
nib-cli
A cli for a yet another static site generator Nib
-
wordninja
port of the Word Ninja English word splitting library
-
mystem
Wrapper around Yandex Mystem for Rust
-
mdbook-multicode
Allows you to give multilanguage code examples, toggled by a spinner
-
lingua-portuguese-language-model
The Portuguese language model for Lingua, an accurate natural language detection library
-
hex_d_hex
HexDHex is a Rust Crate that encodes and decodes byte data to and from its hexidecimal representation. For instance, one may wish, on ocasion that is, to translate a utf8 or ASCII string…
-
rckive-genpdf
User-friendly PDF generator written in pure Rust
-
rut
A small UTF-8 parsing library for applications that need to parse individual
char
s -
newline-converter
Newline byte converter library
-
minigrep5
grep implementation in Rust
-
spellcheck
A spell-checker based on the statistical algorithm described by Peter Norvig in http://norvig.com/spell-correct.html
-
cp437_constants
Defined constants covering the entirety of ASCII code page 437 (https://en.wikipedia.org/wiki/Code_page_437)
-
html_to_pdf_lib
converting HTML to PDF
-
names-changer
Convert a names of sql schemes from camelcase to snake case
-
strng
Owned UTF-8 string
-
latex-to-html
Latex to html converter
-
charjpoet
Charj Poet is a API for write to
.cj
language -
minigrep-yogie
A demo Rust to grep some word from rust-lang.com
-
libgrep-rs
searching through text
-
decline-word
Choose word form based on given number
-
b2c2-common
b2c2の共通部分
-
cjieba-sys
unsafe ffi to cppjieba
-
encoding-index-japanese
Index tables for Japanese character encodings
-
panduck-latex
Use panduck to generate XeLaTeX
-
kth-lines
Command line tool for filtering stdin lines that just work
-
jellybean-pack-0
Sweet syntax highlighting with tree-sitter
-
mdlint-cli
tbd
-
deface
Lightweight markup to HTML converter
-
markx
markdown parser
-
case-conv
Faster case conversion crate
-
minigrep_maybe
Just another minigrep from Rust book
-
jpreprocess-dictionary-builder
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
node_rust_demo
This program picks a random christmas emoji for you
-
aqua_parse
A parser library
-
askama-filters
Extra template filters for Askama
-
pdf_form
programatically filling out pdf forms
-
tablers
create tables and export them into CSV or PDF. Without external deps
-
recode_rs
Command-line tool for converting between the character encodings defined in the Encoding Standard
-
txtframe
Creates a frame for text
-
vextractor
vextractor-cli
-
lyn
A scanner useful for creating recursive descent parsers
-
grep-searcher
Fast line oriented regex searching as a library
-
scrambler
command line tool to scramble letters
-
equt-md-ext
Extend event iterator
-
encoding
Character encoding support for Rust
-
kytea-tokenizer
Wrapper of tokenization by KyTea
-
html_to_markdown
Convert HTML to Markdown
-
conv-rs
CLI converter for multi extensions
-
pascii
image => text
-
munemo-rs
Turn an integer into a more rememberable word, or vice-versa
-
milligrep
Custom simplified implementation of grep
-
fnew
A Unicode-aware line-oriented drop-in replacement for coreutils' fold
-
simplearrayhash
-
stfu
Shut The Ferris Up - profanity filtering for Rust
-
encoding-index-tradchinese
Index tables for traditional Chinese character encodings
-
lindera-compress
A morphological analysis library
-
pikchr-cli
PIC-like diagramming language to SVG converter
-
stopwords
from popular text processing frameworks
-
slicer
that slices string slices into smaller string slices
-
rigrep
grep from Rust Book
-
conveyance
A stop-gap CLI for conveyancing
-
chanoma
Characters Normalization library. 文字列正規化処理用のライブラリです。
-
ezk-internal
ezk internal util
-
pdf_forms
programatically filling out pdf forms
-
text-tables
A terminal/text table prettifier with no dependencies
-
uniaxe
replace Unicode letters with Ascii equivalents
-
scannedpdf
A fast, low memory footprint, simple images to PDF crate
-
nutrimatic
Tools for reading Nutrimatic (https://nutrimatic.org) index files
-
simple-text-pattern
Matching simple text patterns
-
rusty_word_builder
Syllable and Word generation library written fully in Rust
-
latexify
Shared definition for turn a rust object into latex code
-
alphabet-encoder
A quick and dirty way to deal with escape characters
-
esl01-renderdag
Render a graph into ASCII or Unicode text
-
kma-rustlang-vadym-polishchuk-english-parser
parser of English sentences created for KMA Rust course
-
japanese-ruby-filter
Japanese ruby notation parser
-
epubparse
Parse epub and convert to text-only Book structure
-
encoding-next-index-japanese
Index tables for Japanese character encodings
-
goya
morphological analyzer for Rust and WebAssembly
-
convert_encoding
Convert encoding of text files in batch
-
dictionary-1024
a mnemonic dictionary with 1024 words, 10 bits per word
-
rep-cli
Replace text file in bulk
-
unidecode
pure ASCII transliterations of Unicode strings
-
saku
efficient rule-based Japanese Sentence Tokenizer
-
ruby-parser
A parser for the Ruby language
-
jpreprocess-naist-jdic
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
lindera-tokenizer
A morphological analysis library
-
unic-idna-mapping
UNIC — IDNA — IDNA Mapping Table
-
backslash
parsing escape characters
-
fst-subseq-ascii-caseless
An automaton that matches if the input contains a specific subsequence ignoring ASCII case to be used with fst
-
luaparse
A Lua 5.3 parser
-
pdftotext
High-level library that binds to Poppler to extract text from a PDF
-
lingua-swedish-language-model
The Swedish language model for Lingua, an accurate natural language detection library
-
basen
Convert binary data to ASCII with a variety of supported bases
-
STKLR
STKLR: pronounced 'stickler'. Is a cli tool to automatically link functions, enums, structs, traits etc in rust-doc docstrings. I couldn't find a tool like this when I needed it so... here we are.
-
brainfuck_interpreter
brainfuck interpreter written in Rust
-
regex-cli-test
A command line tool for testing against regex patterns
-
bgrep
grep tailored to handle binary patterns and files
-
naveengrep
command line tool similar to the grep
-
transliterate1234
UTF-8 to ASCII transliteration
-
try_publish_for_demo
This program picks a random christmas emoji for you
-
lindera-sqlite
Lindera tokenizer for SQLite FTS5 extention
-
unicode-utf8
that converts utf-8 bytes to a unicode scalar value, and vice versa
-
json-event-parser-witespace
A json event parser with whitespace
-
lindera-py
Python binding for Lindera
-
mdlynx
Small, fast utility to find broken file links in Markdown documents
-
pseudolocalize
Pseudolocalization crate for Rust
-
lindera-analyzer
A morphological analysis library
-
jellybean-pack-2
Sweet syntax highlighting with tree-sitter
-
rosie-sys
build or link to librosie to access the Rosie Pattern Language
-
lithe-cli
A cli of lithe
-
snakecase
general purpose snakecase implementation supporting both ascii and unicode
-
pcre2-sys
Low level bindings to PCRE2
-
asciifolding
ascii folding library
-
djot
Djot parser written in pure Rust
-
tex-rs
create latex documents in Rust
-
encoding-index-simpchinese
Index tables for simplified Chinese character encodings
-
mdoc
Modern PDF creation through Markdown and LaTeX
-
tectonic_engine_xdvipdfmx
The
xdvipdfmx
program as a reusable crate -
base1112031
way to represent numbers using all Unicode characters except ASCII control characters
-
mon-artist
Render ASCII art to SVG, with user-customizable rendering rules
-
debug_canvas
Easily print complicated debug strings
-
pdf2
PDF 2.0 Rust library
-
ascii-alphabetic-char
Traits for ASCII alphabetic characters
-
clparser
A command line tool for parsing CHANGELOG.md files that use the Keep A Changelog format. (fork marcaddeo/clparse)
-
flag_by_code
Easiest way to get an emoji flag from BCP-47 language tags
-
enc-check
inspect utf-8 and utf-16 character encodings
-
the-book-tui
A TUI application to load, read, and search The Rust Book
-
yeslogic-fontconfig
RENAMED: use the fontconfig crate instead
-
rustyword
An anagram finder
-
lingua-polish-language-model
The Polish language model for Lingua, an accurate natural language detection library
-
replace-all
Cli to quickly replace occurences of a word in a file
-
case_convert
Converts the first letter of a Rust String to uppercase
-
unic-ucd-block
UNIC — Unicode Character Database — Unicode Blocks
-
mdtable-cli
that makes creating tables in markdown much easier!
-
publish_demo_node_1
This program picks a random christmas emoji for you
-
minigrep_linqi
grep implementation from The Rust Programing Book
-
writedown
format parser
-
base_u256
base-u256 is to utf-8 as base-64 is to ascii
-
forming
lightweight architecture as code language. 架构描述语言
-
encoding_rs
A Gecko-oriented implementation of the Encoding Standard
-
grep-pcre2
Use PCRE2 with the 'grep' crate
-
emojicons
Parse :emoji: notation to unicode representation
-
owlz
"Owlz" ascii emoojis, created randomly or by design