-
regex
regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
similar
A diff library for Rust
-
textwrap
word wrapping, indenting, and dedenting strings. Has optional support for Unicode and emojis as well as machine hyphenation.
-
fancy-regex
regexes, supporting a relatively rich set of features, including backreferences and look-around
-
const_format
Compile-time string formatting
-
convert_case
Convert strings into any case
-
ropey
A fast and robust text rope for Rust
-
lazy-regex
lazy static regular expressions checked at compile time
-
widestring
wide string Rust library for converting to and from wide strings, such as those often used in Windows API or other FFI libraries. Both `u16` and `u32` string types are provided, including support for UTF-16 and UTF-32…
-
pulldown-cmark
A pull parser for CommonMark
-
unicode-xid
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
deunicode
Convert Unicode strings to pure ASCII by intelligently transliterating them. Supports Emoji and Chinese.
-
unicode-segmentation
Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules
-
tokenizers
today's most used tokenizers, with a focus on performance and versatility
-
termimad
Markdown Renderer for the Terminal
-
emojis
✨ Lookup emoji in *O(1)* time, access metadata and GitHub shortcodes, iterate over all emoji, and more!
-
unicase
A case-insensitive wrapper around strings
-
html2text
Render HTML as plain text
-
onig
Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.
-
unicode-normalization
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
unicode-bidi
Unicode Bidirectional Algorithm
-
lngcnv
linguistics: display pronunciation, translate between dialects, convert between orthographies; support for multiple languages: English, Latin, Polish, Quechua, Spanish, Tikuna
-
rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
lopdf
PDF document manipulation
-
prettydiff
Side-by-side diff for two files
-
fuzzy-matcher
Fuzzy Matching Library
-
chardetng
A character encoding detector for legacy Web content
-
linkify
Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.
-
mdbook
Creates a book from markdown files
-
lindera-cli
A morphological analysis command line interface
-
regress
A regular expression engine targeting EcmaScript syntax
-
slug
Convert a unicode string to a slug
-
text-splitter
Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.
-
Inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
pulldown-cmark-to-cmark
Convert pulldown-cmark Events back to the string they were parsed from
-
stringprep
algorithm
-
diff
An LCS based slice and string diffing implementation
-
unicode-general-category
Fast lookup of the Unicode General Category property for char
-
finl_unicode
handling Unicode functionality for finl (categories and grapheme segmentation)
-
unicode-script
exposes the Unicode `Script` and `Script_Extension` properties from UAX #24
-
titlecase
Capitalize text according to a style defined by John Gruber for Daring Fireball
-
shell-escape
Escape characters that may have a special meaning in a shell
-
indenter
A formatter wrapper that indents the text, designed for error display impls
-
charabia
detect the language, tokenize the text and normalize the tokens
-
regex-syntax
A regular expression parser
-
synoptic
low-level, syntax highlighting library with unicode support
-
google-dialogflow3
A complete library to interact with Dialogflow (protocol v3)
-
diffy
Tools for finding and manipulating differences between files
-
unescaper
Unescape strings with escape sequences written out as literal characters
-
unicode-width
Determine displayed width of `char` and `str` types according to Unicode Standard Annex #11 rules
-
wana_kana
checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji
-
hyperlink
Very fast link checker for CI
-
const-str
compile-time string operations
-
roff
ROFF (man page format) generation library
-
tailspin
A log file highlighter
-
line-index
Maps flat `TextSize` offsets to/from `(line, column)` representation
-
printpdf
writing PDF files
-
ferris-says
flavored replacement for the classic cowsay
-
str_indices
Count and convert between indexing schemes on string slices
-
prefix
A customizable pretty printer for FIX messages
-
focaccia
no_std implementation of Unicode case folding comparisons
-
jieba-rs
The Jieba Chinese Word Segmentation Implemented in Rust
-
ncount
A word count tool intended to derive useful stats from markdown
-
matchers
Regex matching on character and byte streams
-
stringzilla
Faster SIMD-accelerated string search, sorting, fingerprints, and edit distances
-
text-size
Newtypes for text offsets
-
mdbook-pdf
A backend for mdBook written in Rust for generating PDF based on headless chrome and Chrome DevTools Protocol
-
mdbook-pandoc
A pandoc-powered mdbook backend
-
llmvm-core
The core application for llmvm
-
yeslogic-ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched with support for additional tables
-
uncased
Case-preserving, ASCII case-insensitive, no_std string types
-
mdbook-admonish
A preprocessor for mdbook to add Material Design admonishments
-
font-types
Scalar types used in fonts
-
mdbook-katex
mdBook preprocessor rendering LaTeX equations to HTML
-
mdbook-yapp
A mdBook preprocessor for simple text replacements
-
wildcard
matching
-
rphonetic
Rust port of phonetic Apache commons-codec algorithms
-
epub-builder
generating EPUB files
-
bfom
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
pluralizer
Rust package to pluralize or singularize any word based on a count, inspired by the pluralize NPM package
-
shell-toy
A "fortune | cowsay" implementation in Rust, i.e. a nice little toy to liven up your shell.
-
entities
raw data needed to convert to and from HTML entities
-
mdbook-catppuccin
🎊 Soothing pastel theme for mdBook
-
unindent
Remove a column of leading whitespace from a string
-
stam
powerful library for dealing with stand-off annotations on text. This is the Rust library.
-
autocorrect
A linter and formatter to help you improve copywriting by correcting spaces, words, and punctuation between CJK (Chinese, Japanese, Korean)
-
aki-resort
sort lines of text. You can use regex to specify the KEY.
-
unicode_names2
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
mlc
The markup link checker (mlc) checks for broken links in markup files
-
netidx
Secure, fast, pub/sub messaging
-
r2md
Entire codebase to single markdown or pdf file
-
pdf-extract
extract content from pdfs
-
hck
A sharp cut(1) clone
-
stop-words
Common stop words in many languages
-
text_io
really simple to use panicking input functions
-
clipboard-rs
Cross-platform clipboard API (text | image | rich text | html | files | monitoring changes) for Windows, macOS, and Linux
-
xi-unicode
Unicode utilities useful for text editing, including a line breaking iterator
-
slice-command
slice is a command-line tool that allows you to slice the contents of a file using syntax similar to Python's slice notation
-
tossicat
A library that converts a given tosi (Korean postposition) into the form appropriate for the word it is entered with
-
file-organiser
Command line file manager to list, move or delete large numbers of files in nested folders filtered by age, file extension, file name pattern and/or size range
-
stfu8
Sorta Text Format in UTF-8
-
cow-utils
Copy-on-write string utilities for Rust
-
newdoc
Generate pre-populated module files formatted with AsciiDoc that are used in Red Hat and Fedora documentation
-
matcher_rs
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
stylin
Convert markdown to pandoc markdown with custom styles
-
tibco_ems
A high level API for the Tibco EMS
-
sd
An intuitive find & replace CLI
-
pad
padding strings at runtime
-
languagetool-rust
LanguageTool API bindings in Rust
-
ascii
ASCII-only equivalents to `char`, `str` and `String`
-
utf16_iter
Iterator by char over potentially-invalid UTF-16 in &[u16]
-
boreal
evaluate YARA rules, used to scan bytes for textual and binary patterns
-
fontfor
find fonts which can show a specified character and preview them in terminal or browser
-
near-facsimile
Find similar or identical text files in a directory
-
difflib
Port of Python's difflib library to Rust
-
rustyline-async
A minimal readline with multiline and async support
-
bbd
Binary Braille Dump
-
hgrep
grep tool with human-friendly search output. This is similar to the `-C` option of the `grep` command, but its output is enhanced with syntax highlighting focusing on human-readable output.
-
mdbook-graphviz
mdbook preprocessor to add graphviz support
-
collclean
Clean up collaboration commands in LaTeX files
-
wezterm-bidi
The Unicode Bidi Algorithm (UBA)
-
decancer
that removes common unicode confusables/homoglyphs from strings
-
lindera
A morphological analysis library
-
affinidi-messaging-text-client
Affinidi Messaging SDK
-
serpl
terminal UI for search and replace, ala VS Code
-
unicode-bidi-mirroring
Unicode Bidi Mirroring property detection
-
duvet
A requirements traceability tool
-
chewing
(酷音) intelligent Zhuyin input method
-
savvy
R extension interface
-
nvl-cli
A program to download webnovels
-
fm
Non-backtracking fuzzy text matcher
-
aho-corasick-unsafe
Fast multiple substring searching
-
mdbook-epub
An EPUB renderer for mdbook
-
vaporetto
pointwise prediction based tokenizer
-
spellbook
A spellchecking library compatible with Hunspell dictionaries
-
sile
Simon’s Improved Layout Engine
-
graphannis
new backend implementation of the ANNIS linguistic search and visualization system
-
mdbook-combiner
combine mdbook summaries from multiple sources into one mdbook
-
mdcat
cat for markdown: Show markdown documents in terminals
-
hyphenation
Knuth-Liang hyphenation for a variety of languages
-
unicode-security
Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules
-
probly-search
A lightweight full-text search engine with a fully customizable scoring function
-
minimo
terminal ui library combining a lot of things from here and there and making it slightly easier to play with
-
console_static_text
Logging for text that should stay in the same place in a console
-
minimizer
Minimize files to find minimal test case
-
tree-sitter-stack-graphs-typescript
Stack graphs definition for TypeScript & TSX using tree-sitter-typescript
-
regex-literal
delimited regular expression literals
-
dvd-term
A bouncing ASCII art DVD logo (or custom text) for the terminal
-
quixote
Quizzes and tests in Markdown
-
omekasy
Decorate alphanumeric characters in your input with various fonts; special characters in Unicode
-
uwc
Counts things in unicode text files
-
subplot
tools for specifying, documenting, and implementing automated acceptance tests for systems and software
-
mdbook-quiz
Interactive quizzes for your mdBook
-
fax
Decoder and Encoder for CCITT Group 3 and 4 bi-level image encodings used by fax machines, TIFF and PDF
-
presenterm
A terminal slideshow presentation tool
-
fiat-lux
Offline terminal-accessible Bible
-
molybdenum
Recursive search and replace CLI application
-
tantivy-analysis-contrib
A set of analysis components for Tantivy
-
uuhelp_parser
A collection of functions to parse the markdown code of help files
-
tuicam
Terminal-based camera with switchable modes
-
vidyut-prakriya
A Sanskrit word generator
-
nucleo-matcher
plug and play high performance fuzzy matcher
-
asciisavers
A small collection of ascii screensavers
-
fasttext
binding
-
sf-api
API to send commands to the Shakes & Fidget servers and parse their responses into characters
-
yake-rust
Yake (Yet Another Keyword Extractor) in Rust
-
cruet
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
wchar
Procedural macros for compile time UTF-16 and UTF-32 wide strings
-
hypher
separates words into syllables
-
obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
unicode-joining-type
Fast lookup of the Unicode Joining Type and Joining Group properties
-
lipsum
lorem ipsum text generation library. It generates pseudo-random Latin text. Use this if you need filler or dummy text for your application. The text is generated using a simple Markov chain…
-
COXave
Instruments for codings
-
aki-xcat
concatenate files that are plain, gzip, xz and zstd
-
ident_case
applying case rules to Rust identifiers
-
mago-docblock
Analyzes PHP docblocks to extract annotations, tags, and documentation comments, aiding tools that rely on inline documentation
-
aneubeck-daachorse
Daachorse: Double-Array Aho-Corasick
-
bashtestmd
Compiles shell commands in .md files into Bash scripts for testing
-
mupdf
Safe Rust wrapper to MuPDF
-
igrepper
The interactive grepper
-
skyspell
Fast and handy spell checker for the command line
-
wordcut-engine
Word segmentation/breaking library
-
trans-epub
Translate EPUB with CLI
-
line-numbers
Find line numbers in strings by byte offsets, quickly
-
jetscii
A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes
-
colornames
An enum of color names, with a catchall RGB variant
-
regex-cursor
regex fork that can search discontiguous haystacks
-
qpdf
Rust bindings to QPDF C++ library
-
frida-build
Rust bindings for Frida
-
svgdx-pandoc
pandoc filter for svgdx codeblocks in Markdown
-
latex-thebib
Clean and sort legacy TeX bibliographies written using ‘thebibliography’ via the `refactor` sub-command. Compile BibTeX files to legacy `thebibliography` TeX code using the `compile` sub-command…
-
unicode_titlecase
add Unicode titlecase and Turkish and Azeri locale upper/lowercase utilities to chars and strings
-
unicode-id
Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31
-
yara-x
A pure Rust implementation of YARA
-
none-shall-pass
Artifact for GitHub Action to validate hyperlinks in all markdown files
-
pager
pipe your output through an external pager
-
levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
twars-url2md
A powerful CLI tool that fetches web pages and converts them to clean Markdown format using Monolith for content extraction and htmd for conversion
-
rustkorean
processing Korean characters. It provides functionalities to check if a character is Korean, classify Korean characters, verify if a character is a leading consonant (choseong), a medial vowel (jungseong)…
-
norad
Read and write Unified Font Object files
-
colossus
powerful tool for AI-driven text processing and analysis
-
htmd-cli
The command line tool for htmd
-
zalgo-codec
Convert an ASCII text string into a single unicode grapheme cluster and back. Provides a macro for embedding Rust source code that has been encoded in this way.
-
tantivy-stemmers
A collection of Tantivy stemmer tokenizers
-
mdbook-theme
A preprocessor and a backend to configure themes for mdbook, especially creating a pagetoc on the right and setting full color themes from the official ace editor
-
trxx
pack and unpack text files
-
mut-str
A toolkit for working with mutable string slices (&mut str)
-
figlet-comment
quickly create banner to use as comments
-
vesti
A preprocessor that compiles into LaTeX
-
fontkit
font loading and indexing
-
ascii_help
help you quickly convert ASCII codes
-
vmks-exam-generator
CLI program for pseudo-randomly generating different variants of an embedded programming exam
-
mdbook-mermaid
mdbook preprocessor to add mermaid support
-
indefinite
Prefix a noun with an indefinite article - a or an - based on whether it begins with a vowel
-
basic-text
Basic Text strings and I/O streams
-
autotex
Continuously compile TeX and LaTeX
-
mdbook-nice
A mdbook plugin to add nice css to your book
-
case_insensitive_hashmap
A HashMap that uses case-insensitive strings as keys
-
prop-check-rs
A Property-based testing Library in Rust
-
emojic
Emoji constants
-
string-patterns
Makes it easier to work with common string patterns and regular expressions in Rust, adding convenient regex match and replace methods (pattern_match and pattern_replace) to the standard…
-
pathmut
Command line utility for manipulating path strings
-
nlpo3
Thai natural language processing library, with Python and Node bindings
-
egui-dropdown
An actual dropdown list for egui
-
nanohtml2text
A zero-dependency library to convert HTML to plain text
-
zhconv
Traditional/Simplified and regional Chinese variants converter based on MediaWiki & OpenCC rulesets and powered by AC automata
-
arrow-row
Arrow row format
-
rustic_print
A versatile Rust library for enhancing console output. It offers a range of features to create a more engaging and informative command-line interface.
-
dmos
Djot HTML renderer with advanced features
-
gh-emoji
Convert `:emoji:` to Unicode using GitHub’s emoji names
-
simpleml
parser/writer for the Simple Markup Language format, as defined by Stenway. See https://dev.stenway.com/SML/. SML builds on top of WSV but allows hierarchical structuring of data.
-
text_analysis
Analyze text stored as *.txt in provided file or directory. Doesn't read files in subdirectories. Counting all words and then searching for every unique word in the vicinity (+-5 words)…
-
krafna
terminal-based alternative to Obsidian's Dataview plugin, allowing you to query your Markdown files using standard SQL syntax
-
harfbuzz_rs
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
grok
popular java & ruby grok library which allows easy text and log file processing with composable patterns
-
unicode-ccc
Unicode Canonical Combining Class detection
-
fuzzt
Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.
-
typed-oid
Typed Object IDs
-
m2h
Convert Markdown to HTML with syntax highlighting
-
libreofficekit
in Rust
-
epcmanager
EPC text tool for RFID
-
mdbook-alerts
mdBook preprocessor to add GitHub Flavored Markdown's Alerts to your book
-
bpetok
CLI for tokenizing text input using Byte Pair Encoding (BPE)
-
rust-persian-tools
Official Rust implementation of Persian Tools
-
pandoc
API that wraps calls to the pandoc 2.x executable
-
cskk
A kana-kanji conversion library using the SKK (Simple Kana Kanji henkan) method, intended for use through the C ABI
-
puppet-fmt
Automatic code formatter for puppet manifests
-
unidown
Convert Markdown to Unicode
-
instant-segment
Fast English word segmentation
-
chocodye
changing the color of the chocobos' plumage in Final Fantasy XIV
-
ipynb-to-md
Convert Jupyter Notebooks to Markdown files
-
pinyin-parser
Parses a string of pinyin syllables. Covers marginal cases such as `ẑ`, `ŋ` and `ê`.
-
treegrep
A pattern matcher frontend or backend which displays results in a tree
-
frawk
an efficient Awk-like language
-
mdbook-docslab
mdBook preprocessor for interactive code with docslab
-
rsrpp-cli
project for research paper pdf
-
asciidork-parser
Asciidork parser
-
fasta-cleaner
Transform fasta files by upper-casing all sequence characters and removing non-ACGT sequence characters
-
vader-sentimental
A faster Rust version of the original Python VaderSentiment analysis tool
-
aki-gsub
substitute text command, replace via regex
-
colored_text
adding colors and styles to terminal text
-
advreader
Split text file into text sequences, strings and (line) comments
-
mdbook-angular
mdbook renderer to run angular code samples
-
rustyink
Blazing fast static site generator
-
json_to_table
pretty print JSON as a table
-
numeric-sort
A zero-allocation, human-readable sorting library
-
pulldown-html-ext-cli
CLI tool for extended HTML rendering of Markdown with pulldown-cmark
-
tesseract-rs
Rust bindings for Tesseract OCR with optional built-in compilation
-
stringcase
Converts string cases between camelCase, COBOL-CASE, kebab-case, and so on
-
rust_string_utils
String utilities for rust based on org.apache.commons.lang3
-
unicode-blocks
contains a list of all unicode blocks and provides some functions to search across them
-
lindera-dictionary
A morphological analysis library
-
inflections
High performance inflection transformation library for changing properties of words like the case
-
zet
zet finds the union, intersection, set difference, etc of files considered as sets of lines
-
xboot
static variables like module-level database connections can be initialized asynchronously before the program starts.
-
pulldown-cmark-toc
Generate a table of contents from a Markdown document
-
vlazba
Lojban words generator and analyzer
-
oxford_join
Join string slices with Oxford Commas!
-
svgbob
Transform your ascii diagrams into happy little SVG
-
b2c2-tokenizer
A tokenizer for b2c2's BASIC code?
-
kelp
A convert tool for Japanese
-
tantivy-jieba
that bridges between tantivy and jieba-rs
-
roe
Unicode case conversion
-
neofile
linux file command for windows. PATCH 4.
-
simple_term_rewriter
A basic interface to rewrite concrete tree terms according to a set of rules
-
inkjet
A batteries-included syntax highlighting library for Rust, based on tree-sitter
-
etradeTaxReturnHelper
Parses etrade and revolut financial documents for transaction details (income, tax paid, cost basis) and compute total income and total tax paid according to chosen tax residency (currency)
-
dptran
run DeepL translations on the command line, written in Rust
-
timug
It has been created for personal blog creation purposes. Timug has its limits, but it fulfills the purposes for which it was created.
-
gen-mdbook-summary
generate SUMMARY.md for mdbook project
-
percent-encoding
Percent encoding and decoding
-
termdiff
Write a diff with color codes to a string
-
asimov-sdk
ASIMOV Software Development Kit (SDK) for Rust
-
dnd_spellbook_maker
making pdf documents of 5th edition D&D spells that are formatted like D&D source books
-
substring
method for string types
-
jayce
tokenizer 🌌
-
mdbook-typst
An mdBook backend to output Typst markup, pdf, png, or svg
-
shell2batch
Converts simple basic shell scripts to windows batch scripts
-
fontconfig
Safe, higher-level wrapper around the Fontconfig library
-
mdbook-d2
D2 diagram generator plugin for MdBook
-
dcsv
Dynamic csv reader, writer, editor
-
text-to-ascii-art
program to convert text to ASCII art
-
supply-chain-trust-example-crate-000022
Fast multiple substring searching
-
git-blamediff
A program to automatically annotate changes to a file in git(1)
-
string-auto-indent
Normalizes multi-line string indentation while preserving platform-specific line endings
-
bogrep
Full-text search for bookmarks from multiple browsers
-
soundex-rs
that calculates soundex value
-
utf8_iter
Iterator by char over potentially-invalid UTF-8 in &[u8]
-
unicode-reverse
Unicode-aware in-place string reversal
-
nlf
A CLI to append newline characters (LF) at the end of text file
-
dprint-plugin-markdown
Markdown formatter for dprint
-
glimpse
A blazingly fast tool for peeking at codebases. Perfect for loading your codebase into an LLM's context.
-
asciigraph
Ascii Graph
-
sublime_fuzzy
Fuzzy matching algorithm based on Sublime Text's string search
-
swift-check
High-performance, robust, and expressive searching and validation (uses SIMD on x86_64, aarch64, and WASM)
-
analiticcl
approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation
-
kathoey
text feminization using open corpus linguistics data
-
quranize
Encoding transliterations into Quran forms
-
rapidfuzz
rapid fuzzy string matching library
-
sk-skimmer
Fuzzy Finder in rust!
-
regex-charclass
Manipulate and convert regex character classes
-
composefs
filesystem
-
xml_magic
A reasonably fast XML formatter
-
any_ascii
Unicode to ASCII transliteration
-
float-pretty-print
Format f64 for showing to user, not for serialisation
-
filenamify
Convert a string to a valid filename
-
mdbook-private
An mdbook preprocessor that controls visibility of private chapters and sections within them
-
mdbook-toc
mdbook preprocessor to add Table of Contents
-
repvar
A tiny CLI tool that replaces variables of the style `${KEY}` in text with their respective value. It can also be used as a rust library
-
crowbook-text-processing
some utility functions for escaping text (HTML/LaTeX) and formatting it according to typographic rules (smart quotes, ellipsis, french typographic rules)
-
plsfix
Text cleaner upper
-
fast_symspell
Spelling correction & Fuzzy search
-
unicode-case-mapping
Fast lowercase, uppercase, and titlecase mapping for characters
-
text2num
Parse and convert numbers written in English, Dutch, Spanish, Portuguese, German, Italian or French into their digit representation
-
ib-pinyin
A high-performance pinyin matching library
-
iepub
Read and write EPUB and MOBI e-books
-
chinese_dictionary
A searchable Chinese / English dictionary with helpful utilities
-
extract_anchors
A utility for extracting all marked fragments from source code
-
mdmodels
generate models, code and schemas from markdown files
-
in_definite
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
-
jpreprocess
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
minspan
a package for determining the minimum span of one vector within another
-
mdbook-linkcheck2
A backend for `mdbook` which will check your links for you
-
nu-utils
Nushell utility functions
-
asciimath-unicode
Convert asciimath to unicode
-
tabprinter
creating and printing formatted tables in the terminal. It supports various table styles and offers both color and non-color output options.
-
gst-plugin-textahead
GStreamer Plugin for displaying upcoming text buffers ahead of time
-
shwild
(for Rust)
-
libabbs
aosc-os-abbs maintenance
-
enma
serving anime and manga information 📦
-
unicode-truncate
Unicode-aware algorithm to pad or truncate `str` in terms of displayed width
-
mdbook-callouts
mdBook preprocessor to add Obsidian Flavored Markdown's Callouts to your book
-
dialogi
A dialog parser
-
pulldown-cmark-mdcat
Render pulldown-cmark events to TTY
-
rust-ai
A collection of 3rd-party AI APIs for Rust
-
mdbook-pagebreaks
A mdbook preprocessor to insert page breaks when rendering to HTML
-
glu
Unpacks a Markdown document full of code snippets into a temp directory
-
termfactor
is your terminal sick, mid or sus? test your terminal features & find out
-
yamd
Markdown Document (flavour)
-
easy_reader
easily navigating forward, backward or randomly through the lines of huge files
-
safe-string
safe interface for interacting with multi-byte strings in Rust, namely IndexedStr, IndexedString, and IndexedSlice
-
mkwebsite
build websites using markdown
-
vi
An input method library for vietnamese IME
-
mandown
Markdown to groff (man page) converter
-
codepack
A *lightning-fast* tool that transforms a directory into a single, LLM-friendly text file, perfect for easy processing and analysis
-
unicode-canonical-combining-class
Fast lookup of the Canonical Combining Class property
-
hh-highlight
hh: The text highlighter. Read from stdin, and highlight selected words printed to stdout
-
mdbook-open-on-gh
mdbook preprocessor to add a open-on-github link on every page
-
linebreak
breaking a given text into lines within a specified width
-
aki-mcolor
mark up text with color
-
rust_file_encode_mode_convert
A Rust library for detecting the encoding of a file. Supports GBK, GBK2312, UTF8, UTF16LE, UTF16BE, UTF8+BOM, UTF32 and other encodings.
-
mdbook-pikchr
A mdbook preprocessor to render pikchr code blocks as images in your book
-
whitespace-sifter
Sift duplicate whitespaces away!
-
ean-rs
generating and validating EAN barcodes
-
pprint
Flexible and lightweight pretty printing library for Rust
-
diacritics
Remove diacritics from letters, for example when standardizing input for a search
-
stam-python
STAM is a library for dealing with standoff annotations on text, this is the python binding
-
mdbook-cmdrun
mdbook preprocessor to run arbitrary commands
-
kataru
YAML-based dialogue engine for game development
-
rutie
The tie between Ruby and Rust
-
uast
Unicode Aware Saṃskṛta Transliteration in Rust 🦀
-
mktoc
Generate Table of Contents from Markdown files
-
indent
Functions for indenting multiline strings
-
mdbook-external-links
Open external links inside your mdBooks in a different tab
-
catalog-of-markdown
Generate the catalog of markdown file
-
text-editing
string with utilities for editing
-
bwrap
A fast, lightweight, embedded systems-friendly library for wrapping text
-
simdutf
Unicode validation and transcoding at billions of characters per second
-
armnod
random string generator
-
mdbook-chess
An mdbook preprocessing plugin to generate chess boards
-
array_tool
Helper methods for processing collections
-
ColorShell
A small crate for coloring text for rust
-
url_encor
A lightweight library to encode and decode special characters in urls
-
kas-text
Text layout and font management
-
data-streams
Extension traits for reading and writing data with streams
-
inlet_manifold
A general purpose highlighting library
-
nu_plugin_regex
nu plugin to search text with regex
-
capitalize
Change first character to upper case and the rest to lower case, and other common alternatives
-
mantra-lang-tracing
Contains functionality needed to collect requirement traces from code or plain text files for the `mantra` framework
-
ewts
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (lib)
-
nibi
static site generator. (WIP, implementing)
-
rsonpath-lib
Blazing fast JSONPath query engine powered by SIMD. Core library of `rsonpath`.
-
date_time_parser
Rust NLP library for parsing English natural language into dates and times
-
latkerlo-jvotci
Tools for creating and decomposing Lojban lujvo
-
sgrep
grep util for those too lazy to remember many command line options
-
case
A set of letter case string helpers
-
supply-chain-trust-example-crate-000021
regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
mini_markdown
Small markdown renderer
-
textcode
Text encoding/decoding library. Supports: UTF-8, ISO6937, ISO8859, GB2312
-
common-words-all
Most common words sorted by ngram frequency
-
poriborton
Interconversion between Unicode and various Bengali ANSI encodings
-
uwurs
UwUify your strings with uwurs!
-
detone
Decompose Vietnamese tone marks
-
crate-starter
starter
-
sqdj
sqdj shortens delimited data
-
line-straddler
Determine how lines through text (underlines, strikethroughs, etc) should be rendered
-
mdbook-codeblocks
A mdbook preprocessor to prepend customizable vignette to code blocks
-
orly
Download O'Reilly books as EPUB
-
crlify
A std::io::Write wrapper that replaces `\n` with `\r\n` on Windows
-
mdbook-pdf-headless_chrome
Control Chrome programmatically
-
character_converter
Turn Traditional Chinese script to Simplified Chinese script and vice-versa and tokenize
-
iirs
Inverted repeats finder
-
agentai
designed to simplify the creation of AI agents
-
mago-fixer
Applies automated fixes and transformations to text
-
colonnade
format tabular data for display
-
turn-uppercase
Small command to uppercase text in command line and copy to clipboard
-
slack-blocks-render
Slack blocks render is a Rust library to render Slack blocks as Markdown
-
oxifetch
program that displays key system information, such as OS details, uptime, CPU specs, memory usage, and more. The output includes an ASCII art logo and a quick overview of your machine's current status.
-
santoka
Translations of 668 of Taneda Santoka's free-verse haiku
-
dnd-character
A Dungeons and Dragons character generator
-
fast_whitespace_collapse
Collapse consecutive spaces and tabs into a single space using SIMD
-
str_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
rewrite
Safely rewrite file contents from stdin, even when file is open as an input
-
mini-openai
An OpenAI API client with minimal dependencies
-
puid
A unique ID generator with 'ch_'-style prefix
-
arabic_reshaper
Reconstruct Arabic sentences to be used in applications that don't support Arabic script
-
clima
A minimal Markdown reader in the terminal
-
roman-numerals-rs
Manipulate well-formed Roman numerals
-
deliminator
Universal code documentation generator
-
sapling-streampager
streampager is a pager for command output or large files
-
tectonic_bridge_core
Exposing core backend APIs to the Tectonic C/C++ code
-
dodo-zh
operate on pinyin and cedict.u8 file
-
furigana
Map furigana to a word given its reading
-
vidyut-lipi
A Sanskrit transliterator
-
notion2html
Convert Notion pages to HTML
-
antex
Styled text and tree in terminal
-
zspell
Native Rust library for spellchecking
-
lcat
lolcat in rust! Full unicode support, escapes for ANSI escape sequences, hue shift in Cubehelix
-
bce
-
forbidden-bands
8-bit string handling library
-
unicode_categories
Query Unicode category membership for chars
-
fimdoc
Friendship is Magic Document, converts Markdown into FIMFiction BBCode
-
lll_toon
convert JPG images to ASCII art with colors
-
context-notation
Featherweight semantic notation for text
-
arf-strings
Encoding and decoding for ARF strings
-
utf8path
String-friendly path type
-
tiny-ver
tiny version parser
-
semsimian
Semantic similarity calculations for ontologies implemented in Rust
-
text-parsing
Hierarchical text processing preserving char position info
-
url_parser
URL Parser is a Rust parser developed to parse URLs into structured components such as scheme, domain, path, query and fragment
-
kbo
Spectral Burrows-Wheeler transform accelerated local alignment search
-
fmtm
A diff-friendly Markdown formatter that breaks lines on sensible punctuations and words to fit a line width
-
samvadsetu
LLM API for commonly used LLM services including Gemini, ChatGPT, and Ollama. The name implies a bridge for dialogue since the library facilitates communication and interaction between…
-
hanconv
Convert between Chinese characters variants
-
crates_tut
following the rust book tutorial
-
textpod
Local, web-based notetaking app inspired by 'One Big Text File' idea
-
cbfr
A buffer that runs on the stack, focusing on performance and speed
-
lindera-ko-dic-builder
A Korean morphological dictionary builder for ko-dic
-
mdbook-ai-pocket-reference
mdbook preprocessor for the ai-pocket-reference project
-
ethan-rs-wc
ethan-rs-wc (erwc) counts words, lines, characters, and bytes. Like the wc command, but aims to be more accurate and faster. Text can also be read from standard input for statistics.
-
text-tokenizer
Custom text tokenizer
-
tfon
Bitmap font parsing / conversion
-
lookbook
Component preview framework for Dioxus
-
iregex
Intermediate representation for Regular Expressions
-
md-ulb-pwrap
Markdown paragraph wrapper using Unicode Line Breaking Algorithm
-
erebus
A CLI message generation library
-
poppler-sys-rs
Low-level (FFI) bindings for poppler-glib
-
rasciigraph
function to plot ascii graphs
-
mdbook-tocjs
A mdbook preprocessor which adds extra js and css file for ToC hydration
-
rs-tool
A command-line tool to perform reservoir sampling on a file or a stream
-
cloc
Count, or compute differences of, lines of source code and comments
-
indent_write
Write adapters to add line indentation
-
stylish-ansi
stylish helpers for writing styles as ANSI escape codes
-
csv_to_table
pretty print CSV as a table
-
unic-ucd-ident
UNIC — Unicode Character Database — Identifier Properties
-
markov_str
Markov Chain implementation optimized for text generation
-
textwrap-macros
procedural macros to use textwrap utilities at compile time
-
html_parser_tarasenko
A basic HTML parser in Rust using Pest
-
advent-ocr
Converts ASCII-art representations of letters generated by Advent of Code puzzles into a String containing those letters
-
mdsh
Markdown shell pre-processor
-
human_regex
A regex library for humans
-
diary-cli
A powerful cli for documenting and keeping a diary
-
herring-automata
Automata construction for Herring
-
rofi-mode
High-level library for creating Rofi plugins and custom modes
-
speech-synthesis
Common traits for speech synthesis
-
repr
The regular-expression-as-linear-logic interpretation and its implementation
-
fetch-catnip
fetch displaying system information and a cute cat
-
pandoc_types
Rust port of pandoc-types
-
mdbook-ocirun
mdbook preprocessor to run arbitrary commands and code snippets inside containers
-
str-utils
some traits to extend types which implement AsRef<[u8]> or AsRef<str>
-
pukram2html
converting Pukram-formatted text to HTML
-
subtitler
parsing and generating subtitles
-
semchunk-rs
A fast and lightweight Rust library for splitting text into semantically meaningful chunks
-
pink_accents
Replacement of patterns in string to simulate speech accents
-
obmrs
As a participant, you will create a structure to receive and hold the exchange-distributed order book. This structure will be called the OrderBoard, and will hold the order book's bids and asks as a price-sorted map…
-
zipcodes
Query US zipcodes without SQLite
-
viterbi_pos_tagger
A part-of-speech (POS) tagger using the Viterbi algorithm
-
document_tree
reStructuredText’s DocumentTree representation
-
ipset_lookup
ipset is a command-line tool that takes networks or IPs and searches through a lot of different threat feeds quickly. It can also download the feed data necessary to perform the queries…
-
sapling-renderdag
Render a DAG into ASCII or Unicode text
-
sourceannot
render snippets of source code with annotations
-
htmd
A turndown.js inspired HTML to Markdown converter
-
mdbook-spec
An mdBook preprocessor to help with the Rust specification
-
paltoquet
rule-based general-purpose tokenizers
-
md-tui
A terminal markdown viewer
-
hangeul
Korean alphabet manipulation library
-
jawk
JSON AWK
-
rulet
figlet implementation
-
chord3
Create pdf songbooks from chopro source
-
wikipedia_prosesize
Count Wikipedia prose size
-
bump-bin
Increments version with semver specification
-
utf58
High-tech encoding of the Unicode space in one quibble and up to 3 bytes
-
tapir-bf
Tapir is a moderately useful Brainfuck interpreter
-
clarifai_grpc
The official Clarifai gRPC Rust client
-
fortune-rs
classic BSD fortune program
-
rust-regex-dsl-creator
Regular expression DSL derive macros
-
uklatn
Ukrainian Cyrillic transliteration to Latin script
-
swimos_model
SwimOS Generic Representation of the SwimOS Serialization Model
-
syllabize-es
Syllabize Spanish text, and much more
-
mtc-token-healing
Token healing implementation
-
betacode
conversion
-
mdbook_rash
Binary to create doc from rash code
-
harfbuzz_rs_now
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
smol-symbol
compile-time globally unique, 25-character, 128-bit symbols similar to the Symbol type in Crystal Lang
-
tremor-kv
A logstash inspired key value extractor
-
extract-strings
Extract ascii strings from files
-
chunk_norris
splitting large text into smaller batches for LLM input
-
picodiff
Tiny GUI app to compare text easily
-
mdtrans
Markdown parser and transformer using pest.rs, focused on flexibility to a project’s needs
-
pragmatic-segmenter
Rust port of pySBD v3.1.0
-
mdbook-plantuml
A preprocessor for mdbook which will convert plantuml code blocks into inline SVG diagrams
-
re_view_text_document
view that shows a single text box
-
trading-types
type definitions (and some utils) as building blocks for type-safe trading
-
rsrusl
A really simple useful library ported to Rust
-
surt-rs
Sort-friendly URI Reordering Transform (SURT)
-
zh_num
Convert ASCII numbers and zh words
-
alphabet_detector
Natural language alphabet detection library
-
mdbook-variables
mdBook preprocessor for resolving variables configured in book.toml
-
loc
Count lines of code (cloc) fast
-
raylib_interactive
An interactive library for Raylib
-
lindera-ko-dic
A Korean morphological dictionary for ko-dic
-
easy_random
Generate random data easily with easy_random :)
-
reggy
friendly, resumable regular expressions for text analytics
-
abbreviation_extractor
extracting abbreviations from text
-
egg-mode-text
Text parsing for Twitter: character counting, hashtag/mention extraction
-
avt
asciinema virtual terminal
-
tform
format plain text into well-structured Markdown or HTML
-
b2c2-parser
Parser for BASIC code in b2c2?
-
caseless
Unicode caseless matching
-
icall
call interface proto for hpc
-
textra
type less, write more
-
char-ranges
Iterate chars and their start and end byte positions
-
eternaltwin_constants
Hammerfest store implementation
-
repgrep
An interactive command line replacer for ripgrep
-
static-key
static keys in Rust
-
grammalecte_client
Grammalecte HTTP client
-
mdbook-cat-prep
a preprocessor for mdbook which provides teacher, subject, material and tag functionality
-
heart-strings
Quickly get random heart emojis to copy!
-
terminal-clipboard
a minimal cross-platform clipboard
-
tree-sitter-stack-graphs
Create stack graphs using tree-sitter parsers
-
opentalk-types-signaling-whiteboard
Signaling types for the OpenTalk whiteboard module
-
inflector-plus
Adds String based inflections for Rust. Snake, kebab, camel, word, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
csvtool
deal with CSV files
-
sre-engine
A low-level implementation of Python's SRE regex engine
-
regexy
lightweight Rust library for working with regular expressions. The regexy crate provides an easy-to-use interface for matching patterns in strings using regex
-
markov
A generic markov chain implementation in Rust
-
unicode-ellipsis
truncate Unicode strings to a certain width, automatically adding an ellipsis if the string is too long
-
mdbook-curly-quotes
mdBook preprocessor that replaces straight quotes with curly quotes, except within code blocks or code spans
-
unaccent
remove accents from strings, inspired by PostgreSQL's unaccent extension
-
gh_page_tool
A github gh-pages tool for static blog site
-
writings
The Bahá’í Sacred Writings for use in Rust projects and APIs
-
charx
A replacement for char::is_ascii*
-
truncate_string_at_whitespace
Truncate a &str at the closest whitespace to a specified length with unicode safety
-
svgc
svgc (previously SvgCompress) is a tool for compressing SVG files by removing unnecessary whitespace, comments, metadata and some other data. It also supports optimization with SVGO and compression into SVGZ.
-
lowcharts
draw low-resolution graphs in terminal
-
mdi
markdown include
-
djotters
Djot (Markdown) parser that runs at hyper speeds!
-
scraps_libs
A static site generator that makes it easy to write internal links (Wiki links) in Markdown files
-
tracery
Text-expansion library
-
tbll
tbll outputs data in tabular format
-
unicode-box-drawing
Unicode box-drawing characters
-
streampager
pager for command output or large files
-
mdbook-yml-header
mdBook preprocessor for removing yml header
-
mdbook-kroki-preprocessor
render kroki diagrams from files or code blocks in mdbook
-
lemmeknow
Identify any mysterious text or analyze strings from a file
-
midstring
Create a string between two other strings, that is lexicographically halfway between them
-
asciidork-eval
Asciidork eval
-
mdbook_header_footer
mdBook preprocessor to prepend header and append footer to certain chapters
-
tre-regex
Rust safe bindings to the TRE regex module
-
portmanteau
create portmanteaux
-
hebrew_unicode_script
A lightweight library to check if a hebrew character belongs to certain collections
-
pray
A TUI tool for preparing prompts for LLMs
-
uclanr
A random word picker that gives you actually useful words
-
clafrica
This application allows you to type most of the characters in the African alphabet in any text field
-
simple-string-patterns
Makes it easier to match, split and extract strings in Rust without regular expressions. The parallel string-patterns crate provides extensions to work with regular expressions via the Regex library
-
poppler
Wrapper for the GPL-licensed Poppler PDF rendering library
-
utf16_lit
macro_rules to make utf-16 literals
-
secular
No Diacr!
-
hr-id
A human-readable unique ID which supports Unicode, safe for URLs and file paths
-
choco
markup language for dialogue systems
-
ferret
A trigram-based tool for detecting similarity in groups of text documents or program code
-
rust-beam
A LaTeX slide generator you can write in faster than beamer
-
tfidf-text-summarizer
extractive text summarization system which uses TF-IDF scores of words present in the text to rank sentences and generate a summary
-
diffy-imara
Tools for finding and manipulating differences between files
-
cglue-bindgen
cleanup cbindgen headers for CGlue
-
mdbook-hints
mdBook preprocessor to add hover hints to your book
-
trpl
A support crate for The Rust Programming Language book
-
dbxcase
Dropbox-compatible case-folding algorithm
-
text_lines
Information about lines of text in a string
-
metatron
core library
-
substring-replace
developer-friendly methods to manipulate strings with character indices
-
transportations_library
Knowledge stack for transportation
-
ascii-izer
converting an image into ASCII art
-
csml_interpreter
The CSML Interpreter is the official interpreter for the CSML programming language, a DSL designed to make it extremely easy to create rich and powerful chatbots
-
idna-cli
Encode/decode Unicode domain names to/from IDNA ASCII
-
ucf
A universal code formatter
-
slugify
Macro for flexible slug generation
-
linkcheck2
extracting and validating links
-
nerd_font
Enumerated contents of the JetBrainsMono Nerd Font
-
casespaetzle
This package introduces common identifier case traits and a macro for case trait generation
-
whichlicense_detection
detect licenses used by the WhichLicense project
-
yozuk
Chatbot for Programmers
-
rust-tfidf
calculate TF-IDF (Term Frequency - Inverse Document Frequency) for generic documents
-
freesia
some string operators
-
html-auto-p
function like wpautop in WordPress. It uses a group of regex replaces used to identify text formatted with newlines and replace double line-breaks with HTML paragraph tags.
-
iconv-native
A lightweight text encoding converter based on platform native API or libiconv
-
precis-tools
Tools and parsers to generate PRECIS tables from the Unicode Character Database (UCD)
-
datatroll
a robust and user-friendly Rust library for efficiently loading, manipulating, and exporting data stored in CSV files
-
gregex
Regex solver utilizing NFA
-
mdbook-dtmo
Creates a book from markdown files with added plugins
-
supply-chain-trust-example-crate-000098
Determine displayed width of char and str types according to Unicode Standard Annex #11 rules
-
zoitei
alphabet conversions
-
tergo-formatter
Formatter for tergo
-
oxcomm
using Google Translate on the fly
-
what-rs
Identify what something is! A pyWhat reimplementation in Rust
-
rmbs
Remove any fluff, corporate speak, or other bullshit from input text and print the TL;DR essence of what's being said, using the www.bullshitremover.com public LLM API
-
text_trees
textual output for tree-like structures
-
simstring_rust
A native Rust implementation of the SimString algorithm
-
gosyn
Golang syntax parser in Rust
-
fluxcap
time expression parsing inspired by duckling
-
string-simple
containing some simple string utilities that I use in my other projects
-
tokenizers-enfer
today's most used tokenizers, with a focus on performances and versatility
-
mdbook-keeper
An improved testing experience for mdbook
-
fuzzy-datetime
Detects, completes and normalises fuzzy date and date-time strings for interoperability with chrono or direct output as ISO-8601-compatible strings
-
srake
RAKE Algorithm
-
escrit
learning languages by reading texts
-
malachi
A domain specific pattern matching language made for defining bot commands
-
repub-rs
binary for converting mhtml webpages into remarkable-style summarized epubs
-
mdbook-embedify
based mdbook preprocessor plugin that allows you to embed apps to your book, like youtube, codepen and some other apps
-
scanlex
lexical scanner for parsing text into tokens
-
rust_iso15924
ISO 15924, Codes for the representation of names of scripts, is an international standard defining codes for writing systems or scripts (a "set of graphic characters used for the written form of one or more languages")…
-
pretty-xmlish
Pretty print XML-ish data with unicode art
-
glk
Bindings for the Glk I/O interface for hosting interactive fiction interpreters
-
gaze
small utility library with the goal of making it easier to scan/lex text and collections
-
plagiarismbasic_lib
Basic plagiarism checker written in Rust
-
ps-str
String transcoding library
-
diff-man
diff utility lib
-
readability
Port of arc90's readability project to rust
-
typeline
Efficient, Type-Safe Pipeline Processor
-
uniquewords-rs
Count the frequencies of words in text file(s) or stdin
-
dicexp
A Dice Expression Interpreter program and library for parsing (and rolling) role-playing game style dice notations (e.g. "2d8+5")
-
vidyut-kosha
A Sanskrit key-value store
-
broken-md-links
A command-line tool and library to detect broken links in Markdown files
-
ruby_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
mdbook-github-authors
mdbook preprocessor to display Github profiles of authors of a page
-
sbert
Sentence Bert (SBert)
-
mdopen
Preview markdown files in a browser
-
fish-printf
printf implementation, based on musl
-
incredimo
just another font for your terminal
-
prolangkit
A set of language-agnostic utilities for langdev
-
lano-stre
lano's string util
-
matcher_py
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
rascii_art
Advanced ASCII Art Generator
-
mitex-parser
Parser for MiTeX
-
ragegun
Performs lexica based analysis on text (i.e. age, gender, PERMA, OCEAN personality traits, ..)
-
hyperscan
bindings for Rust with Multiple Pattern and Streaming Scan
-
man
Generate structured man pages
-
typst-ansi-hl
highlights your Typst code using ANSI escape sequences
-
somedoc
A very simple document model and markup generator
-
llmvm-codeassist
A LLM-powered code assistant that automatically retrieves context (i.e. type definitions) from a Language Server Protocol server.
-
vndb_tags_get
convert VNDB tag list (JSON to markdown)
-
easy-regex
Make long regular expressions like pseudocodes
-
csmlinterpreter
The CSML (Conversational Standard Meta Language) is a Domain-Specific Language developed for creating conversational experiences easily
-
unicode_extension
Don't use this crate
-
minigrep_xiaoai
A simple command-line tool for searching for strings in files.
-
mdbook-svgbob
SvgBob mdbook preprocessor which swaps code-blocks with neat SVG
-
cow-rewrite
Rewrite copy-on-write types copying only when it's necessary
-
etch
Not just a text formatter, don't mark it down, etch it
-
quickner-core
A fast and simple NER tool
-
textspan
Text span utility
-
bobo_html_parser
parser of html markdown
-
notalawyer
display license notices
-
ascii-hangman-backend
customizable Hangman game with ASCII-art rewarding for children (backend)
-
roman_numerals_fn
A function to convert integers to their roman numeral representation as strings. Values from 1 to 3999 are possible, otherwise it returns an OutOfRangeError. Zero has no representation in roman numerals.
-
thesaurus
An offline thesaurus library for Rust
-
cogni
Unix native interface for LLMs
-
anslatortray
translate from English to Pig Latin!
-
message_segment_calculator
package to calculate SMS message segments
-
vectorscan-rs
Ergonomic bindings to the Vectorscan high-performance regex library
-
colorex
flexible Rust library for styling terminal output with color and text formatting. Supports RGB/HEX colors, text styles (bold, italic, underline, etc.), and background colors with a chainable API.
-
wikidot-normalize
provide Wikidot-compatible string normalization
-
isixhosa
help process text in isiXhosa
-
words-count
Count the words and characters, with or without whitespaces
-
gemmy
high performance orderbook written in rust
-
libopenlipc-sys
Wrapper around liblipc to interact with Kindle dbus-based LIPC events
-
hmd
Custom Markdown Engine for my personal blog
-
lodestone
A website wrapper for FFXIV's lodestone
-
sanitize-filename-reader-friendly
A filename sanitizer aiming to produce reader friendly filenames
-
svgbob_cli
Transform your ascii diagrams into happy little SVG
-
mdbook-pagetoc
A mdbook plugin that provides a table of contents for each page
-
libmistry
management system written in Rust
-
mdbook-presentation-preprocessor
A preprocessor for utilizing an MDBook as slides for a presentation
-
libcoerced
serde_json and generic text content matching, verification and coercion
-
cesu8
Convert to and from CESU-8 encoding (similar to UTF-8)
-
auto-regex
Automagically finds a regex that best matches an example and a sample list
-
tzgrep
grep tar.gz
-
split-identifier
Rust package that provides functions to split programmatic identifiers according to case conventions
-
argentum_user_business
Business layer of user component
-
aho-corasick
Fast multiple substring searching
-
term_grid
formatting strings into a grid layout
-
adobe-cmap-parser
parse Adobe CMap files
-
owoify_rs
port of mohan-cao's owoify-js is released under MIT license, which is also the same license of owoify-js. This simple library will turn any string into nonsensical babyspeak similar…
-
jawohl
Complete JSON documents from a stream of JSON tokens
-
mathemascii
AsciiMath parsing and rendering
-
mdbook-linkcheck
A backend for mdbook which will check your links for you
-
regex-chunker
Iterate over the data in a Read type in a regular-expression-delimited way
-
linurgy
Manipulate the output of multiple newlines. Replace/Insert/Append newlines with text. Input and output from stdio/files/buffers
-
mdbook-fs-summary
Summary generator for mdbook
-
ik-rs
Chinese segmentation, ik-analyzer for Rust
-
screen_printer
displaying rectangular blocks of text to a terminal
-
regex_generate
Use regular expressions to generate text
-
enpsrlib
English Phrase Structure Rules library
-
character-set
High performance set.contains(char)
-
libanubhav
management system written in Rust
-
gspell
Rust bindings for gspell
-
dedent
Procedural macro for stripping indentation from multi-line string literals
-
address_book
A command-line tool for parsing phone numbers, identifiers, dates and invalid…
-
sesdiff
Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings in column A to the strings in column B. Also provides the edit distance (Levenshtein).
-
stego_wps
text-based steganography using the Words Per Sentence (WPS) method. It allows encoding and decoding hidden messages within the structure of a given text.
-
twas
A text substitution application for using random look-up tables to generate text in a manner similar to the Mad Libs game
-
angr
analyse ngrams in text files
-
lisho
personal link shortener with no external dependencies in under 200 lines of Rust
-
ragtime
Easy Retrieval Augmented Generation
-
merge-whitespace
Procedural macros for merging whitespace in const contexts
-
film_parser
application for parsing detailed film data from raw text, including information such as title, release year, director, screenwriter, cast, genre, and runtime, into a structured Film…
-
snailquote
Escape and unescape strings with shell-inspired quoting
-
pandoc-ac
pandoc filter for converting acronym codes to LaTeX
-
kanbanrs
A basic kanban task manager written with egui
-
verify_string
verifying strings
-
bubble-bath
Small and quick HTML sanitizer
-
dmos-cli
Djot HTML renderer with advanced features - CLI
-
reason-shell
Reason: A Shell for Research Papers
-
ncase
Enforce a case style
-
litua
Read a text document, receive its tree in Lua and manipulate it before representing it as string
-
harfbuzz
Rust bindings to the HarfBuzz text shaping engine
-
stringmatch
Allow the use of regular expressions or strings wherever you need string comparison
-
ipa-translate
translating between IPA and ASCII text
-
promptkit_rs
LLM structured prompting library
-
tu
CLI tool to convert a natural language date/time string to UTC
-
readability-text-cleanup
cleaning up text from HTML using Mozilla's Readability algorithm
-
minigrep_baolhq
Just getting started with Rust, enjoying it so far 😇
-
enc-check
inspect utf-8 and utf-16 character encodings
-
hns
Human numeric sorting program: does what sort -h is supposed to do!
-
typo_checker
Searches for typos of English words and returns matches or similar words.
-
nu_plugin_from_dhall
Nushell plugin to add Dhall support
-
shopping-parser
A Rust-based parser for parsing structured product information and shopping lists, supporting multiple currencies and units
-
slicestring
slicing Strings
-
cheetah
An extremely fast and flexible static site generator
-
aki-mline
match line, regex text filter like the Linux grep command
-
rsnltk
Rust-based Natural Language Toolkit
-
unflow
DSL to convert design to code
-
visa
(Virtual Instrument Software Architecture) Wrapper which allows concurrent and dynamic switching between visa implementations from different vendors
-
owned_chars
Owned iterators with the same output as Chars and CharIndices
-
pulldown_mdbook
A pull parser for mdBook
-
simple-markdown-parser
handling markdown. May include extra utilities
-
mdbook-rustviz
An mdbook preprocessor that allows users to embed RustViz visualizations into mdbook projects
-
mago-span
Tracks exact character positions in source code, making it easier to report precise errors, warnings, and other diagnostics
-
unic-emoji-char
UNIC — Unicode Emoji — Emoji Character Properties
-
easy_io
Fast and dead-simple IO for competitive programming in Rust
-
r-matrix
Rust port of cmatrix
-
minigrep_jeck
minigrep is a grep clone that takes a query and searches for the query in the file; with added support for regex
-
palmdoc-compression
Fast & safe implementation of PalmDoc/MOBI/AZW/Kindle flavored LZ77
-
squidge
shortens delimited data
-
image-to-ascii
Converts images and gifs to ascii art
-
rust-texas
generate latex documents
-
lexical-sort
Sort Unicode strings lexically
-
pdf_composer
Create PDF documents from YAML Front Matter source documents
-
casile
The command line interface to the CaSILE toolkit, a book publishing workflow employing SILE and other wizardry
-
colorize-rs
terminal text colorisation using ansi characters
-
bcdown
Bilibili manga downloader, written in Rust, supports epub, pdf and zip formats
-
string-overlap
A helper crate for "layering" ASCII art
-
selmr
Package to create and use Simple Explainable Language Multiset Representations
-
pullup
Convert between markup formats
-
ewts-cli
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (cli)
-
segtok
Sentence segmentation and word tokenization tools
-
ungoliant
The pipeline for the OSCAR corpus
-
rustsay
CLI tool in Rust that mimics the classic cowsay program, allowing a cow to speak your text in the terminal
-
chamkho
Khmer, Lao, Myanmar, and Thai word segmentation/breaking library and command line
-
readable-regex
Regex made for humans. Wrapper to build regexes in a verbose style.
-
mecab
Safe Rust wrapper for mecab, a Japanese language part-of-speech and morphological analyzer library
-
tgrep
Toy grep that honors .gitignore
-
cesu8-str
CESU-8 and Java CESU-8 string validation and manipulation
-
tnipv-lint
lints for tnipv, the Telcoin Network Improvement Proposal validator
-
yeslogic-fontconfig-sys
Raw bindings to Fontconfig without a vendored C library
-
notedown_ast
Notedown Abstract Syntax Tree
-
gdnative-doc
Documentation tool for gdnative
-
markdown-extract-cli
Extract sections of a markdown file with a regular expression
-
emojicon
Find Emoji by using Emoticons and GitHub's, Bengali emoji names
-
crowbook
Render a Markdown book in HTML, PDF or Epub
-
strinject
Inject text from somewhere else into given text
-
hydroper_source_text
Source text containing line locations
-
xi-rope
A generic rope data structure built on top of B-Trees
-
gstring
String with support for Unicode graphemes
-
advanced_string_generator
A command-line tool for generating strings based on customizable regex patterns
-
worcher
full-text search for static websites
-
simple_peg
A command-line peg parser implemented in Rust
-
lindera-unidic-builder
A Japanese morphological dictionary builder for UniDic
-
ultra
Cryptanalysis of the Enigma
-
reg_match
A match style regex tool
-
bureau
extracting dependency licenses, used by nertsio
-
mdbook_fork4ls
Fork of mdBook for mdBook_LS
-
env2toml
Convert env vars to toml text
-
b2c2-casl2
Rough parser for CASL2 in b2c2?
-
scatternotes
A cli application to manage unstructured notes
-
testing_table
An easy to use library for pretty print tables of Rust structs and enums
-
unicode-casing
Titlecase helper function on characters
-
mdbook-tabs
mdBook plugin for rendering content in tabs
-
punycode
Functions to decode and encode Punycode
-
bukvalno
A cli tool for converting images to ascii art
-
tuilet
A textual user interface for Toilet, the ANSI-art text generator
-
tengwar
Transliterate text into J.R.R. Tolkien's Tengwar.
-
ttaw
talking to a wall, a piecemeal natural language processing library
-
dispnet-hash
Hash Library used for Dispnet
-
nu_plugin_emoji
a nushell plugin called emoji
-
nih_log
An opinionated yet flexible logger catering to the needs of the NIH-plug plugin framework
-
wool
Preview Github Markdown Offline
-
unveil-rs
Unveil Rs is a tool to create presentations from markdown files
-
latinrs
Rust package that transliterates text and characters into their Latin form
-
fiberplane-markdown
convert Fiberplane Notebooks to and from Markdown
-
formatjson
Formats JSON files
-
sapling-drawdag
Parse an ASCII DAG into graph edges
-
usage-lib
working with usage specs
-
supply-chain-trust-example-crate-000066
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
kbremap
Custom keyboard layouts for windows
-
cindex
CSV indexing library
-
markdown-gen
generating Markdown files
-
cyrconv
A funny faux cyrillic character mapper
-
jira-clean
clean up Jira task description that is an output of jira-cli tool
-
loe
Very fast and yet another line ending (CRLF <-> LF) converter written in Rust
-
abbreviator
abbreviating long words
-
searcher_txt
A copy of grep that I made to show that I'm bad at Rust
-
cosmic-text
Pure Rust multi-line text handling
-
utf8-io
Traits and types for UTF-8 I/O
-
wildcard_ex
extended wildcards that allows VB-like specifications
-
my_parser_kma_test_group_3_1
A brief description
-
indented_text_writer
IndentedTextWriter
-
rust-regex-dsl
Regular expression DSL
-
repa
Peak Performance Pattern Seeker
-
minigrep_elijahkx
MiniGrep is a Rust-based command-line tool, with a (current) size of 588KB that lets users search files for a given query string and shows matching lines with their line numbers
-
texcore
Create LaTeX documents using native Rust types
-
marker
finding issues in CommonMark documents
-
mudder
Generate lexicographically-spaced strings between two strings from pre-defined alphabets
-
hexstring
handling hexadecimal string
-
opentalk-types-signaling-meeting-report
Signaling types for the OpenTalk meeting-report module
-
zhconv-cli
Convert Traditional/Simplified Chinese and regional words of Taiwan/Hong Kong/mainland China/Singapore based on Wikipedia and OpenCC rulesets 轉換中文簡體、繁體及兩岸、新馬地區詞,基於維基百科和…
-
dekor
styling and character repository in Rust
-
quake_text
Utils for Quake strings and characters
-
rins_markdown_parser
markdown parser written in Rust
-
supply-chain-trust-example-crate-000037
Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.
-
just-enough-emojis
text to emoji cli
-
cellumina
configure and run cellular automata
-
recvmsg
Traits for receiving datagrams reliably, without truncation
-
code-splitter
Split code into semantic chunks using tree-sitter
-
imperative
Check for imperative mood in text
-
liwe
IWE core library
-
lucide-yew
Yew port of Lucide
-
botanical-latin
Decliner / conjugator / inflector for classical / botanical Latin
-
qsv_vader_sentiment_analysis
Bindings for Rust from the original Python VaderSentiment analysis tool. Forked for use with qsv.
-
cargo-markdown
Local crates.io readme development server with ultra-fast hot reloading goodness
-
campfire
A tiny static site generator, greatly inspired by Zola
-
onig_sys
onig_sys crate contains raw Rust bindings to the Oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma…
-
manchu-converter
Converts transcribed Manchu text to Manchu script using the Manchu alphabet
-
avatarsay
Beautiful quotes from Avatar: The Last Airbender
-
prompt-input
lightweight library for user input prompts in Rust, designed to make input handling straightforward
-
bk-tree
A Rust BK-tree implementation
-
pdfcr
render a codebase to a pdf
-
boreal-cli
CLI utility to run boreal, a YARA rules engine
-
wdl-doc
Documentation generator for Workflow Description Language (WDL) documents
-
correct_word
A No brainer 'did you mean' library for Rust
-
chunkr
A fast and quick chunking library for rust
-
text_unit
Newtypes for text offsets
-
regex-split
split_inclusive for the regex crate
-
tectonic_engine_xetex
The XeTeX engine as a reusable crate
-
mdbook-gitbook
mdBook preprocessor to properly render GitBook specific syntax
-
markdown_converter
html to markdown converter and flavored markdown to discord markdown converter
-
unicount
Alphabetic counter supporting unicode
-
esl01-drawdag
Parse an ASCII DAG into parent relations
-
zspell-cli
Command line interface for the ZSpell spellchecking library
-
bfom-lib
Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?
-
kbnf-regex-automata
A forked version of regex-automata for kbnf
-
c6o-obsidian-export
associated CLI program to export an Obsidian vault to regular Markdown
-
marktask
A CLI tool for parsing and manipulating Markdown tasks
-
recase
Changes the convention case of input text
-
sms_splitter
An SMS message splitter and part calculator with support for GSM and Unicode
-
royal_road_archiver
An archival program and library for the webnovel site RoyalRoad
-
udp-logger-rs
Log macro for log's kv-unstable backend and a UDP socket logger
-
flowistry_ide
Information Flow in the IDE for Rust
-
uo_rst_parser
fork of rst_parser with fixes for upstream-ontologist
-
detect-newline-style
Determine a string's preferred newline character
-
my_parser_rrin
My parser
-
markdown2pdf
Create PDF with Markdown files (a md to pdf transpiler)
-
godotdoc
documentation generator for GdScript
-
pho
PHOnetic alphabet renderer
-
ngrammatic
Character-oriented ngram generator and fuzzy matching library
-
align
aligning text
-
java_string
Java strings, tolerant of invalid UTF-16 encoding
-
asimov-cli
ASIMOV Command-Line Interface (CLI)
-
ascii_converter
converting between different ascii representations
-
uapi-version
Compare versions according to the UAPI Version Format Specification
-
spacey
a tiny yet performant whitespace vmm/interpreter
-
subject-classifier
classifying a commit by its subject
-
anystr
An abstraction over string encoding that supports ASCII, UTF-8, UTF-16 and UTF-32
-
tiny_pretty
Tiny implementation of Wadler-style pretty printer
-
bstr
A string type that is not required to be valid UTF-8
-
rs-line-count
counts number of lines(' ')
-
mdbook_incremental_preview
Incrementally preview mdBook, patching changes instead of making full rebuilds
-
pomsky-bin
Compile pomsky expressions, a new regular expression language
-
rust_readability
A package to assess the complexity of texts using a variety of readability formulas
-
giff
Visualizes the differences between the current HEAD and a specified branch in a git repository using a formatted table output in your terminal. The differences are displayed with color-coded…
-
veg
Flexible tables
-
path2regex
Express style path to RegExp utility
-
scoundrel
An engine for working with ASCII roguelikes
-
text-scanner
A UTF-8 char-oriented, zero-copy, text and code scanning library
-
censor
text profanity filter
-
quickmd
Quickly preview a markdown file
-
analyse-json
CLI tool for inspecting (Newline Delimited) NDJSON or JSON to understand the contents
-
runiq
An efficient way to filter duplicate lines from input, à la uniq
-
ascii_tree
generates ascii trees
-
ultra-nlp
A NLP library
-
realhydroper-utf16
Work with UTF-16 in Rust
-
convert_string
A trait to convert Strings to safe non-keywords and/or convert a String's case (snake_case, PascalCase, ...)
-
reword
some utility functions for human-readable formatting of words
-
markdown-toc
Markdown Table of Contents generator
-
runestr-pancjkv
rune-based Pan-CJKV support
-
mdbook-davids_cooking
A preprocessor for whatever https://davidsotomarchena.gitlab.io/davids-cooking/ needs
-
font-map
Macros and utilities for parsing font files
-
prototty_render
Traits for defining a renderer and a view
-
docket
markdown to HTML documentation rendering
-
bilingual
A cmdline tool used for markdown translation via calling Chinese translation api cloud services
-
mdbook-tools
A collection of tools for mdbook
-
aki-xtee
copy standard input to each file and standard output
-
character_frequency
counting character frequencies in a string concurrently
-
CLI_Project_Scott_Coakley
CLI Project in Rust
-
fuzzy-string-distance
Fuzzy string distance comparisons
-
rsmorphy
Morphological analyzer / inflection engine for Russian and Ukrainian (soon) languages (WIP)
-
sayit
String replacements using regex
-
lister-cli
Lister: Navigate Markdown Lists
-
tokengrams
Compute n-gram statistics and model language over pre-tokenized text corpora used to train large language models
-
letter-sequence
A method to create sequence displayed as uppercase or lower letters, or digits
-
hyphenation_commons
Proemial code for the hyphenation library
-
seshat-unicode
A Unicode Library for Rust. Unicode 16.0.0 ready. XID_Start and XID_Continue are also available.
-
markdown-formatter
Flavored Markdown (ZH) content formatter
-
neo-mime
Strongly Typed Mimes
-
bytescolor
A versatile Rust library for colorizing strings and byte data in terminal applications using ANSI escape codes
-
pi_ucd
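Unicode character functions: get the language range of a character and, as needed for text layout, determine whether a character is a single-character glyph or an alphabetic character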
-
sm-search
way of searching through text - for people who are too lazy to use Regex
-
loki_text
advanced string manipulation with pattern searching and replacement capabilities
-
kspconfigtool
KSP1 ConfigNode parser and block removal tool
-
ut1_blocklist
UT1 blocklist URL/domain filters
-
textframe
query plain text documents by unicode offset without loading them all into memory
-
minify-html-common
Common code and data for minify-html*
-
markdown-it-footnotes
Creates footnotes and lists of footnotes in Markdown documents
-
mantra-miner
your software recite mantras while it runs
-
to-binary
Converts From Hex, UTF-8, Vector of Bytes, or a Byte Slice to a Binary String And Implements Many Useful Functions For Binary Strings
-
markdown-it-latex
Allows for the insertion of math in Markdown documents using LaTeX
-
faster-chars-count
counting length of chars faster than Chars::count()
-
story-dl
Story web scraping
-
twitter-text
in Rust
-
rusty-x
A snippet manager in Rust
-
boss
Baseball Open Source Software
-
reddish
A Rust utility library
-
kana-converter
converter for half-width/full-width Japanese language characters (katakana, hiragana, and ASCII)
-
mdbook-quiz-schema
Schema for quizzes used in mdbook-quiz
-
mdbook-svgbob2
Alternative mdbook preprocessor for svgbob
-
tet_rs
A third-party implementation of Text Entry Throughput (ref. https://doi.org/10.1145/3290605.3300866) for Rust
-
untanglr
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies
-
CFDTCP
Short text clustering for determining center points and correlation judgment [one-hot,Jenks Natural Breaks]
-
synox
Program synthesis of string transformations from input-output examples
-
scanix
search a text or pattern in files. A fast and lightweight text tool.
-
qm-role-build
Role builder from markdown tables
-
display_bytes
Human-readable display of byte sequences
-
ranting
Linguistic formatting placeholder extensions for rust
-
Person_Parser
Rust parser
-
charname
Incredibly simple library that just gives you the Unicode name for a character
-
as3_parser
ActionScript 3 parser
-
fast_aug
Fast data augmentation for text
-
mdbook-scientific
Enables inline equations for mdbook to set by $..$ signs and $$..$$
-
flashtext2
The FlashText algorithm implemented in Rust
-
mdrss
generating RSS feeds from markdown files
-
lingua-french-language-model
The French language model for Lingua, an accurate natural language detection library
-
notegraf
Core library for building a graph-oriented notebook
-
emoji
Every emoji, their metadata, and localized annotations
-
testcall
companion crate to bintest, implements test facilities
-
notmecab
tokenizing text with mecab dictionaries. Not a mecab wrapper.
-
string_art
Convert images into thread patterns for creating string art. It generates detailed instructions in text format and provides graphical previews of the resulting patterns.
-
regexgrep
ripgrep tool that supports regular expressions
-
mdbook-force-relative-links
An mdbook pre-processor to transform all local links to relative ones
-
lingua-spanish-language-model
The Spanish language model for Lingua, an accurate natural language detection library
-
jpreprocess-njd
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
unicode-width-16
Determine displayed width of char and str types according to Unicode Standard Annex #11 rules
-
grammateus
facilitate working with Ancient Greek words
-
tabwriter
Elastic tabstops
-
wordshk_tools
A combination of parsers and other tools for words.hk (粵典)
-
gematria_rs
Gematria, a traditional Hebrew numerology system
-
uulm-sopra23-group03-core
Core library for implementing Battle of the Centerländ
-
anon-csv-cli
anonymise CSV files, providing various options to substitute real data with plausible fake data
-
rustblocks
tetromino based falling blocks game
-
loading
Show loading in terminal
-
xpanda
Unix shell-like parameter expansion/variable substitution
-
fast-str
A flexible, easy-to-use, immutable, efficient String replacement for Rust
-
sluggify
slug or clean URL generator for Rust. With default settings, you will get a hyphenated, lowercase, alphanumeric version of any string you please, with any diacritics removed, whitespace and dashes collapsed…
-
mathematica-notebook-filter
mathematica-notebook-filter parses Mathematica notebook files and strips them of superfluous information so that they can be committed into version control systems more easily
-
alass-ffi
FFI bindings for subtitle synchronization with alass
-
text-transliterate
transliterate texts using the SO iconv from POSIX
-
guarding
guardians for code, architecture, and layering. Guarding creates an architecture guard DSL based on ArchUnit.
-
elden-ring-saver
ansi2
-
wordmarkov
but flexible Markov chain library, specifically for text sentences, which handles punctuation and whitespace
-
mago-casing
converting strings between various common casing styles
-
sortuniq
Find or count unique values in an input stream
-
mdbook-preprocessor-utils
writing mdBook preprocessors
-
yinzhe9
喵喵隐者9
-
pra
Print Random ASCII
-
jposta
A fast and intuitive Terminal User Interface (TUI) tool for searching Japanese postal codes and addresses
-
valid_rust_char
A tiny library to check if a char is valid in a Rust file
-
char_reader
Safely read wild streams as chars or lines
-
abjad
Calculate the numerical abjad value of Arabic-script text
-
gbx
GBX (Grundbuch-Exchange) file format
-
strizer
minimal and fast library for text tokenization
-
commonregex
Rust port for CommonRegex. Find all times, dates, links, phone numbers, emails, ip addresses, prices, hex colors, and credit card numbers in a string. We did the hard work so you don't have to.
-
tagsearch
Filter plaintext files based on @keyword tags
-
mdbook-najan
Preprocessor for the Najan mdBook
-
csvsc
Build processing chains for CSV files
-
slow-scan-print
Text is printed at fixed intervals by character or by line, and its name is inspired by SSTV (Slow Scan TV)
-
typeline_ext_http
http(s) tooling for typeline
-
indicator-extractor
Extract indicators (IP, domain, email, hashes, etc.) from a string or a PDF file
-
byte_string
Wrapper types for outputting byte strings (b"Hello") using the Debug ({:?}) format
-
character_text_splitter
splitting text into chunks with overlap, designed for handling large amounts of text efficiently. Implementation is identical to langchain's CharacterTextSplitter
-
uwu-rs
uwuifying library
-
search-in-terminal
A terminal-based search tool
-
refac
Transform some text given a generic natural language prompt
-
kindle2cbz
extracting images from kindle books in MOBI format to CBZ archives
-
pdf-sign
extract signed date from pdf file
-
dtex
Better TeX
-
mdbook-typst-pdf
mdbook typst pdf backend
-
unescape
Unescapes strings with escape sequences written out as literal characters
-
umlauts
text transformation of german umlauts
-
hl2-lib
highlighting streams of text via regex
-
slicedisplay
Simplistic Display implementation for Vecs and slices
-
langsan
sanitizing language model input and output
-
mdbook-metadata
mdBook preprocessor to parse markdown metadata
-
lindera-unidic
A Japanese morphological dictionary for UniDic
-
rst_parser
a reStructuredText parser
-
ngram
Iterator adaptors for n-grams and k-skip-n-grams
-
google-fonts
Download and cache TTF fonts from Google
-
galm
pattern matching library
-
alpha-counter
Alphabetic counter
-
ellipse
Truncate and ellipse strings in a human-friendly way
-
rustrings
Strings manipulation for Rust
-
utils_rust
A Rust library for various utility functions
-
bzzz
Bee-Speak Translator
-
bitflip
functions to generate bitflips of binary and UTF-8 strings
-
mdzk
Plain text Zettelkasten based on mdBook
-
mdast2minimad
converting markdown AST to minimad texts
-
wantora
wantora tool
-
escape-bytes
Escapes bytes that are not printable ASCII characters
-
common_substrings
Finding all common strings
-
skyspell_kak
skyspell - kakoune integration
-
akai
Next-generation, functional and simple ID system
-
split-every
Split for every n occurrences of a pattern iteratively!
-
crate_file_dyzs_16
program to read a CSV file and write a JSON file
-
matcher_c
A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust
-
text_layout
Text layout algorithms
-
utilprint
easily using colors or unicode in your terminal printing
-
sttx
belt for transforming speech-to-text data
-
wimbd
A CLI for inspecting and analyzing large text datasets
-
unicode-vo
Unicode vertical orientation detection
-
pdf2pwg
Single purpose A4 page renderer rendering PDF using pdfium to PWG/URF
-
ob
A Blog and RSS system written in Rust
-
deref
detecting and managing duplicate academic citations
-
normalize-hebrew-rs
package that normalizes special symbols within Hebrew string used in the Qumran-Digital project
-
stardict
Rust implementation of stardict
-
rl-sys
Native bindings to libreadline
-
rtss
A command-line tool to annotate stdout/stderr with elapsed times
-
asciis
ASCII base on RFC20. Just ord() and chr()
-
ferrissay
cowsay
-
jp-deinflector
A package for deinflecting Japanese words
-
dynfractal
A program to render any equation (in terms of z and c) into an ASCII fractal
-
ucfirst
Uppercase the first letter of a string
-
caser
Change text between PascalCase, camelCase, and snake_case
-
gimme
Pull useful data out of your clipboard
-
swot
community-driven or crowdsourced library for verifying that domain names and email addresses are tied to a legitimate university or college
-
tangler
Extracts code blocks from Markdown documents
-
strcursor
string cursor type for seeking through a string whilst respecting grapheme cluster and code point boundaries
-
ascii-rs
Process image into colored-ascii image
-
solstice_glyph
A fast text renderer for solstice, powered by glyph_brush
-
sedregex
Sed-like regex library
-
pinot
Fast, high-fidelity OpenType parser
-
encoded-words
Encoded Words for usage in MIME headers
-
ucd-raw
Uninterpreted access to the unicode UCD
-
mdtranslation
prepare multi-lingual Markdown documents
-
is_printable
Determine whether a given text-based value is printable
-
indentsort
Structure-preserving sorting of arbitrary indented text
-
ryaspeller
lib for searching typos in text, files and websites
-
asciir
Print ASCII table/values
-
rust_stringz
string manipulation library for Rust that you probably don't need. It's just for learning purposes.
-
hello_lib
Demonstrate Generics Function
-
rsrpp
project for research paper pdf
-
techlead
CLI is a command-line interface that enables developers to chat with an AI assistant powered by the OpenAI GPT language model, designed specifically to help with your Rust project
-
runiq-lib
An efficient way to filter duplicate lines from input, à la uniq
-
cautious-octo-funicular
Test: shipping an mdbook with API docs
-
looking-glass
reflection & type-erasure library for Rust
-
ufofmt
A fast, flexible UFO source file formatter based on the Norad library
-
aqkanji2koe
wrapper for Rust
-
markdown-extract
Extract sections of a markdown file
-
policyai
mechanism for unstructured, composable policies that transform unstructured text into structured outputs
-
levenshtein_lite
No-frills implementation of a Levenshtein Automata and the Levenshtein Distance function
-
khat
A cat clone, nothing more nothing less
-
mdbook-indexing
mdbook preprocessor for index generation
-
is-vowel
Heuristically test whether a character is a vowel letter
-
yeslogic-unicode-script
Fast lookup of the Unicode Script property
-
explore
terminal file explorer
-
indentasy
Indent like a party
-
bge
Rust interface for BGE Small English Embedding Library
-
mdbook-preprocessor-boilerplate
Boilerplate code for mdbook preprocessors
-
mdbook-treesitter
mdBook preprocessor for html adding tree-sitter highlighting support
-
jp_utils
Utils for working with Japanese text
-
hfs_nfd
Handle Apple's unique NFD-like Unicode normalization, which is used in HFS+, in Rust
-
neuralassimilator
fine-tuning Language Learning Models (LLMs) from unstructured text
-
mask-text
mask text with multiple masking options
-
cfasttext-sys
fastText ffi binding
-
nmd-core
Official NMD compiler core
-
unicode-normalization-alignments
functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
cowonverter
owo-ify text files. provides a simple function for this purpose.
-
adc-lang
Array-oriented reimagining of dc, a terse stack-based esolang
-
slugify-rs
generate slugs from strings
-
nfa_regex
NFA regex engine for text processing
-
markov-text
creating a small markov model for text generation
-
mdbook-skill-tree
mdbook plugin to show roadmaps
-
enum-ts
TypeScript Enum pattern matcher codegen
-
crawdad
ChaRActer-Wise Double-Array Dictionary
-
texting
string helpers
-
write16
A UTF-16 analog of the Write trait
-
ruSTLa
A reStructuredText → LarST ⊂ LaTeX transpiler
-
bigstr
A command-line tool to make string BIG
-
tpt
Pure Rust implementation of the Unix concatenate (cat), word-count (wc) and echo command
-
snakecase
general purpose snakecase implementation supporting both ascii and unicode
-
text-diff
text diffing and assertion library
-
pulldown_typst
A pull parser for Typst markup
-
snekdown
A parser for the custom snekdown markdown syntax
-
aki-unbody
output first or last n lines, like a head and tail of linux command
-
codes-iana-charset
This package contains an implementation of the IANA Character Set registry
-
merge_pdf
Merge PDF files in a directory
-
no-comment
Remove rust-style line and block comments from a char iterator
-
sauropod
Stellular Shared Resources
-
vape
full width aesthetics
-
blitztext
fast keyword extraction and replacement in strings
-
pulldown-cmark-escape
An escape library for HTML created in the pulldown-cmark project
-
cp437_constants
Defined constants covering the entirety of ASCII code page 437 (https://en.wikipedia.org/wiki/Code_page_437)
-
event_parser
Rust NLP library for parsing English natural language into icalendar events
-
irg-kvariants
wrapper around kvariant from hfhchan/irg
-
rupantor
A Bengali Phonetic Parser which is very flexible and supports Avro Phonetic
-
confusables
around Unicode confusables/homoglyphs
-
mdbook-check-missing-md
A backend for mdbook which will find Markdown files you forgot in SUMMARY.md
-
emoji_converter
Converts text to emojis
-
ripmors
encoding and decoding international Morse code and several variants
-
clippers
Cross-platform clipboard management library
-
spellabet
Convert characters into spelling alphabet code words
-
json-peek
Amateur JSON parser library designed for my specific need
-
yara-x-parser
A parsing library for YARA rules
-
uwubot
discord bot for uwuifying text
-
markov_strings
A simplistic Markov chain text generator
-
dictcc
Rust API for reading and querying the dict.cc offline translation database
-
ascii-hangman-webapp
customizable Hangman game with ASCII-art rewarding for children (webapp version)
-
stylish-core
The semver-stable subset of stylish
-
rep-grep
wgrep/write-grep CLI
-
sejong
Buffer is a buffer that can receive ASCII bytes different from keyboard and send out UTF-32 Hangul string. This buffer allows deletion by Jamo.
-
butter
A personal demo project for fun
-
unindenter
unindent text
-
redpatterns
a list of patterns for scanners 📟
-
kirjat-rs
prices for finnish textbooks from multiple stores
-
asciifolding
ascii folding library
-
pdf-rename
This script reads a list of PDF files from a specified directory and renames each file based on its content. The renaming logic uses the content of the PDF to generate a more descriptive and meaningful filename.
-
naromat
Convert text to narou novel format
-
interslavic
in rust
-
compute-file-server-cli
Uploads files to Fastly for serving directly from within Fastly Compute applications. Upload any type of file: images, text, video etc and serve directly from Fastly. It is ideal for…
-
goodname
assist you with cool naming of your methods and software
-
mdbook-open-git-repo
mdbook preprocessor to add a open-on-git-repo link on every page
-
gecliht
A disparate collection of text manipulation and formatting algorithms
-
rs-wordcloud
☁ Flexible WordCloud implementation for ⚙Rust
-
parattice
Recursive paraphrase lattice generator
-
whitespace_text_steganography
A steganography strategy that uses whitespace to hide text in other text
-
fsays
flavored replacement for the classic cowsay
-
html-to-pulldown-cmark-events
Parse HTML to pulldown-cmark's events
-
jcalendar
Japanese Calendar for Rust
-
igpay-atinlay
Translate text to Pig Latin
-
aki-mcycle
mark up text with cycling color
-
azusa
String index transformer for Rust utf8 to JavaScript utf16
-
harfbuzz-sys
Rust bindings to the HarfBuzz text shaping engine
-
pulldown-html-ext
Extended HTML rendering capabilities for pulldown-cmark
-
cli-colors
A CLI tool for outputting text in ANSI format with features like colors, underlining, boldening, and italicizing
-
ctrl-z
A composable reader to treat 0x1A as an end-of-file marker
-
uiuifree-normalize
uiuifree text normalize
-
trie-match
Fast match macro
-
boringascii
Strings that can only be constructed to contain non-whitespace, non-control ASCII characters
-
collapse
Trim and collapse consecutive whitespace to a single space
-
veryfi
Module for communicating with the Veryfi OCR API
-
charmap
one-to-(none/one/many) character mapping
-
encoding_c
C API for encoding_rs
-
ftrace
trace files and paths
-
beautify
your terminal
-
lindera-ipadic-builder
A Japanese morphological dictionary builder for IPADIC
-
debug_canvas
Easily print complicated debug strings
-
streamparser
Adapt simple stream parsers for different types of streams
-
opentalk-types-signaling-meeting-notes
Signaling types for the OpenTalk meeting-notes module
-
monkey-printer
infinite nr of monkeys you could write Shakespeare
-
carnation
some string operators
-
playin-cards
♠ Library for French-suited playing cards
-
bmatcher
flexible and efficient binary pattern matching library designed to help you search and match binary data
-
rcut
replacement for GNU cut that supports UTF-8
-
mdx
in Rust
-
utf-cli
A UTF code viewer
-
wcrs
GNU wc in Rust
-
argot
Parse documentation from codebases into Markdown for easy doc creation
-
literumilo
A spell checker and morphological analyzer for Esperanto
-
ab-radix-trie
A compressed radix trie implementation supporting matching rules
-
pest_ascii_tree
Helper crates converting the parsing result of any pest grammar into an ascii tree
-
glifnames
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
whitespace-conf
Key-value configuration file delimited with whitespaces
-
match-pinyin-with-hanzi
Checks whether the sentence in Chinese characters (汉字) matches with the sentence in pinyin (拼音). Erhua is supported.
-
lf_lint
Linter that checks that all files in <PATH> end with a newline
-
ra_rustc_lexer
lexer used by rustc, zero stability guarantees
-
twjsrs
(De)serialize TiddlyWiki tiddlers to and from their JSON format
-
mdbook-nix-eval
mdbook preprocessor for evaluating nix expressions
-
unic-ucd-age
UNIC — Unicode Character Database — Age
-
uwuify
fastest text uwuifier in the west
-
group-similar
Group similar values based on Jaro-Winkler distance
-
assert-text
the testing macro tools
-
todo-to-issue
CLI tool that converts forgotten TODO comments into actionable GitHub issues
-
text_searcher
text searcher tool
-
small-bwt
BWT construction in small space
-
linkcheck
extracting and validating links
-
enso-lazy-reader
An efficient buffered reader
-
help_crafter
help message generator without hassle
-
mdbook-chapter-number
An mdBook preprocessor that adds chapter numbers to each page header
-
vec-string-to-static-str
providing utilities for converting vectors of Strings into vectors of &'static str
-
mdbook-twiki
twiki backend for mdbook
-
minigrepsada
Test
-
genex
Text-expansion library
-
popgetter-py
Python bindings for popgetter
-
esperanto-text
Convert Esperanto text between UTF-8, x-system and h-system transliterations
-
wattle
tools for rust
-
text-utils
Text utils for unescaping and align
-
spf
.spf (Simple Pixel Font) file parsing, and useful api's to go alongside
-
univstring
The Universal String(Rust String, CString, Wide(UTF-16/UTF-32) CString) trait
-
string_manip_rust
Demo of managing projects
-
timeblok
A language for event scheduling in plain text
-
mdict
Rust implementation of mdict
-
char_trie
Text segmentation based on trie tree, High performance, support for custom dictionary
-
rex-regex
A pedestrian library for regular expressions: Slow, but easy to understand
-
clippy_lints
A bunch of helpful lints to avoid common pitfalls in Rust
-
google-book-scraper
downloading the contents of books hosted on books.google.com for offline viewing
-
learning_fromdocs_minigrep
Project made on the path toward learning Rust from its docs. Completely personal and to test how to upload to crates.io.
-
mdbook-bibfile-referencing
An mdBook preprocessor to add bibfile referencing to each page
-
utfx
-
compact_str
A memory efficient string type that transparently stores strings on the stack, when possible
-
mitex
TeX2Typst converter
-
mail-internals-ng
[mail-api] _internal_ parts for the mail-api crates
-
terminal_cli
A standalone library with no-std support for command line terminal interfaces. With autocomplete support, helpers for commands and properties and a prompt implementation.
-
ascii-engine
A fast ascii engine written in rust!
-
mdbook-last-changed
mdbook preprocessor to add the last modification date per page
-
lexmatch
lexicon matching tool that, given a lexicon of words or phrases, identifies all matches in a given target text. Uses suffix arrays.
-
bnf_sampler
that uses recursive descent algorithm to ensure tokens produced by a large language model follow a Backus Naur Form schema
-
product-os-content
Product OS : Content provides a complete solution for content management for the purpose of serving content via Product OS : Server
-
swrite
Infallible alternatives to write! and writeln! for Strings
-
morse_code_japanese
handle Japanese Morse Code. (和文モールス符号)
-
founder
The package provides a font toolbox
-
mdbook-typst-math
An mdbook preprocessor to use typst to render math
-
divvunspell-bin
Spellchecker for ZHFST/BHFST spellers, with case handling and tokenization support
-
console-traits
Basic console traits, such as setting char position and writing strings
-
md2gemtext
for converting Markdown into gemtext
-
keyword-tools
Rust tools for keyword extraction and similarity search
-
remake
writing maintainable regex and managing symbol soup
-
ripgrep
line-oriented search tool that recursively searches the current directory for a regex pattern while respecting gitignore rules. ripgrep has first class support on Windows, macOS and Linux.
-
strange
A static website generator
-
markitdown
designed to facilitate the conversion of various document formats into markdown text
-
html_to_epub
A command-line tool that converts .html files to .epub files
-
changecase
A trait and implementation for changing the case of Strings and &str. It currently supports uppercase, lowercase, alternating case, and inverting case. Title case is in the works.
-
mdbook-numthm
An mdbook preprocessor for automatically numbering theorems, lemmas, etc
-
genpdf
User-friendly PDF generator written in pure Rust
-
lang-unicodes
Effortlessly convert and validate text across various Unicode ranges, supporting multiple languages for seamless internationalization in your applications
-
humnum
Human numeric sorting program: does what sort -h is supposed to do!
-
zw
encoding and decoding text using zero-width characters
-
sauron-markdown
parsing markdown into sauron node
-
emojicons-2021
Parse :emoji: notation to unicode representation
-
translit
Transliteration for cyrillic languages that uses officially approved rules
-
macro_colors
colorful printing macros
-
spongebob
convert text to spongebob case a.k.a tHe MoCkInG sPoNgEbOb MeMe
-
charwise
This lightweight, dependency-free rust library provides a convenient way to read characters from different resources
-
qpprint
console printing/formatting
-
stringsort
Pathological sorting of string characters
-
mle
The markup link extractor (mle) extracts links from markup files (Markdown and HTML)
-
wcounter
Give a word and count its appearances
-
ru-html-extractor
A universal web page main content extractor based on line block density distribution
-
mdbook-footnote
mdbook preprocessor for footnotes
-
noodler
A port of the python-ngram project that provides fuzzy search using N-gram
-
hashmoji_generator
Code generation tool for hashmoji
-
spacemod
An easy-to-understand and powerful text search-and-replace tool
-
pink-runtime
A very minimal replacement-based DSL, intended for math
-
mdbook-unlink
A mdBook backend that validates local links
-
veloci_levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
rscx-mdx
MDX support for RSCx
-
mdbook-plugin-utils
mdBook plugins
-
csv-groupby
execute a sql-like group-by on arbitrary text or csv files
-
bitranslit
Bi-directional transliterator for Rust. Transliterates strings according to the rules specified in the language packs.
-
transcript
A transcriber for European scripts
-
grace-cli
CLI tool for processing files and strings
-
wz
Count words, fast
-
fum
fum finds fuzzy matches to a literal search pattern, searching recursively through all the files in the current directory and respecting gitignore rules
-
hunspell-rs
Rust bindings to the Hunspell library
-
spongemock
CLI-tool for random text capitalization
-
basic_lib_for_me
just a basic lib that i will use usually
-
arbitrator
Format text based on a set of rules and regexes
-
kashida
Insert Kashidas/Tatweel into Arabic text, e.g. for justification purposes.
-
tantivy-czech-stemmer
Czech stemmer as Tantivy tokenizer
-
txt_to_md
Command converting from a txt file to a markdown file
-
mdbook-collector
A mdbook preprocessor which collects json/yaml/toml data from each post and builds a json file of it
-
cyrla
two-way conversion between latin and cyrillic script
-
matchpick
Find and replace multi-lines using a match-case
-
password-characters
help with the "enter the 12th, 35th, and 63rd characters from your password" situations
-
verba
working with Latin words
-
delay_writer
Wraps a writer and delays its output after each newline
-
lindera-ipadic-neologd-builder
A Japanese morphological dictionary builder for IPADIC NEologd
-
ripsecrets
A command-line tool to prevent committing secret keys into your source code
-
node-emoji
Convert :emoji: to Unicode using GitHub’s and EmojiDB’s emoji names
-
encoding-next
Character encoding support for Rust
-
yozuk-core-skillset
Set of default Yozuk skills
-
fastchr
Faster memchr using SIMD intrinsics
-
str_overlap
Methods for finding the overlap between two string slices
-
wtf8-rs
WTF-8 encoding
-
mdbook-bash-tutorial
A mdbook preprocessor that allows embedding Bash scripts as tutorials
-
mdbook-fix-cjk-spacing
mdbook preprocessor that fixes CJK line breaks
-
vader_sentiment
Bindings for Rust from the original Python VaderSentiment analysis tool
-
ogrep
searching in indentation-structured texts
-
alphabet-encoding
A way of encoding text
-
strmatch
Conditionally match strings in Rust using regex without much boilerplate
-
mdbook-svgdx
mdbook preprocessor to convert svgdx fenced code blocks into inline SVG images
-
splutter
create/generate text efficiently
-
uchr
translate char values to the characters themselves, and vice versa
-
wordpieces
Split tokens into word pieces
-
rammer
A Bayesian Text Classifier for Spam/Ham
-
strip_markdown
remove markdown syntax from markdown files
-
tectonic_xetex_layout
XeTeX's font loading and layout interface encapsulation, as a crate
-
hebrew_unicode_utils
Some functions for processing Hebrew unicode characters
-
arabic-script
An expressive API for the characters of the Arabic script
-
mdbook-files
Preprocessor for mdbook which renders files from a directory as an interactive widget
-
polyhorn-ios
Polyhorn for iOS
-
g2-unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
pdfrust
PDF parser
-
readput
Fast and easy stdin input parsing for competitive programming in rust
-
stylish-stringlike
API for string-like objects that have styles applied
-
afrim-translator
Manage the predication system of the afrim input method
-
merge-whitespace-utils
Procedural macros for merging whitespace in const contexts
-
unicode-display-width
Unicode 15.1.0 compliant utility for determining the number of columns required to display an arbitrary string
-
peppergrep
grep utility written following the 12th chapter of the Rust book. Some little modifications were made.
-
mpdf
writing PDF files
-
csv-sanity
Sanitize and transform large CSVs with millions of records quickly and efficiently
-
bocu1
BOCU-1 compressed unicode encoding
-
mdbook-image-size
A mdbook preprocessor which supports image size syntax
-
pulldown-cmark-fork
A pull parser for CommonMark
-
snake_case_converter
convert strings to snake case
-
rust_baht_text
Convert number to Thai Baht text
-
b2c2-flag
Just a thing that holds the command-line arguments for b2c2
-
glyph_brush_layout
Text layout for ab_glyph
-
trim
trim whitespaces from files
-
sc_decode
Special character decoding
-
tinytoken
tokenizing text into words, numbers, symbols, and more, with customizable parsing options
-
yeslogic-unicode-blocks
Functions to access and search Unicode blocks
-
lithe
A Slim template engine by using Pest
-
shoebill
A Wadler/Leijen style pretty-printer
-
kincaid
A word statistics library in Rust
-
spdkit-surface
short text for crates.io
-
unidecode
pure ASCII transliterations of Unicode strings
-
mdbook-auto-gen-summary
A preprocessor and cli tool for mdbook to auto generate summary
-
remove-markdown-links
Turns [example](https://example.com) into example. That’s it
-
jpreprocess-dictionary
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
syntastica-themes
A collection of themes for syntastica
-
skribo
low-level text layout
-
wcount
CLI word counting tool
-
unidok
A powerful, readable, easy-to-learn markup language
-
asimov-core
ASIMOV Software Development Kit (SDK) for Rust
-
boxy
Declarative builder for Unicode box-drawing characters
-
grep-reader
short text for crates.io
-
gpt4ocr
Extract structured text from PDFs using OpenAI's GPT4o
-
mdbook-infisearch
InfiSearch plugin for Mdbook
-
slack_update
app to set Slack status, emoji and photo
-
mutf8
Basic support for mutf8 strings
-
chisel-lexers
Chisel backend lexers/scanners
-
extract-words
Extracts words from text without allocation
-
justcsv
CSV-file reader/writer
-
tcalc-rustyline
A fork of Rustyline for use specifically with tcalc
-
shear
trimming excess contents from things
-
ocrmypdf-rs
A sdk for the ocrmypdf command line tool
-
ansi-to-tui-forked
convert ansi color coded text into ratatui::text::Text type from ratatui library
-
scalar-value-string
Scalar Value String
-
aki-stats
output the statistics of text, like the Linux wc command
-
polyvalid
A validator for package names, namespace, username and app name on wasmer
-
in_rainbows_printer
Prints some In Rainbows-style (the Radiohead album) text to your terminal
-
show-bytes
Display bytes as printable ascii with escape sequences as needed
-
llmvm-outsource-lib
outsource backend for llmvm
-
afrim-memory
handle of sequential codes easier for an input method
-
toml_to_table
pretty print TOML as a table
-
sixbit
Small packed strings
-
html-compare
compare html files
-
ruby-string
A string type that tracks Ruby glosses attached to parts of it
-
lindera-tantivy
Lindera Tokenizer for Tantivy
-
hxgm30-noise
Noise-generation for use in hexagram30 projects
-
korean_regex
Regex extension for Hangeul analysis
-
presciidoc
Preprocessing AsciiDoc for other tools
-
rex-regextract
extracts key value pairs out of text
-
inslice
A command-line utility for filtering text input by columns and rows
-
whitespace
Encode arbitrary data as whitespace and vice versa
-
fmt-extra
Extra formatters not shipped in the standard library
-
lindera-cc-cedict-builder
A Chinese morphological dictionary builder for CC-CEDICT
-
crypto-invert
Unicode Upside-Down Mapping
-
lingua-english-language-model
The English language model for Lingua, an accurate natural language detection library
-
yarner-block-links
A Yarner plugin that adds to each code block a list of links to all referenced and all referencing blocks
-
ascii-alphabetic-char
Traits for ASCII alphabetic characters
-
cur
that will hunt for your regular expression
-
mime_4
Strongly Typed Mimes
-
deepl-rustls
DeepL API - RusTLS Fork
-
tectonic_bridge_flate
Exposing flate compression to the Tectonic C code
-
opt2doc
generate documentation from Rust source code
-
leven-distance
Compute operational differences between two sequences using the Levenshtein algorithm
-
readwise
wrapper for the Readwise API
-
MNRTdWVdFmXoEUf
convert strings to snake case
-
syllable
counter for use with reading level calculations
-
whisperer
Encode text as short Chinese characters to evade censorship
-
render_readme
Render Markdown or reStructuredText with syntax highlighting and image filtering similar to GitHub's
-
timfmt
A small utility for formatting code as Tim likes it
-
aprilasr
High-level wrapper for the april-asr C api (libaprilasr) using aprilasr-sys
-
matchable
Check if text is matching against string or regex in one API
-
moscato
Outline scaler for OpenType glyphs
-
trim_lines
An extremely simple and tiny library which provides an iterator over the lines of a string, trimmed of whitespace. It is a simple wrapper around the Lines iterator in std::str which trims the whitespace from each line.
-
indoc
Indented document literals
-
struckdown
A structured markdown / commonmark library for Rust
-
typeline_ext_utils
operators for typeline
-
awabi
A morphological analyzer using mecab dictionary
-
like
A SQL like style pattern matching
-
diffy-fork-filenames
Fork of https://docs.rs/diffy that allows specifying filenames
-
spacebar
An anti-plagiarism tool based on null width characters
-
mdbook-embed
A preprocessor that simplifies embedded URL
-
agomesd_minigrep
A minigrep application built from the rust books
-
ascii-read
BufRead-like methods for reading into an AsciiString
-
recursive-file-loader
recursively load files via references in the files
-
lindera-ipadic-neologd
A Japanese morphological dictionary for IPADIC NEologd
-
transliterate1234
UTF-8 to ASCII transliteration
-
vibrato
viterbi-based accelerated tokenizer
-
mdbook-mdinclude
Preprocessor for mdBook that better handles including markdown files
-
braille_pics
producing text-art pictures using Braille characters
-
utf8_reader
A UTF-8 reader that reads UTF-8 characters from objects that implement the Read trait
-
unicode-intervals
Search for Unicode code points intervals by including/excluding categories, ranges, and custom characters sets
-
gzip-cmp
that makes a distance measurement between binary data based on the difference of the compressed data length
-
termwrap
Wrap Unicode text with ANSI color codes
-
genere
randomization of text respecting grammatical gender of sentences
-
varcon-core
Varcon-relevant data structures
-
owoify
Text owoification library
-
summertail
Summarize your tail -f output, by reporting how many lines match regexes
-
gfm-autolinks
Parse GitHub Flavored Markdown autolinks
-
cabocha
Safe Rust wrapper for cabocha, a Japanese language dependency structure analyzer library
-
rexpaint
functionality for reading and writing .xp files of the Grid Sage Games REXPaint ASCII art editor
-
resrap
A parser library
-
strfry
randomize a string
-
tiniestsegmenter
Compact Japanese segmenter
-
libxdiff
Rust bindings for the libxdiff C library
-
spellcheck_toy
a basic spellchecking library based on edit distance
-
runanum
Nouns with the correct endings after numbers
-
amongify
A very ඞ sus ඞ program
-
irssi-sys
Automatically generated bindings to irssi
-
indentation
Formatter
-
lucide-dioxus
Dioxus port of Lucide
-
buss-protocol
Bussin binary protocol
-
casaubon
A feature-poor static site generator
-
hina
:]
-
dequote
Remove nested quotes around text
-
zhlint
A linting tool for Chinese text content
-
mdbook-trunk
mdBook plugin which bundles packages using Trunk and includes them as iframes
-
ngrams
Generate n-grams from sequences
-
rustascii
Display Rust in ASCII
-
fancy-regex-fork-pb
A custom fork of the fancy-regex crate. You probably don't want to use this.
-
codes-iso-15924
This package contains an implementation of the ISO 15924 Codes for the representation of names of scripts specification
-
shutil
Shell utility helper library
-
marko
Programmatically format text with Markdown syntax
-
unicode_reader
Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters
-
esl01-renderdag
Render a graph into ASCII or Unicode text
-
encoding8
various 8-bit encodings
-
mul
Bengali stemmer
-
textos
Texts, strings, formatting, unicode…
-
chinese_segmenter
Tokenize Chinese sentences using a dictionary-driven largest first matching approach
-
text-template
Small template engine for use with plain text (e.g. creating text email), not intended for HTML.
-
text-sanitizer
convert text to plain ASCII text
-
uwildmat
wildmat/uwildmat matching
-
uniart
A CLI tool to convert images and gifs to terminal characters
-
stringsext
find multi-byte-encoded strings in binary data
-
ctf-brute
Brute-force utilities for Rust
-
pithy
Ultra-fast, spookily accurate text summarizer that works on any language
-
pangu2
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
sastrawi
stemming and stopword removal for Bahasa Indonesia based on PHP sastrawi project by Andy Librian
-
basic-text-internals
Basic Text string literal implementation details
-
genkit
A common generator kit for static site generator
-
vndb_rs
Bare-bones VNDB api
-
mdbook-rust
Enhanced Rust support for MDBook
-
serbzip
A quasi-lossless Balkanoidal meta-lingual compressor
-
tergo-tokenizer
R language tokenizer
-
translation-api-cn
Some useful structs for calling Chinese translation API cloud services. A helper tool for the bilingual cmdline tool.
-
subscript-compiler
A modern LaTeX rendition
-
askama-markdown-cmark
Askama filter for markdown, using pulldown-cmark
-
xconv
A high-performance batch file encoding conversion tool
-
modeling
tools to analyze different languages using Ctags
-
unicode_escape
decoding escape sequences in strings
-
unicode-jp
convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
gen3-charset
Pokemon Generation 3 Character Set Support (GBA)
-
fmty
Composable core::fmt utilities
-
brainfuck_interpreter
brainfuck interpreter written in Rust
-
rreplace
designed to streamline string replacements. It can handle multiple unique replacements and iterates the string only once.
-
morc
Dead simple, minimal markdown generator library written in Rust
-
xhtmlchardet
Character set detection for XML and HTML
-
unfurl
expanding links in text
-
shallow
long text
-
bpmf_py
A Bopomofo and Pinyin library
-
unic-ucd-normal
UNIC — Unicode Character Database — Normalization Properties
-
halfcaps
tRaNslAtE aNy TeXt To ThIs
-
senax-inflector
Pluralizes and singularizes English nouns. This crate is forked from https://github.com/whatisinternet/Inflector.
-
scripter
A screenplay compiler
-
syllarust
quickly counting syllables
-
chinese-ner
A CRF based Chinese Named-entity Recognition Library written in Rust
-
spoilers
High-level Rust bindings for CTranslate2
-
kanjidic_types
A collection of types encompassing the variety of data about kanji available from Kanjidic
-
pseudolocalize
Pseudolocalization crate for Rust
-
fuzzywuzzy
A pure-Rust clone of the incredibly useful fuzzy string matching python package, FuzzyWuzzy
-
floem-cosmic-text
Unofficial cosmic-text fork for Floem
-
uwl
A management stream for bytes and characters
-
trans-case
Transform case
-
stardict_wrapper
Rust implementation of stardict
-
kl-hyphenate
Knuth-Liang hyphenation for a variety of languages
-
word-star-puzzle-solver
A CLI tool for solving VGs word star puzzle
-
framework
detector for different frameworks in one project
-
rsonpath-test-codegen
Blazing fast JSONPath query engine powered by SIMD. TOML-based test codegen for rsonpath-lib.
-
kasedenv
Read environment variables by lower, upper case or case-insensitive keys
-
marcus
An experimental Markdown parser written in Rust
-
outerspace
Methods for prefixing and suffixing the non-whitespace characters in a string
-
zummi
fun lib that produces spoonerisms
-
make_it_braille
A CLI tool and library to manipulate unicode braille images
-
m_lexer
extensible regular expressions based lexer
-
mocker
transforming text with uppercase and lowercase letters, resembling the Spongebob Mock Meme
-
any2utf8
Convert any encoding to utf-8
-
lsp-document
Helpers to convert between LSP documents and Rust strings
-
mdbookshelf
Create epubs from a list of mdbook repositories
-
wildflower
Wildcard matching against strings
-
mdplayscript
An extension of Markdown for play scripts
-
gregex-logic
Logic for the gregex crate
-
meaningsearch
package that helps you find meaningful lines of any given input. Especially useful in CTFs.
-
bytepiece_rs
The Bytepiece Tokenizer Implemented in Rust
-
code-span
Add additional information to code character
-
mdbook-playscript
Preprocessor for mdBook, which styles stage play scripts
-
pretok
A string pre-tokenizer for C-like syntaxes
-
pdf_composer_definitions
PDF Composer definitions crate
-
fifthtry-mdbook
fork of mdbook, only for ft-cli
-
varcon
Source Code Spelling Correction
-
text_styled
Trait to style String and &str
-
mdbook-shiftinclude
mdbook preprocessor for file inclusion with shift
-
zawk
An efficient Awk-like language implementation by Rust with stdlib
-
md-localizer
Localize markdown with remote links
-
bashdoc
generating documentation/help menu for user defined bash functions
-
read_chars
An iterator over characters read from some I/O source
-
ghimlink
Generates a Github-Flavoured-Markdown formatted image link to an image in the same repository
-
unicode-matching
match open/close Unicode graphemes for brackets/quotes
-
str-block
Remove the initial newline and common indentation from string literals
-
hunspell-sys
Bindings to the hunspell C API
-
print-positions
providing string segmentation on grapheme clusters and ANSI escape sequences for accurate length arithmetic based on visible print positions
-
mqtt-analyzer
Help analyzing MQTT messages
-
autoruby-cli
CLI to easily generate furigana for various document formats
-
regex-automata
Automata construction and matching using regular expressions
-
forgiving-htmlescape
HTML entity encoding and decoding, with support for leaving malformed entities intact
-
catmark
Console printer for CommonMark
-
iver
-
finl-charsub
Character substitution module for in-line replacement of fixed strings
-
mdxbook
Fork of mdBook, with more customizations and flexibility for programmers
-
kakasi
Romanize hiragana, katakana and kanji (Japanese text)
-
md-include
include any file in markdown files
-
pomsky-macro
Macro for converting pomsky expressions to regexes
-
libflagup
Display a country's flag as an emoji
-
mdbook-translation
prepare multi-lingual mdBook books
-
hitori
Generic compile-time regular expressions
-
highlight-pulldown
Process pulldown-cmark events to apply syntax highlighting to code blocks
-
my_project_parser_super_puper
A brief description
-
ezemoji
Categorized emojis
-
pdf-min
Very minimal crate for writing PDFs
-
chardet
rust version of chardet
-
roxy_markdown_parser
Roxy plugin for parsing Markdown
-
hulk
An ultra simple no hassle static site generator
-
lindera-cc-cedict
A Chinese morphological dictionary for CC-CEDICT
-
gqlog
👾 filter your json logs with graphql 👾
-
tectonic_bridge_icu
Expose a subset of the ICU Unicode APIs to Rust/Cargo
-
text-to-json
Convert text to json in rust
-
strip-tags
Strip HTML and PHP tags from strings
-
destiny_rs
An async library to interact with Bungie's destiny 2 api
-
pillar
small tool to format lines into columns
-
re_view_text_log
A view that shows text entries in a table and scrolls with the active time
-
case-conv
Faster case conversion crate
-
minigrep_jamhaze
minigrep tool from the Rust book chapter 12
-
untex
Understand and manipulate TeX files with ease
-
bible-reference-rs
Extract Bible references from plain text
-
pygmentize
wrapper for syntax highlighting
-
sparklet
small flashcards library
-
igo-rs
Pure Rust port of the Igo, a POS(Part-Of-Speech) tagger for Japanese (日本語 形態素解析)
-
ewts-c
Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (c lib)
-
allsorts_no_std
Font parser, shaping engine, and subsetter for OpenType, WOFF, and WOFF2
-
unicode-canvas
creating text base drawing
-
file-search
File indexing and search
-
wtf8
WTF-8 encoding. https://simonsapin.github.io/wtf-8/
-
wordfreq
port of wordfreq for looking up the frequencies of words in many languages
-
character-stream
Helper data structures for reading UTF-8 characters from a stream
-
emoji-printer
Replace emoji shortcodes in string with emoji unicode (":sushi:" -> 🍣)
-
koelner-phonetik
koelner_phonetik or cologne phonetics is a phonetic algorithm like soundex, but specialized for german words
-
encoding-next-index-simpchinese
Index tables for simplified Chinese character encodings
-
mdbook-morsels
Morsels plugin for Mdbook
-
mdbook-checklist
An mdBook preprocessor for generating checklists and indexes
-
leven
Make your own blog!
-
utf
UTF-8
-
rmw-utf8
Short text compression algorithm for UTF-8 (optimized for Chinese, developed in the Rust programming language).
-
bottomify
Fantastic (maybe) CLI for translating between bottom and human-readable text
-
kvu
The simplest command line tool to manage key-value pair lines
-
ed_join
Implementation of Ed-Join Algorithm for string similarity join
-
czv
performing CSV-related operations for data engineering and analysis
-
tradukisto
Kinda useful natural language translation library and utility
-
mdbook-iced
An mdBook preprocessor to turn iced code blocks into interactive examples
-
moenarchbook
Creates a book from markdown files
-
trigram
Trigram-based string similarity for fuzzy matching
-
skyspell_core
skyspell core library
-
mdbook-quiz-validate
Input validation for quizzes used in mdbook-quiz
-
glyph-names
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
tagalyzer
A CLI tool to gather statistics on collections of plaintext-adjacent files
-
flw
Process text via configurable tasks
-
jellybean
Syntax highlighting with tree-sitter. Sweet colors.
-
pencil_case
but effective library for writing GUI in bevy
-
b2c2-stat
Apparently used here and there for debugging
-
jput
puts and putc on unicode-width align for Rust
-
rescue-blanket
Escape values while they are being formatted
-
yozuk-sdk
Types used in the Yozuk ecosystem
-
crustword
Crusty Crosswords
-
br-pdf
PDF Invoice Processing
-
mdbook-fishextract
A mdbook preprocessor which handles mermaid graphs, offline, requires mmdc
-
ucd
Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…
-
sydney
Vim-like, Command-line Gemini Client
-
jg
Jeff Goldblum (jg) is a command-line JSON processor. jg searches for structural patterns in json input and prints each json object that matches the pattern.
-
omgwtf8
Optimized-Matching-Generalized Wobbly Transformation Format — 8-bit
-
korrektor
work with Uzbek language text processing
-
cutters
Rule based sentence segmentation library
-
bionic-ebooks
Takes an EPUB file and generates a copy with a bionic-like font applied
-
platina
parameterized golden testing library
-
mon-artist
Render ASCII art to SVG, with user-customizable rendering rules
-
bullshit-rs
A BullshitGenerator implementation in Rust
-
difference
text diffing and assertion library
-
jpreprocess-jpcommon
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
minigrepns
A mini version of famous grep application that searches texts on files
-
const-utf16
Utf8 to utf16 conversion functions for use in const contexts
-
utf8_slice
Lightweight UTF8 Slice Utilities
-
llmvm-core-lib
llmvm core application
-
fancy_mocker
transforming text into alternating uppercase and lowercase letters, resembling the Spongebob Mock Meme
-
ron_to_table
pretty print RON as a table
-
ultron
A web base text-editor with strong focus on displaying multi-width characters correctly
-
tweak
when/then clauses to run
-
quill_delta_pdf
Convert Quill Delta to PDF
-
dr
Command-line data file processing in Rust
-
neardup
near-duplicate matching
-
seven_seg
Seven-segment digital display for terminal
-
gst-plugin-textwrap
GStreamer Text Wrap Plugin
-
saurus
Markdown to LaTeX Transpiler
-
supply-chain-trust-example-crate-000068
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
scie
research about how to build simple code identify engine for different languages
-
hashtag-regex
regex matching hashtags according to the unicode spec: http://unicode.org/reports/tr31/#hashtag_identifiers
-
aprilasr-sys
Low-level FFI bindings for the april-asr C api (libaprilasr)
-
romulus
a stream editor like sed
-
yarner-lib
build plugins for the Literate Programming tool Yarner
-
utf8-command
UTF-8 encoded std::process::Command output
-
rnltk
Natural Language Toolkit for Rust
-
darn
Extras for the Book on Data Analysis with Rust Notebooks
-
text_to_emoji
Convert text to emoji
-
rust_lemmatizer
A lemmatizing package for use with a .csv dictionary of lemmas and their corresponding words
-
mdbook-open-gh-issue
mdbook preprocessor to add an open-on-github link on every page
-
cowsay
program that generates ASCII art pictures of a cow with a message
-
pcre2
High level wrapper library for PCRE2
-
detect-indent
Detect the indentation of code
-
fmtm_ytmimi_markdown_fmt
Fork of @ytmimi's Markdown formatter; powers FMTM
-
seq2xypic
Turn a text sequence diagram into a LaTeX xypic diagram
-
fast2s
A fast Traditional Chinese to Simplified Chinese conversion library. Built with FST, faster than most of other libraries.
-
grepox
Minimalist's grep written in Rust
-
wiki_corpus_grammar
Extract text from Wikipedia dumps (.bz2) and convert it to JSONLines format
-
readability-rs
Port of arc90's readability project to rust
-
ucd-util
A small utility library for working with the Unicode character database
-
bibliofile
A TUI epub reader inspired by DOS-era programs
-
bbcode-tagger
BBCode tree parser and tagger
-
kanpyo-dict
Dictionary Library for Kanpyo
-
re2
Wrapper for the re2 C++ regex library
-
stringutils
A collection of various and (hopefully) useful String utility functions
-
const_format_proc_macros
detail of the const_format crate
-
ccase
Command line interface to convert strings into any case
-
github-slugger
A slugger for GitHub headings
-
august
& program for converting HTML to plain text
-
hebrew
alephbet primitives and parsing library for rust
-
unicode_names2_generator
Generates the perfect-hash function used by unicode_names2
-
kanpyo
Japanese Morphological Analyzer
-
lingua-german-language-model
The German language model for Lingua, an accurate natural language detection library
-
comment-strip
Remove comments out of text files
-
rcut-lib
rcut is a Rust replacement for GNU cut that supports UTF-8
-
single_source
Generate code files from snippets in md tutorial files
-
word_filter
A Word Filter for filtering text
-
is_utf8
functions to determine if a sequence of bytes is valid utf-8
-
ende
encoding/decoding unicode/utf-8/utf-16(ucs-2) code points
-
cw
Count Words, a fast wc clone
-
vaporetto_tantivy
Vaporetto Tokenizer for Tantivy
-
static_format
Format strings with no runtime overhead
-
old_icelandic_zoega
Old Icelandic dictionary for Rust. From "A Concise Dictionary of Old Icelandic" by Geir Zoëga
-
chinese2digits
The Best Tool of Chinese Number to Digits. A useful tool in NLP and robot project.
-
cermic
Print .png images in your terminal as ascii art!
-
topfew
CLI to find high frequency occurrences in structured text files
-
csvre
replacing data in CSV columns with regular expressions
-
ocr_b_checksum
Generates OCR B Checksums
-
haoxue-dict
Chinese dictionary and word segmenter
-
zp
Copy the contents of the source file or the std output buffer to the clipboard, allowing users to easily paste the contents into another file or program
-
intname
Full English name for any integer of any primitive integer type
-
rustex
auto-generated LaTeX files in Rust
-
clparse
A command line tool for parsing CHANGELOG.md files that use the Keep A Changelog format
-
patterns
no_std compatible pattern scan library using simd
-
text_distance
A collection of approximate string matching algorithms
-
quilltex
open-source Rust library designed to convert LaTeX documents into a Delta format that can be used with Quill.js and vice versa
-
literate
programming tool that extracts code written in your Markdown files
-
encoding_rs_transcode
Transcode text within writers using encoding_rs
-
bibutils-sys
Rust bindings for bibutils, a program for bibliography format interconversion
-
string-box
Create Rust string from UTF-8 string, byte string or wide string
-
slidedeck
Create an HTML slide deck from Markdown
-
latin1str
Windows-1252 string types
-
gen-epub-book
Generate an ePub book from a simple plaintext descriptor
-
lindera-ipadic
A Japanese morphological dictionary for IPADIC
-
markdown-linkify
Markdown preprocessor for substituting link shorthands to valid links according to configurable regexes and custom substitution implementations
-
demoji
Remove all emojis from a string
-
szovegertesimutato-score
Calculate szovegertesimutato score for a given text and language
-
caribon
A repetition detector program and library
-
corg
A cog-like tool
-
charasay
The future of cowsay 🐮! Colorful characters saying something 🗨️
-
metatensor-sys
Bindings to the metatensor C library
-
kaomoji-ru
The full http://kaomoji.ru database, encoded and escaped as byte literals
-
ansi-width
Calculate the width of a string when printed to the terminal
-
mediawiki_parser
A strict parser for MediaWiki markdown
-
repocat
concatenate all code and text files in a github repo for LLM inference contexts
-
nipdf
A PDF library written in Rust
-
markdown-table
Creating markdown tables with Rust!
-
bebop-lang
LISP based Markdown preprocessor for creating html documents!
-
mr-regex
Minimalist ASCII Regex-engine with 300 lines of Rust
-
yitizi
異體字查詢 Get variant Chinese characters
-
harfbuzz-traits
Rust Traits for the HarfBuzz text shaping engine
-
darts
A double array trie, A Forward Maximum Matching Searcher
-
basen
Convert binary data to ASCII with a variety of supported bases
-
encoding-index-simpchinese
Index tables for simplified Chinese character encodings
-
markovish
Markov chain implementation for text generation
-
ansi-cut
cutting a string while preserving colors
-
textalyzer
Analyze key metrics like number of words, readability, and complexity of any kind of text
-
morse-nostd
A nostd version of the morse crate
-
word_search_solver
Solves a word search
-
regexnight
Command-line tool to print syntax-highlighted versions of regular expressions and spot errors
-
vaporetto_rules
Rule-base filters for Vaporetto
-
trunc8
Truncate text to a specific line length, based on a number of parameters
-
yozuk-helper-english
English NLP utilities for Yozuk
-
cattocol
Combine two texts into one text as columns
-
didaskalia
over-complicated language to store text entries and sequences across different languages
-
typeline_ext_python
python integration for typeline
-
guarding_parser
Guarding is a guardian for code, architecture, and layering. Guarding creates an architecture guard DSL based on ArchUnit.
-
mdbook-newday
Command line utility to add a new line for today to the top of an mdbook SUMMARY.md book
-
informa
font compiler for digital typography
-
ced
Dead easy csv editor
-
mdbook-chapter-zero
A mdBook preprocessor that allows 0th (sub-)chapter
-
mdbook-snips
Markers for hidden lines in rust blocks within an mdbook
-
ae-renderable
handling individual text characters that can be rendered on the screen. Intended for use in game development.
-
ewin-com
editor for Windows (GUI) users. No need to remember commands
-
rew
A text processing CLI tool that rewrites FS paths according to a pattern
-
mdbook-numeq
An mdbook preprocessor for automatically numbering centered equations
-
tpaint
Terminal based unicode and ascii diagrams and drawings
-
bookkeeping
Minimal, safe and flexible in-memory bookkeeping API
-
grader
Stream-based CLI for binary sorting text files via a given shell command
-
ptero-cli
A text steganography CLI tool for Social Media
-
wkhtmltopdf
High-level bindings to wkhtmltopdf
-
base256_lib
Base256 encoder/decoder
-
bbd-lib
Binary Braille Dump
-
owlz
"Owlz" ascii emoojis, created randomly or by design
-
t9
Easily create port number from words and vice versa
-
markdown-composer
composing markdown documents
-
latex_snippet
Convert even erroneous LaTeX snippets into HTML
-
blockcounter
Counts the blocks in a stream
-
cmark2tex
A small utility to convert markdown files to pdf exploiting tectonic
-
lindera-decompress
A morphological analysis library
-
mdtransform
program that renders Markdown as HTML for websites
-
quoted-string-parser
Quoted string parser for grammar defined in RFC3261
-
termbook
behind the termbook-cli
-
color-convert
Converts between RGB, RGBA, HEX, HSL, HSLA, HSV, and CMYK; written in Rust
-
lines_lossy
extension to BufRead with a function lines_lossy that works like BufRead::lines but with lossy UTF-8 decoding
-
am-parse
converting asciimath to other formats
-
conllu-utils
working with the CoNLL-U dependency format
-
dhoni
converting Bengali text into their phonetic counterpart
-
chisel-parsers
Chisel parser front ends
-
mdbook-asciidoc
mdBook backend for AsciiDoc generation
-
encoding_c_mem
C API for encoding_rs::mem
-
fontconfig-rs
Safe, higher-level wrapper around the fontconfig library
-
ayda
Ask your Documents Anything. A tool for querying your documents with a large language model.
-
text2art
translate text to ascii art
-
vcg
manipulating verification conditions
-
jpreprocess-window
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
summary
Extract the sentences which best summarize a document
-
aklat
create books from markdown files (like Gitbook)
-
minigre_base
text file search tool
-
deepfrog
A deep learning NLP suite (PoS,lemmatiser,NER) with FoLiA XML support
-
ssml-parser
parsing speech synthesis markup language
-
nb2fr
Convert integer number to literal French text
-
repub
convert markdown documents to epub
-
bookgrep
Basic grep equivalent, minor mods to Chapter 12
-
rustextile
Textile markup language parser for Rust
-
rahat3062_minigrep
A light-weight & minimal implementation of the grep cli app
-
lithe-cli
A cli of lithe
-
bos_books_codes
that handles 3-character Bible Books Codes
-
cha-rs
Extract specific characters from an input
-
fst-subseq-ascii-caseless
An automaton that matches if the input contains a specific subsequence ignoring ASCII case to be used with fst
-
mdbook-tagger
A mdbook preprocessor to collect and build tag data in Json format
-
regex-cli
A command line tool for debugging, ad hoc benchmarking and generating regular expressions
-
mdbook-hide
A preprocessor for mdbook that adds support for hidden chapters
-
sola
A 16-bit stack machine and forth language
-
kradical_static
Ready-to-use EDRDG radical decompositions
-
mrdocument
Automatic PDF transcription and classification via OpenAI
-
economic_indicator_finder
A finder for extracting economic indicators from paragraphs
-
porter-stemmer
Flexible and unicode friendly, Porter stemmer implementation
-
syntaxdot-encoders
Encoders for linguistic features
-
translitrs
Transliteration utility for Serbian language
-
mdbook-webinclude
Preprocessor for mdBook that includes content from URLs
-
litegrep
A basic tool for searching in files for lines of text, based on a query
-
react-html
react html components for frender
-
anagrambot
find anagrams of words
-
trevordmiller
Personal CLI
-
spandex-hyphenation
Knuth-Liang hyphenation for a variety of languages
-
norm-email
strip email provider defined behaviour from email addresses
-
moenster
mønster (n) - pattern. simple glob-style pattern matching for strings
-
find_unicode
Find Unicode characters, the easy way!
-
math-text-transform
Transform greek letters, latin letters, or decimal digits into certain variants from the mathematical alphanumeric symbols Unicode block (U+1D400–U+1D7FF). For example to bold, italic, script or double-struck.
-
mepple
English dictionary as a library
-
paperoni
A web article downloader
-
webreg
A CLI tool for testing regexes against web pages
-
mdbook-tectonic
An mdbook backend for generating LaTeX and PDF documents
-
nipah_tokenizer
A powerful yet simple text tokenizer for your everyday needs!
-
aki-txpr-macro
the easier-to-use libaki-*
-
wordbreaker
A Unicode-aware no_std crate (requires alloc) that rapidly finds all sequences of dictionary words that concatenate to a given string
-
deliminator-core
Universal code documentation generator
-
kansuji
Library for converting between kanji numerals and digits
-
mdbook-obsidian
mdBook preprocessor to render Obsidian specific syntax
-
textcat
detect text categories. It can be used to detect the language of a given text
-
text-tables
A terminal/text table prettifier with no dependencies
-
minbpe
Port of Andrej Karpathy's minbpe to Rust
-
toml_document
Document-style TOML parser. Allows editing TOML document in a way which preserves whitespace and comments.
-
eliza
natural language processing program developed by Joseph Weizenbaum in 1966
-
base_emoji
Convert everything to Emojis (and back)! - 🐫🔦🔥🎲🐬🍀🍟🔋🐬🎲🐬🍀🎁🍟💧👂🔥🚪🔋🍟🔦🔋🚗👣🔦🌍👂🍤🐗
-
b2c2-jis-x-201
Rough conversion between UTF-8 and JIS-X-201
-
every-range
Iterator for interleaving included and excluded ranges
-
markdown2unicode
Converter from markdown notation to unicode characters
-
escaped-delimiter
Iterator of delimited slices with escape characters
-
rosie
Interface for the Rosie Pattern Language, for efficient and maintainable text pattern matching and search
-
lindera-filter
Character and token filters for Lindera
-
hemtt-tokens
A token library for hemtt
-
token-read
reading whitespace delimited files intended for competitive programming
-
konbaato
Conversions between different Japanese scripts
-
web-grep
A Grep Tool for HTML or XML
-
mdbook-reference-table
mdBook preprocessor to create reference tables
-
highlights
Convert kindle highlights into different formats
-
vidyut-chandas
A Sanskrit metrical classifier
-
vl-convert-pdf
convert SVG to PDF with embedded text
-
bookrafter
This repository contains code related to bookrafter rendering
-
spongedown
Converts markdown to html with svgbob support
-
xim-ctext
compound text en/decoder
-
spinning_cube
Perspective projection of a spinning cube, using just ASCII characters
-
aki-json-pick
The json pick out command
-
yozuk-model
NLP model generator for Yozuk
-
terminal-supports-emoji
Check whether the current terminal supports emoji
-
textblocks
working with text blocks
-
unidades
playing with physical units
-
lexicmp
comparing and sorting strings lexicographically and naturally
-
retest
Command-line regular expression tester
-
pangu
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
jp-location-relation
Get a list of adjacent municipalities
-
uniwhat
Display the unicode characters text
-
datex
package to automatically process your data and make it look pretty in LaTeX
-
rust-cedar
efficiently-updatable double-array trie in Rust (ported from cedar)
-
static_table
creates pretty tables at compile time
-
matrix_display
display a matrix in command line with pretty unicode separators
-
typos-vars
Source Code Spelling Correction
-
xsv
A high performance CSV command line toolkit
-
encoding-next-index-tradchinese
Index tables for traditional Chinese character encodings
-
gret
command line tool to search for patterns and show matches in a tree structure
-
trexter
Text progression tracking library
-
mdtranslation-cli
Command-line tools for using mdTranslation, which can be used to prepare multi-lingual Markdown documents
-
ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched
-
corpus-preproc
A preprocessor for text and HTML corpora
-
cmdcjones_minigrep
A minimal grep clone from the Rust Book
-
japhonex
Japanese phone number checker for Rust
-
h_hangul
Korean Characters
-
ergrep
grep strings within a line from a text file
-
perspicuity_formula
Calculate Flesch Reading Ease for a given text and language
-
mdbook-latex
An mdbook backend for generating LaTeX and PDF documents
-
local-encoding
encoding/decoding strings with the local charset. It is useful for working with ANSI strings on Windows.
-
case_style
Converts from and to different case styles
-
hline
a grep-like tool that highlights lines in files
-
spyglass
Search engine for documents, inspired by bioinformatics
-
minigrep_macano953
A fun grep tool written as my first incursion into Rust. Not intended to be inherited and used in production code
-
roxy_syntect
Roxy plugin for providing syntax highlighting via Syntect
-
contractions
expand contractions in English
-
noneifempty
Convert T to None if empty
-
lingua-kazakh-language-model
The Kazakh language model for Lingua, an accurate natural language detection library
-
bitfont
Takes an ASCII string and generates a vector containing a bitmap font, for easy overlay into images
-
tb_normalization
UTF-8 string normalization, including diacritic removal, for Vietnamese and some other languages
-
readable-readability
Really fast readability
-
LatexImgGen-lib
generating Latex images of equations with an edition number to avoid cache flushing and timestamp attributes
-
crudite
A JSON CRDT
-
jellybean-pack-0
Sweet syntax highlighting with tree-sitter
-
x1b
State tracker for 0x1b terminal escape codes. Also gives random characters names.
-
publish_demo_node_1
This program picks a random christmas emoji for you
-
beary
augment Bear, the note-taking app
-
mdtohtml
markdown to html renderer (with a couple of missing features)
-
strings
String utilities, including an unbalanced Rope
-
hsk
Return HSK Level for Simplified Chinese Characters
-
mdbook-mathpunc
An mdbook preprocessor that prevents line breaks between inline math blocks and punctuation marks when using katex
-
ammonia
HTML Sanitization
-
tfidf-summarizer
Basic tf-idf compute for documents
-
poetry-book
Create a poetry book in latex, starting from plain text
-
grep-rezza
An app for searching text in a file
-
iterlower
Final-sigma-correct lowercasing iterator adapter with option for Turkish/Azeri I behavior
-
encoding-index-korean
Index tables for Korean character encodings
-
md_converter
My markdown to html converter
-
text_alignment
Align your text in Rust in the CLI
-
minigreper
Small grep style cli from the book
-
grepy
search files
-
charisma
Generate ASCII Art using AI 🦾🧠
-
cn-font-split
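An epoch-making font splitting tool for CJK and any characters! Supports multithreaded splitting of otf, ttf, and woff2 fonts, with fine-grained control over package size. A revolutionary font subsetter that supports CJK and any characters…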
-
transcoding_rs
Converts text encoding the easy and efficient way
-
cli-animate
Enrich your Rust CLIs with stunning, easy-to-implement animations
-
my_mini_grep
A mini application that aims to replicate the behavior of the grep shell command
-
sudachiclone
sudachiclone-rs is a Rust version of Sudachi, a Japanese morphological analyzer
-
hashlogs
Command-line utility that hashes the part before a space on each line from stdin with blake2b keyed with an ephemeral randomly-generated key and writes to stdout
-
tex
The νTeX typesetting engine
-
diff_match_patch
Diff Match Patch is a high-performance library in Rust-lang that manipulates plain text
-
mdbook-post
A CLI for adding posts to mdbook
-
mojimoji-rs
fast converter between Japanese hankaku and zenkaku characters, mojimoji
-
morph-rs
Dictionary Morphologizer for Russian language
-
unicode-line-stacker
Tiny library for stacking Unicode line-drawing characters on top of each other
-
falcom-sjis
Falcom-compatible Shift JIS implementation
-
asciimath-text-renderer
Render asciimath in terminal
-
encoding-index-tradchinese
Index tables for traditional Chinese character encodings
-
html2runes
An HTML to Text converter
-
emojito
Find all the Emoji in a string. Supports composed emoji.
-
top-english-words
retrieve top words from the English language
-
typeline_ext_sqlite
sqlite integration for typeline
-
mdbook-compress
Compress an mdBook project into a single PDF file
-
simple-word-count
word count function that tries to match the results of the Microsoft Office Word application
-
iconv-compat-win-sys
iconv bindings for Rust
-
simplecc
Chinese Convert library (partially) compatible with OpenCC's dictionaries
-
hex_table
Format a byte slice into a hex table
-
glyphana
Quickly find, inspect & collect unicode glyphs
-
indexrs
inefficient multi-language search index
-
base_u256
base-u256 is to utf-8 as base-64 is to ascii
-
ascii-to-hex
A small, simple library for converting an ASCII text string into its hexadecimal equivalent
-
dismej
convert text to discord emojis
-
icu_casemapping
Unicode case mapping algorithms
-
hoedown
bindings for the Hoedown markdown processor
-
rmgrep
rust min grep text tools
-
ascii_utils
handle ASCII characters
-
anagram
A collection of anagram utility functions
-
unicode_categories_zheyuye
Query Unicode category membership for chars
-
cjieba-sys
unsafe ffi to cppjieba
-
kaomji-ru
The full http://kaomoji.ru database, encoded and escaped as byte literals
-
markdown-table-formatter
Markdown table formatter fully compliant with Unicode 15.1.0
-
geml
Generator-orientated ML parser
-
decline-word
Choose word form based on given number
-
lexer-generator
Lexer derived from Regex patterns with user customizable tokens
-
kryptor
command-line cryptography tool
-
alpino-tokenizer
Wrapper around the Alpino tokenizer for Dutch
-
route-pattern
A parser and matcher for route patterns
-
atm_parser_helper_common_syntax
Parsing utilities for the common syntax specification
-
jpreprocess-naist-jdic
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
zalgo-text
A command line tool for generating zalgo text
-
oneparse
An all-in-one simple parser framework
-
markdown_to_html_parser
parses Markdown syntax into HTML
-
fancy_print
Easily print animated ASCII text to the terminal!
-
pdf_composer_base
PDF Composer base functionality crate
-
table_to_html
interface to convert a tabled::Table into an HTML table (<table>)
-
ttf_word_wrap
Wraps text based on character width
-
mystem
Wrapper around Yandex Mystem for Rust
-
rusty_code_code_for_book
my book_rusty code
-
mdbook-to-github-wiki
Turns an mdbook book into a github wiki
-
html_to_pdf_lib
converting HTML to PDF
-
markx
markdown parser
-
ddvm
Document to Document Virtual Machine
-
naming_clt
Extract and convert the naming format (case|notation) of identifiers from files or stdin. Use this tool to prepare identifier name strings for further operations (matching, replacing...) on related files
-
rsplitter
Splitter is a tool to split words from text that contains no spaces
-
textr
TeX-inspired plug-n-play interface for converting JSON documents into PDFs
-
asciify
converting images to a readable format on the command line
-
rckive-genpdf
User-friendly PDF generator written in pure Rust
-
stardict_lib
Rust implementation of stardict
-
jp_inflections
A small crate to conjugate Japanese words
-
milligrep
Custom simplified implementation of grep
-
mdbook-multicode
Allows you to give multilanguage code examples, toggled by a spinner
-
tpng
A small tool that prints truecolor png renderings to the terminal using unicode block characters
-
libgrep-rs
searching through text
-
spellcheck
A spell-checker based on the statistical algorithm described by Peter Norvig in http://norvig.com/spell-correct.html
-
dumbfuzz
dumb library for fuzzy search
-
rep-cli
Replace text file in bulk
-
justify
plaintext while handling Unicode gracefully
-
nib
static site generator
-
pdf_form
programmatically filling out pdf forms
-
vextractor
vextractor-cli
-
encoding-index-singlebyte
Index tables for various single-byte character encodings
-
hex_d_hex
HexDHex is a Rust Crate that encodes and decodes byte data to and from its hexadecimal representation. For instance, one may wish, on occasion that is, to translate a utf8 or ASCII string…
-
math_test
Generates math questions for elementary school kids
-
stfu
Shut The Ferris Up - profanity filtering for Rust
-
deface
Lightweight markup to HTML converter
-
kanabake
Turn Romaji strings into Hiragana or Katakana
-
esc
Escape characters in strings
-
texrender
Thin wrapper around running latexmk to render LaTeX documents. Also supports generating TeX documents.
-
wordninja
port of the Word Ninja English word splitting library
-
font-index
Static index for system fonts
-
strng
Owned UTF-8 string
-
minigrep5
grep implementation in Rust
-
charclass
define and modify unicode character classes
-
exit-with-code
A CLI which returns the exit code of your choice
-
charjpoet
Charj Poet is an API for writing to the .cj language
-
rut
A small UTF-8 parsing library for applications that need to parse individual chars
-
txtframe
Creates a frame for text
-
lingua-slovene-language-model
The Slovene language model for Lingua, an accurate natural language detection library
-
publish_demo_node
This program picks a random christmas emoji for you
-
simplearrayhash
-
chars_data
Build-dependency for chars, the unicode character information CLI
-
tablers
create tables and export them into CSV or PDF. Without external deps
-
panduck-latex
Use panduck to generate XeLaTeX
-
lindera-compress
A morphological analysis library
-
ucd-parse
parsing data files in the Unicode character database
-
cologne_phonetics
generate phonetic cologne codes for utf8 strings
-
unicode_converter
CLI tool to convert data between various Unicode encodings
-
lyn
A scanner useful for creating recursive descent parsers
-
rigrep
grep from Rust Book
-
newline-converter
Newline byte converter library
-
utf8reader
wrapper around Reader that returns a stream of UTF-8 characters
-
chanoma
Character normalization library. A library for string normalization processing.
-
equt-md-ext
Extend event iterator
-
gestalt_ratio
Calculate the gestalt pattern matching ratio between two strings
-
names-changer
Convert names of SQL schemas from camel case to snake case
-
invisible_unicode
finding invisible unicode characters
-
smart-date
Parses relative and absolute dates in natural language text
-
minigrep_desonglll
grep implementation from The Rust Programming Book
-
nutrimatic
Tools for reading Nutrimatic (https://nutrimatic.org) index files
-
kytea-tokenizer
Wrapper of tokenization by KyTea
-
askama-filters
Extra template filters for Askama
-
unicode_types
A mapping of all the unicode characters into convenience types (one enum per block of characters with one variant per character)
-
aqua_parse
A parser library
-
smoldown
Native Rust library for parsing Markdown
-
unic-idna-mapping
UNIC — IDNA — IDNA Mapping Table
-
minigrep_vielenkz
grep using rust
-
groop
Outline-aware grep with syntax highlighting
-
stopwords
from popular text processing frameworks
-
encoding
Character encoding support for Rust
-
unicode-character-database
Unicode character database tables (Unicode Standard Annex #44) generated using ucd-generate
-
yeslogic-ucd-parse
parsing data files in the Unicode character database
-
gulpeaseindex
Calculate Gulpease index for a given text and language
-
minigrep_bakedspacetime
Minimal Rust implementation of grep based on The Book
-
latex-to-html
Latex to html converter
-
html_to_markdown
Convert HTML to Markdown
-
wz-conf
Configuration options for wz
-
slicer
that slices string slices into smaller string slices
-
fnew
A Unicode-aware line-oriented drop-in replacement for coreutils' fold
-
grep-searcher
Fast line oriented regex searching as a library
-
goya
morphological analyzer for Rust and WebAssembly
-
kth-lines
Command line tool for filtering stdin lines that just work
-
pdf_forms
programmatically filling out pdf forms
-
latexify
Shared definitions for turning a Rust object into LaTeX code
-
jellybean-pack-2
Sweet syntax highlighting with tree-sitter
-
genpdfi
User-friendly PDF generator written in pure Rust
-
encoding-next-index-japanese
Index tables for Japanese character encodings
-
megagrep
text-searching program for practicing Rust
-
recode_rs
Command-line tool for converting between the character encodings defined in the Encoding Standard
-
b2c2-common
Common parts of b2c2
-
typeline_ext_csv
csv parsing and serialization for typeline
-
scannedpdf
A fast, low memory footprint, simple images to PDF crate
-
lingua-basque-language-model
The Basque language model for Lingua, an accurate natural language detection library
-
rusty_word_builder
Syllable and Word generation library written fully in Rust
-
simple-text-pattern
Matching simple text patterns
-
minigrep_maybe
Just another minigrep from Rust book
-
dictionary-1024
a mnemonic dictionary with 1024 words, 10 bits per word
-
conveyance
A stop-gap CLI for conveyancing
-
mdlint-cli
tbd
-
ascii_code_finder
find ascii code of a character or get a character by its ascii code
-
kma-rustlang-vadym-polishchuk-english-parser
parser of English sentences created for KMA Rust course
-
lindera-tokenizer
A morphological analysis library
-
mdbook-extended-markdown-table
Preprocessor for mdBook that generates tables with merged cells from ASCII text
-
nib-cli
A CLI for Nib, yet another static site generator
-
lazy-char-iter
Lazily iterate over Unicode characters from a u8 slice in Rust
-
scrambler
command line tool to scramble letters
-
alphabet-encoder
A quick and dirty way to deal with escape characters
-
linetime
command line utility to add timestamps at the start of lines. The tool can either process lines from stdin or execute a command and process lines from the command's stdout and stderr.
-
saku
efficient rule-based Japanese Sentence Tokenizer
-
p4d-mdproof
Markdown to PDF converter
-
pikchr-cli
PIC-like diagramming language to SVG converter
-
ruby-parser
A parser for the Ruby language
-
uwu_cli
uwuifying the terminal
-
borderrs
Add stylish borders around your text and datastructures
-
japanese-ruby-filter
Japanese ruby notation parser
-
bgrep
grep tailored to handle binary patterns and files
-
regex-cli-test
A command line tool for testing against regex patterns
-
conv-rs
CLI converter for multi extensions
-
random_access_unicode
Provide random access to the unicode characters in a file
-
uniaxe
replace Unicode letters with Ascii equivalents
-
backslash
parsing escape characters
-
lindera-analyzer
A morphological analysis library
-
epubparse
Parse epub and convert to text-only Book structure
-
try_publish_for_demo
This program picks a random christmas emoji for you
-
lingua-bulgarian-language-model
The Bulgarian language model for Lingua, an accurate natural language detection library
-
indentation_flattener
From indented input, generate plain output with indentation PUSH and POP codes
-
pdftotext
High-level library that binds to Poppler to extract text from a PDF
-
convert_encoding
Convert encoding of text files in batch
-
minigrep-yogie
A demo Rust to grep some word from rust-lang.com
-
pcre2-sys
Low level bindings to PCRE2
-
jieba-macros
jieba-rs proc-macro
-
base1112031
way to represent numbers using all Unicode characters except ASCII control characters
-
font-map-core
Core font-parsing capabilities for font-map
-
morsels_lang_ascii
Basic ascii tokenizer for morsels
-
unicode_names
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
json-event-parser-witespace
A json event parser with whitespace
-
unicode-utf8
that converts utf-8 bytes to a unicode scalar value, and vice versa
-
rosie-sys
build or link to librosie to access the Rosie Pattern Language
-
mojibake
Encode/Decode bytes as emoji base2048
-
djot
Djot parser written in pure Rust
-
pdf2
PDF 2.0 Rust library
-
jpreprocess-dictionary-builder
Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)
-
unic-bidi
UNIC — Unicode Bidirectional Algorithm
-
unic-ucd-block
UNIC — Unicode Character Database — Unicode Blocks
-
flag_by_code
Easiest way to get an emoji flag from BCP-47 language tags
-
lingua-chinese-language-model
The Chinese language model for Lingua, an accurate natural language detection library
-
mdoc
Modern PDF creation through Markdown and LaTeX
-
STKLR
STKLR (pronounced 'stickler') is a CLI tool that automatically links functions, enums, structs, traits, etc. in rust-doc docstrings. I couldn't find a tool like this when I needed it so... here we are.
-
tex-rs
create latex documents in Rust
-
mdlynx
Small, fast utility to find broken file links in Markdown documents
-
uecho
The unicode of the echo command
-
jellybean-pack-1
Sweet syntax highlighting with tree-sitter
-
forming
lightweight architecture as code language. An architecture description language
-
encoding-index-japanese
Index tables for Japanese character encodings
-
tectonic_engine_xdvipdfmx
The xdvipdfmx program as a reusable crate
-
grep-pcre2
Use PCRE2 with the 'grep' crate
-
replaxe
A command-line tool to replace text in files with easy patterns
-
yeslogic-fontconfig
RENAMED: use the fontconfig crate instead
-
lindera-py
Python binding for Lindera
-
node_rust_demo
This program picks a random christmas emoji for you
-
lingua-portuguese-language-model
The Portuguese language model for Lingua, an accurate natural language detection library
-
infisearch_lang_ascii
Basic ascii tokenizer for InfiSearch
-
xsystem
Conversion between the Esperanto x-system and Unicode circumflexes
-
rustyword
An anagram finder
-
writedown
format parser
-
minigrep_linqi
grep implementation from The Rust Programming Book
-
mdtable-cli
that makes creating tables in markdown much easier!
-
emojicons
Parse :emoji: notation to unicode representation
-
replace-all
CLI to quickly replace occurrences of a word in a file
-
boilerpipe
text extraction from HTML documents
-
scenic
Presentation tool that utilizes the LaTeX Beamer package
-
encoding_rs
A Gecko-oriented implementation of the Encoding Standard
-
swappy
An anagram generator
-
md2pdf
A small utility to convert markdown files to pdf exploiting tectonic
-
unicode_clusters
variable width unicode characters as single items, allowing for array like indexing etc
-
unic-idna
UNIC — Unicode IDNA Compatibility Processing
-
xgrepx
xgrep is a rust implementation of grep. This is a follow up from the rust book
-
tabled
An easy to use library for pretty-printing tables of Rust structs and enums
-
lindera-dictionary-builder
Shared code for building Lindera dictionary files