Text processing

regex

regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.

v1.11.1 18.3M no-std #regex #regex-set #haystack #parser #regex-engine #u8 #nfa #dfa
unicode-width

Determine displayed width of char and str types according to Unicode Standard Annex #11 rules

v0.2.0 14.8M no-std #unicode-width #unicode #unicode-text #no-alloc #text #width
comfy-table

An easy to use library for building beautiful tables with automatic content wrapping

v7.1.4 2.1M #terminal #unicode #wrapping #styling #table
textwrap

word wrapping, indenting, and dedenting strings. Has optional support for Unicode and emojis as well as machine hyphenation.

v0.16.2 7.2M #text-formatting #wrap #typesetting #hyphenation #formatting
encoding_rs

A Gecko-oriented implementation of the Encoding Standard

v0.8.35 8.6M no-std #unicode #charset #web #standard #encoder
similar

A diff library for Rust

v2.7.0 3.4M #unified-diff #difference #change #patience #diff
const_format

Compile-time string formatting

v0.2.34 3.1M no-std #concat #formatting #arguments #no-std #macro #assertions #format
heck

case conversion library

v0.5.0 22.8M no-std #snake-case #camel-case #unicode
fancy-regex

regexes, supporting a relatively rich set of features, including backreferences and look-around

v0.14.0 3.5M no-std #regex #re #fancy-regex #ac
tabled

An easy to use library for pretty print tables of Rust structs and enums

v0.19.0 865K no-std #pretty-table #terminal #tabled #format #print #table
convert_case

Convert strings into any case

v0.8.0 8.7M #casing #string #case #boundaries #title
pulldown-cmark

A pull parser for CommonMark

v0.13.0 1.5M bin+lib #common-mark #markdown #pulldown-cmark #parser
unicode-normalization

functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15

v0.1.24 8.0M no-std #unicode-normalization #recomposition #unicode-text #text #decomposition #normalization #unicode
deunicode

Convert Unicode strings to pure ASCII by intelligently transliterating them. Suppors Emoji and Chinese.

v1.6.2 1.0M no-std #emoji #unicode #ascii #transliteration #unidecode
lazy-regex

lazy static regular expressions checked at compile time

v3.4.1 1.0M no-std #lazy-evaluation #static #regex #macro
rustybuzz

A complete harfbuzz shaping algorithm port to Rust

v0.20.1 401K no-std #true-type #opentype #text-shaping #shaping
unicode-segmentation

Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules

v1.12.0 8.4M no-std #word #unicode-segmentation #boundary #unicode #grapheme #unicode-text #text
onig

Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.

v6.5.1 709K #onig #regex #oniguruma #source #bindings #oniguruma-regex-library
emojis

✨ Lookup emoji in *O(1)* time, access metadata and GitHub shortcodes, iterate over all emoji, and more!

v0.6.4 56K no-std #emoji #github #gemoji #unicode
lopdf

PDF document manipulation

v0.36.0 123K #editing #merge #manipulation #pdf #operand
termimad

Markdown Renderer for the Terminal

v0.32.0 72K #tui #markdown #renderer #parser #terminal
widestring

wide string Rust library for converting to and from wide strings, such as those often used in Windows API or other FFI libaries. Both u16 and u32 string types are provided, including support for UTF-16 and UTF-32…

v1.2.0 1.8M no-std #wide-string #utf-16 #utf-32 #winapi
unicase

A case-insensitive wrapper around strings

v2.8.1 5.8M no-std #case-insensitive #case-folding #lower-case #no-std
mdbook

Creates a book from markdown files

v0.4.50 166K bin+lib #rust-book #gitbook #mdbook #book #markdown
prettydiff

Side-by-side diff for two files

v0.8.0 104K #diff #text #change #word
regress

A regular expression engine targeting EcmaScript syntax

v0.10.3 917K no-std #regex #syntax #regex-regex #why
html2text

Render HTML as plain text

v0.15.0 85K #html #html-text #text
unicode-bidi

Unicode Bidirectional Algorithm

v0.3.18 6.6M no-std #bidi #text-layout #rtl #unicode #unicode-text #text
unicode-general-category

Fast lookup of the Unicode General Category property for char

v1.0.0 357K no-std #unicode #category #general-category #no-std #general
pulldown-cmark-to-cmark

Convert pulldown-cmark Events back to the string they were parsed from

v21.0.0 284K #markdown-converter #common-mark #render #markdown
const-str

compile-time string operations

v0.6.2 310K no-std #string #const #operation #proc-macro
mdxjs

Compile MDX to JavaScript in Rust

v1.0.3 3.1K #markdown #mdx #compile #gfm
linkify

Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.

v0.10.0 70K #link #web #url #text
fuzzy-matcher

Fuzzy Matching Library

v0.3.7 373K #match #fuzzy-search #text-search #text #search
printpdf

reading and writing PDF files

v0.8.2 13K #pdf #graphics #gui #wkhtmltopdf
lindera

A morphological analysis library

v0.42.4 30K #morphological-analysis #library #tokenize #morphological #dictionary #analysis #reference #file
finl_unicode

handling Unicode functionality for finl (categories and grapheme segmentation)

v1.3.0 456K #unicode-segmentation #unicode #grapheme #segmentation
charabia

detect the language, tokenize the text and normalize the tokens

v0.9.5 15K #tokenize #language #normalize #document #segmenter #tokenizer
garde

Validation library

v0.22.0 51K #validation #garde #rules #ascii #length #derive #valid
diff

An LCS based slice and string diffing implementation

v0.1.13 4.2M #diff #print
roff

ROFF (man page format) generation library

v0.2.2 288K #roff #name #italic #bit #description #synopsis #fr #page
text-splitter

Split text into semantic chunks, up to a desired chunk size. Supports calculating length by characters and tokens, and is callable from Rust and Python.

v0.26.0 40K #tokenize #split #artificial-intelligence #nlp #character #tokenizer
titlecase

Capitalize text according to a style defined by John Gruber for Daring Fireball

v3.5.0 38K bin+lib #title-case #capitalization #wasm #capitalisation
synoptic

low-level, syntax highlighting library with unicode support

v2.2.9 24K #unicode #rules #text-processing #applications #below #buffering #performed #command #comments #great
lngcnv

linguistics: display pronunciation, translate between dialects, convert between orthographies; support for multiple languages: English, Latin, Polish, Quechua, Spanish, Tikuna

v1.10.1 210 app #spelling #phonetic #linguistics #speech #text-processing #language
unicode-script

exposes the Unicode Script and Script_Extension properties from UAX #24

v0.5.7 588K #script #scripting-language #unicode #unicode-text #text #language
diffy

Tools for finding and manipulating differences between files

v0.4.2 238K #patch #merge #diff
text-size

Newtypes for text offsets

v1.1.1 346K #text #size #text-size #offset
Inflector

Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…

v0.11.4 864K #inflection #pluralize #foo-bar #snake #camel
str_indices

Count and convert between indexing schemes on string slices

v0.4.4 165K no-std #string #text #no-std #indices
smartcat

Putting a brain behind cat. CLI interface to bring language models in the Unix ecosystem 🐈‍⬛

v2.2.0 140 app #chatgpt #pipe #cat #cli #artificial-intelligence
usearch

Smaller & Faster Single-File Vector Search Engine from Unum

v2.17.7 1.6K #usearch #search #cluster-analysis #text-search #metrics #unum #faiss #nearest-neighbor #quantization #full-text-search
ascii

ASCII-only equivalents to char, str and String

v1.1.0 1.7M no-std #ascii #caret-decode #libstd
os_display

Display strings in a safe platform-appropriate way

v0.1.4 46K no-std #shell #terminal #shell-terminal #text #cli #no-std
nucleo

plug and play high performance fuzzy matcher

v0.5.0 11K #matcher #nucleo #pattern #status #fuzzy-matching #fuzzy-search
arrow-cast

Cast kernel and utilities for Apache Arrow

v55.1.0 1.7M #arrow #parquet #date-time
unicode_names2

Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…

v1.3.0 163K no-std #unicode #unicode-text #text
chardetng

A character encoding detector for legacy Web content

v0.1.17 157K #unicode #charset #web #content
xan

The CSV magician

v0.50.0 900 app #csv #csv-tsv #tsv #magician #file #column #row #format #statistics
entities

raw data needed to convert to and from HTML entities

v1.0.2-rc.1 96K #character #escaping #html-entities #html
pact_consumer

Pact-Rust module that provides support for writing consumer pact tests

v1.4.0 4.1K #pact #cdc #testing #message #path #response #directory #plain-text #start-mock-server
route-recognizer

Recognizes URL patterns with support for dynamic and wildcard segments

v0.3.1 425K #router #url #segment
cruet

Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…

v0.15.0 27K #inflection #pluralize #snake-case #snake #camel
line-index

Maps flat TextSize offsets to/from (line, column) representation

v0.1.2 68K #line-index #index #line #ide
wana_kana

checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji

v4.0.0 9.3K bin+lib #kana #hiragana #katakana #japanese #romaji
autocorrect

A linter and formatter for help you improve copywriting, to correct spaces, words, punctuations between CJK (Chinese, Japanese, Korean)

v2.14.0 430 #lint #autocorrect #format #spell-check #cjk
mdbook-katex

mdBook preprocessor rendering LaTeX equations to HTML

v0.9.4 4.2K bin+lib #katex #mdbook #latex #delimiter #macro #default
jieba-rs

The Jieba Chinese Word Segmentation Implemented in Rust

v0.7.2 39K #nlp #chinese #segmenation
zeitgrep

Find frecent results in git repositories using regular expressions

v0.8.0 1.6K app #expression #zeitgrep #repository #frecency #search #grep-like #code-search #git #regex
stringsext

find multi-byte-encoded strings in binary data

v2.3.5 app #stringsext #ascii #getreu #unicode #author #copyright #status #data #string-search #forensics
unicode-case-mapping

Fast lowercase, uppercase, and titlecase mapping for characters

v1.0.0 122K #title-case #upper-case #lower-case #unicode #character
ferris-says

flavored replacement for the classic cowsay

v0.3.2 6.8K #cowsay #rustaceans #print #ferris #fsays
spellbook

A spellchecking library compatible with Hunspell dictionaries

v0.3.4 4.5K no-std #spell-check #dictionary #no-std #nuspell #suggestions #spell-checking #spellcheck #practice
textsurf

Webservice for efficiently serving multiple plain text documents or excerpts thereof (by unicode character offset), without everything into memory

v0.2.0 160 app #annotations #nlp #standoff #text #text-processing #annotation
epub-builder

generating EPUB files

v0.8.0 1.0K #epub #epub-builder #builder #default #toc-element #epub-content #zip-library
unindent

Remove a column of leading whitespace from a string

v0.2.4 4.4M #string #multi-line #heredoc #literals #string-literal #nowdoc
regex-cursor

regex fork that can search discontiguous haystacks

v0.1.5 5.0K #regex-automata #nfa-automata #dfa-automata #regex
htmd

A turndown.js inspired HTML to Markdown converter

v0.2.2 6.9K #html #markdown-converter #render-markdown #js #handler #table #converter
repoyank

Interactively traverse your repository, select files/directories, and quickly prepare structured snippets for LLM interactions

v0.3.0 410 app #llm #code-snippets #repository #clipboard
decancer

that removes common unicode confusables/homoglyphs from strings

v3.2.8 9.9K #homoglyphs #unicode #security #confusable #moderation
mdbook-pdf

A backend for mdBook written in Rust for generating PDF based on headless chrome and Chrome DevTools Protocol

v0.1.11 270 app #rust-book #pdf #mdbook #book #protocols
pdf-extract

extract content from pdfs

v0.9.0 13K #pdf #pdf2text #pdf2txt #text
ncount

A word count tool intended to derive useful stats from markdown

v0.7.2 2.2K app #word-count #text #novel
rumdl

A fast Markdown linter written in Rust (Ru(st) MarkDown Linter)

v0.0.58 1.2K bin+lib #linter #markdown #documentation #markdown-linter #issue
mdbook-pandoc

A pandoc-powered mdbook backend

v0.10.4 4.3K bin+lib #mdbook #pandoc #book #pdf #latex #back-end #table
hgrep

grep tool with human-friendly search output. This is similar to -C option of grep command, but its output is enhanced with syntax highlighting focusing on human readable outputs.

v0.3.8 110 bin+lib #syntax-highlighting #grep #bat #ripgrep #search #directory
mkrs

Build automation tool

v0.23.1 app #target #mkrs #mode #processing #run #targets-dependencies #tool #documentation #readability #default
mdbook-admonish

A preprocessor for mdbook to add Material Design admonishments

v1.19.0 5.3K bin+lib #ui-design #material-design #markdown #mdbook #material #ui
http-cmd

Run a command over HTTP

v1.0.3 290 app #http #command-line-tool #command #cli #hacked #html #language #text #emoji #profit
unicode-blocks

contains a list of all unicode blocks and provides some functions to search across them

v0.1.9 49K no-std #block #cjk #character #unicode
diff-match-patch-rs

The fastest implementation of Myer's diff algorithm to perform the operations required for synchronizing plain text

v0.5.0 4.0K #diff-match-patch #patch #diff #match #text-synchronization
omekasy

Decorate alphanumeric characters in your input with various font; special characters in Unicode

v1.3.1 460 app #emoji #omekasy #unicode #bold-italic #script #monospace #blackboard #sans #bold-script #bold-fraktur
font-types

Scalar types used in fonts

v0.9.0 248K no-std #font-types #font #byte
text_io

really simple to use panicking input functions

v0.1.13 20K #io-read #scan #read-line #scanf #io
rustc-literal-escaper

code to unescape string literals

v0.0.2 192K #rustc-literal-escaper #literals #unicode
hck

A sharp cut(1) clone

v0.11.4 bin+lib #hck #compression #text #delimiter #literals #regex #cli #decompression #column #optimization
harfruzz

A complete harfbuzz shaping algorithm port to Rust

v0.1.0 310 no-std #shaping #true-type #opentype #text-shaping
za

🛠️ Zero-to-All — scan your workspace and generate an opinionated CONTEXT.md so AI, code-reviewers and newcomers always have the full picture

v0.1.0 130 app #za #snippets #binary
stringzilla

Faster SIMD-accelerated string search, sorting, fingerprints, and edit distances

v3.12.5 550 sys no-std #hash #search #sorting
unicode-id

Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31

v0.3.5 828K no-std #unicode-id #unicode #tr31 #unicode-text #text
netidx

Secure, fast, pub/sub messaging

v0.28.0 100 #networking #distributed #kerberos
stringcase

Converts string cases between camelCase, COBOL-CASE, kebab-case, and so on

v0.4.0 30K #snake-case #camel-case #pascal-case #kebab-case #kebab
uncased

Case-preserving, ASCII case-insensitive, no_std string types

v0.9.10 922K no-std #case-insensitive #ascii #uncased #case-preserving #no-std
inlyne

Introducing Inlyne, a GPU powered yet browserless tool to help you quickly view markdown files in the blink of an eye

v0.5.0 app #viewer #gpu #markdown #dark-light #image
matchers

Regex matching on character and byte streams

v0.2.0 7.3M #regex #pattern-match #streaming #matcher
languagetool-rust

LanguageTool API bindings in Rust

v2.1.5 250 bin+lib #language-tool #rust #client-server #language #docker #changelog
mdbook-yapp

A mdBook preprocessor for simple text replacements

v1.2.1 180 app #mdbook-preprocessor #mdbook #replace #text #mdbook-pre-processor #pattern #text-replacement
whyq

jq wrapper

v0.10.2 app #whyq #jq #format #input #yaml #action #tags #file #tq #yq
uwc

Counts things in unicode text files

v1.0.8 app #word-count #unicode #input #wc #testing-fixtures #cluster #word #count #crlf
vaporetto

pointwise prediction based tokenizer

v0.6.5 1.6K no-std #japanese #tokenize #analyzer #morphological
stop-words

Common stop words in many languages

v0.8.1 11K #stop-words #nlp #localization #word #language
pks

Welcome! Please see https://github.com/alexevanczuk/packs for more information!

v0.2.24 2.6K bin+lib #information #packs #constant #yaml #privacy #ruby #namespaces #ignore #service #product
llmvm-core

The core application for llmvm

v1.1.3 app #artificial-intelligence #llm #api-bindings #thread #preset #prompt #logging #template #back-end #ai
tiefdownconverter

A CLI tool to manage and convert Markdown-based projects

v0.8.1 270 app #pandoc #document-conversion #markdown
tossicat

입력된 단어에 맞게 같이 입력된 토시(조사)를 적절하게 변환하는 라이브러리

v0.6.1 750 #hangeul #hangul #library #라이브러리 #함수
markdown-tool

A CLI utility for converting Markdown into AST and vice versa

v1.0.0 370 app #markdown-converter #markdown #ast #format
buup

Core transformation library with zero dependencies

v0.23.0 2.0K bin+lib #transformer #buup #snake-case #list #reverse-engineering #belt #development-tools #zero-dependencies #pure-rust
luciferous-case-converter

A CLI tool to convert text between different cases

v1.0.0 270 app #converter #text #cli #case
ra_ap_text_edit

Representation of a TextEdit for rust-analyzer

v0.0.241 36K #text-edit #text #edit #start #html
boreal

evaluate YARA rules, used to scan bytes for textual and binary pattern

v1.0.0 170 #yara #scan #string-matching #rules #default #module
cargo-spellcheck

Checks all doc comments for spelling mistakes

v0.15.5 1.8K bin+lib #spell-check #grammar #spelling #mistakes #hunspell #nlp #cargo #language-tool
hyperlink

Very fast link checker for CI

v0.1.44 app #hyperlink #tags #action #link-checker #linter #ci #validation
cow-utils

Copy-on-write string utilities for Rust

v0.1.3 165K no-std #string #cow #text
epub

support the reading of epub files

v2.1.4 1.7K #ebook #epub #epub-doc #spine #cover #metadata #opening
wildcard

matching

v0.3.0 79K no-std #wildcard #matching #no-std
slice-command

slice is a command-line tool that allows you to slice the contents of a file using syntax similar to Python's slice notation

v0.4.2 app #text #slice #tool #txt
xot

Full-featured XML tree library for Rust

v0.31.2 6.1K #tree #dom #xml
mdbook-svgbob

SvgBob mdbook preprocessor which swaps code-blocks with neat SVG

v0.2.2 3.9K app #bob #svg #markdown #mdbook #ascii
elfcat

ELF visualizer. Generates HTML files from ELF binaries.

v0.1.10 160 app #elfcat #html #elf64 #byte #start #msg #syscalls #themes #text #ld
dptran

run DeepL translations on command line written by Rust

v2.2.2 bin+lib #localization #translation #dptran #language #deep-l #translated #settings #key #mode #break
sscanf

(inverse of format!()) Macro based on Regex

v0.4.3 22K #scanf #regex #string #parser #text
apisnip

A terminal user interface (TUI) tool for trimming OpenAPI specifications down to size ✂️

v1.4.59 210 app #openapi #tui #swagger
autosurgeon

working with data in automerge documents

v0.8.7 4.4K #document #documents #documentation #reconcile
mdbook-embedify

based mdbook preprocessor plugin that allows you to embed apps to your book, like youtube, codepen, giscus and many other apps

v0.2.13 500 app #mdbook-preprocessor #mdbook #embed #plugin #app #mdbook-pre-processor
difflib

Port of Python's difflib library to Rust

v0.4.0 2.7M #difflib #text #diff #differs #sequence-matcher
iepub

epub、mobi电子书读写

v0.8.3 230 #ebook #epub #mobi #azw
any_ascii

Unicode to ASCII transliteration

v0.3.2 157K no-std bin+lib #emoji #transliteration #ascii #unicode #unidecode
line-ending

Detect, normalize, and convert line endings across platforms, including support for character streams. Ensures consistent handling of LF, CRLF, and CR line endings in text processing.

v1.5.1 2.4K #line-ending #ending #line-ending-conversion
topiary-queries

tree-sitter query files compatible with Topiary

v0.6.0 750 #code-formatter #tree-sitter #text
zawk

An efficient Awk-like language implementation by Rust with stdlib

v0.5.25 app #stdlib #tsv #awk #csv-tsv #etl #csv
charset

Character encoding decoding for email

v0.1.5 233K #charset #utf-7 #unicode #email
hyphenation

Knuth-Liang hyphenation for a variety of languages

v0.8.4 10K #typesetting #hyphenation #text #language
aki-resort

sort lines of text. You can use regex to specify the KEY.

v0.1.25 bin+lib #text #filter #aki-resort #numeric #month #time #version #mar #jan #oct
newdoc

Generate pre-populated module files formatted with AsciiDoc that are used in Red Hat and Fedora documentation

v2.18.4 150 bin+lib #asciidoc #redhat #documentation
collclean

Clean up collaboration commands in LaTeX files

v0.4.2 340 app #collclean #run #bob #accusam #amet
moonwave

generating documentation from comments in Lua source code

v1.3.0 bin+lib #moonwave #documentation #json #visually
lindera-tantivy

Lindera Tokenizer for Tantivy

v0.42.2 4.5K #tokenize #tantivy #lindera #tokenizer
precis-profiles

PRECIS Framework: Preparation, Enforcement, and Comparison of Internationalized Strings Representing Usernames and Passwords as defined in rfc8265; and Nicknames as defined in rfc8266

v0.1.12 12K #profile #precis #rfc-8264 #rfc-8265 #rfc-8266 #profiles
allms

One Library to rule them aLLMs

v0.17.3 470 #anthropic #gemini #assistant #mistral #openai #api-bindings
mdbook-epub

An EPUB renderer for mdbook

v0.4.48 230 bin+lib #documentation #epub #mdbook #markdown
airshipper

automatic updates for the voxel RPG Veloren

v0.16.0 290 app #airshipper #download #log-file #user #airshipper-server
near-facsimile

Find similar or identical text files in a directory

v1.0.9 130 bin+lib #compare #similarity #duplicates #similar
word-tally

Output a tally of the number of times unique words appear in source input

v0.25.0 850 bin+lib #word-count #tally #cli #word #count #words
arrow-string

String kernels for arrow arrays

v55.1.0 1.5M #arrow #parquet #array #kernel #arrow-arrays
html2md

binary to convert simple html documents into markdown

v0.2.15 8.2K bin+lib #markdown-converter #markdown #html #html-markdown-converter #list #header #quote #table #paragraph
glyph_brush_layout

Text layout for ab_glyph

v0.2.4 54K #text-layout #ab-glyph #glyph #font #layout
norad

Read and write Unified Font Object files

v0.15.0 1.7K #font #ufo #graphics #save
prema

convert markdown to html

v0.1.7 app #html #directory #markdown #themes #basic #footer #file #command
unicode-xid

Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31

v0.2.6 9.2M no-std #xid #unicode #unicode-text #text
unicode-ccc

Unicode Canonical Combining Class detection

v0.4.0 390K #unicode #detect #unicode-ccc #detection
mdcat

cat for markdown: Show markdown documents in terminals

v2.7.1 490 bin+lib #markdown #cat #less #terminal
unicode_categories

Query Unicode category membership for chars

v0.1.1 2.0M #unicode #unicode-categories #char #table
chewing

(酷音) intelligent Zhuyin input method

v0.9.1 150 #chewing #candidate #bopomofo #layout #im #cmake #sub-mode
mdbook-catppuccin

🎊 Soothing pastel theme for mdBook

v3.0.0 app #mdbook #markdown #plugin #catppuccin #pre-processor
cmark-writer

A CommonMark writer implementation in Rust for serializing AST nodes to CommonMark format

v0.7.5 2.0K #markdown #common-mark #serialization #writer #serializer
lsp-textdocument

A LSP text documents manager that map of text document

v0.4.2 12K #lsp-textdocument #textdocument #text-documents
quixote

Quizzes and tests in Markdown

v0.6.4 bin+lib #markdown #quiz #quixote #zes
srgn

A grep-like tool which understands source code syntax and allows for manipulation in addition to search

v0.13.6 bin+lib #grammar #grep #python #manipulation #localization #search #action
sile

Simon’s Improved Layout Engine

v0.15.12 bin+lib #tex #sile #engine #typesetting
dom-content-extraction

Content extraction via text density paper

v0.3.11 230 bin+lib #dom-text-density #document #html #content #paper #url
sigrs

Interactive grep (for streaming)

v0.1.4 app #streaming #sig #grep #keymap
spin-sdk

The Spin Rust SDK makes it easy to build Spin components in Rust

v3.1.1 1.8K bin+lib #spin-sdk #sdk #spin #home
unicode-joining-type

Fast lookup of the Unicode Joining Type and Joining Group properties

v1.0.0 32K no-std #joining #arabic #shaping #unicode #no-std #unicode-properties
regex-syntax

A regular expression parser

v0.8.5 26.9M no-std #regex #parser #ast
unicode-reverse

Unicode-aware in-place string reversal

v1.0.9 85K no-std #reverse #unicode #grapheme #no-std #string #grapheme-cluster
deno_features

definitions of Deno unstable features

v0.2.0 1.6K #deno #deno-features #world
codebase-to-markdown

convert codebase to markdown format

v0.1.2 430 app #format #codebase-to-markdown #markdown #script #inference
rsrpp-cli

project for research paper pdf

v1.0.12 app #rsrpp #rsrpp-cli #pdf
mdbook-quiz

Interactive quizzes for your mdBook

v0.3.12 400 app #markdown #mdbook-quiz #mdbook #quiz
mlc

The markup link checker (mlc) checks for broken links in markup files

v0.21.0 340 bin+lib #html #link-checker #markup #render-markdown #broken
molybdenum

Recursive search and replace CLI application

v0.1.10 bin+lib #search-pattern #case-sensitive #applications #folder #replacer #searcher #version
markdown-it

Rust port of popular markdown-it.js library

v0.6.1 1.4K bin+lib #common-mark #markdown #markdown-it #plugin #cmark #pulldown-cmark #ast
wchar

Procedural macros for compile time UTF-16 and UTF-32 wide strings

v0.11.1 10K #wide-string #utf-16 #wchar #string
sliceslice

A fast implementation of single-pattern substring search using SIMD acceleration

v0.4.3 44K #string-search #simd #string #single #search #text-search #text
distrs

PDF, CDF, and percent-point/quantile functions for the normal and Student’s t distributions

v0.2.2 7.1K no-std #distribution #distrs #distributions #normal #libm
file-organiser

Command line file manager to list, move or delete large numbers of files in nested folders filtered by age, file extension, file name pattern and/or size range

v0.1.8 app #directory #file #file-organiser #utility
sortuniq

Find or count unique values in an input stream

v0.3.0 app #stream #sortuniq #local #film #stage #helena #carroll #television #winifred #november
fontfor

find fonts which can show a specified character and preview them in terminal or browser

v0.4.3 app #font #character #utilities #command-line-utilities #cli #cli-utils
bbd

Binary Braille Dump

v0.3.4 app #dump #bbd #style #wrapping #stdin #output #character
xi-unicode

Unicode utilities useful for text editing, including a line breaking iterator

v0.3.0 124K #xi-unicode #unicode #utf-8
subplot

tools for specifying, documenting, and implementing automated acceptance tests for systems and software

v0.12.0 800 #subplot #metadata #yaml #verification #criteria
autumnus

Syntax highlighter powered by Tree-sitter and Neovim themes

v0.3.1 500 bin+lib #syntax-highlighting #tree-sitter #highlighter-coloring #highlighter #syntax-coloring
hypher

separates words into syllables

v0.1.5 21K no-std #syllable #hyphenation #language
tree-sitter-stack-graphs-typescript

Stack graphs definition for TypeScript & TSX using tree-sitter-typescript

v0.4.0 220 bin+lib #stack-graphs #tree-sitter #typescript #tsx
unescaper

Unescape strings with escape sequences written out as literal characters

v0.1.6 544K #escaping #string #unescaper
json2bin

A fast jsonl to RWKV binidx converter in Rust

v0.2.1 app #json2bin #length #input
mdbook-combiner

combine mdbook summaries from multiple source into one mdbook

v0.1.17 app #combiner #mdbook-combiner #mdbook
matcher_rs

A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust

v0.5.7 120 #multi #matcher #search-pattern #string-search #string #text #pattern #text-search #config
reword

some utility functions for human-readable formatting of words

v7.0.1 8.9K #reword #rogstadkjærnet #olsson
lumin

searching and displaying local files

v0.1.14 900 bin+lib #directory #filtering #file-content #image #content #pattern #regex #searcher
nanohtml2text

A zero-dependency library to convert HTML to plain text

v0.2.1 1.8K bin+lib #text #html-text #html
google-books1-cli

A complete library to interact with books (protocol v1)

v6.0.0+20240621 app #book #google #google-cli #cli #books
cwc

A word counter utility that properly handles CJK and Unicode text

v1.0.2 370 app #text #cwc #pipe #once #counter
repgrep

An interactive command line replacer for ripgrep

v0.16.1 390 app #ripgrep #find-replace #grep #utf-8 #regex
sd

An intuitive find & replace CLI

v1.0.0 8.1K app #find-replace #regex #sed
regex-literal

delimited regular expression literals

v1.3.2 #literals #regex #serialization #delimiter #undelimit #xregex
lexicmp

comparing and sorting strings lexicographically and naturally

v0.2.0 20K #emoji #transliteration #unicode #sorting #lexicographical #iterator
mdka

HTML to Markdown converter

v1.4.8 1.2K bin+lib #markdown-parser #render-markdown #html #convert
indefinite

Prefix a noun with an indefinite article - a or an - based on whether it begins with a vowel

v0.1.9 5.2K #article #noun #grammar #an #a
minimizer

Minimize files to find minimal test case

v2.0.3 800 app #minimizer #case #strategy #command #strategies
string-patterns

Makes it easier to work with common string patterns and regular expressions in Rust, adding convenient regex match and replace methods (pattern_match and pattern_replace) to the standard…

v0.3.9 #string #mode #word #pattern #pair #methods #enums
string-offsets

Converts string offsets between UTF-8 bytes, UTF-16 code units, Unicode code points, and lines

v0.2.0 4.1K #utf-16 #line #position #unicode #character
mdbook-graphviz

mdbook preprocessor to add graphviz support

v0.2.1 700 app #graphviz #mdbook-graphviz #mdbook #file
console_static_text

Logging for text that should stay in the same place in a console

v0.8.3 5.4K #console #text #console-static-text
vew

Visualize lsof output

v0.1.0 app #vew #table-row #header #output #process #size
skyspell

Fast and handy spell checker for the command line

v4.0.0 bin+lib #spell-check #line #action #dictionary #list #interface #identifier #project
mdbook-typst

An mdBook backend to output Typst markup, pdf, png, or svg

v0.1.7 app #typst #svg #mdbook #config
deeplx

package for unlimited DeepL translation

v1.4.1 bin+lib #translation #deeplx #deep-lx #config #axum #deep-l
wordcut-engine

Word segmentation/breaking library

v1.1.9 #nlp #library #engine #wordcut #path #load-dict #text #edge #cluster
fm

Non-backtracking fuzzy text matcher

v0.4.0 17K #matcher #fm #matching
swc-plugin-inferno

SWC plugin for InfernoJS

v2.5.0 290 #swc-plugin #plugin #swc #inferno #text #flags #fragment
holy-carpet

customizable blog creator

v0.1.2 420 app #render-markdown #blog #html #carpet #creator #markdown
bundle_repo

Pack a local or remote Git Repository to XML for LLM Consumption

v0.6.0 app #artificial-intelligence #tokenize #git #llm #cli
yake-rust

Yake (Yet Another Keyword Extractor) in Rust

v1.0.3 #keyword #nlp #extractor #terms #sentence #stop-words #text
qpdf

Rust bindings to QPDF C++ library

v0.3.4 750 #pdf #qpdf #object #x86-64 #linux #windows-msvc #gnu #unknown #aarch64-apple-darwin
vidyut-prakriya

A Sanskrit word generator

v0.2.0 #sanskrit #generator #prakriya #nlp #error
pad

padding strings at runtime

v0.1.6 135K #pad #run-time #alignment #stdlib
ident_case

applying case rules to Rust identifiers

v1.0.1 8.0M #field #rename-rule #case
roe

Unicode case conversion

v0.0.7 no-std #unicode #lower-case #upper-case #capitalize #no-alloc #convert
byteyarn

hyper-compact strings

v0.5.1 80K #string #binary #text
outlines-core

Structured Generation

v0.2.11 290 bin+lib #generation #json-schema #outline #regex #automation #age
termdiff

Write a diff with color codes to a string

v4.1.0 650 #terminal #text #diff
scru64

Sortable, Clock-based, Realm-specifically Unique identifier

v2.0.1 no-std #identifier #scru64 #text #bit #integer #object #variables
unicode-truncate

Unicode-aware algorithm to pad or truncate str in terms of displayed width

v2.0.0 521K no-std #truncate #unicode #pad #unicode-text #unicode-width #width #text
drova_plugins

Main plugins for drova

v3.0.2 1.5K #plugin #drova #markdown #protocols #input #converter
igrepper

The interactive grepper

v1.3.5 app #grepper #igrepper #line #editor
dmos

Djot HTML renderer with advanced features

v0.6.1 290 #syntax-highlighting #djot #dmos #highlighting #handler #syntax #anchor #emoji
timerfd

interface to the Linux kernel's timerfd API

v1.6.0 64K #timerfd #timer #api #time #disarmed #duration #use-case #set-time-flags #default
obsidian-export

associated CLI program to export an Obsidian vault to regular Markdown

v25.3.0 120 bin+lib #markdown #obsidian #export #front-matter #exporter #embed
codebank

powerful code documentation generator that creates structured markdown documentation from your codebase. Supports multiple languages including Rust, Python, TypeScript, C, and Go with intelligent parsing and formatting…

v0.4.5 600 bin+lib #parser-generator #documentation #markdown #code #parser #generator
scx_lavd

Latency-criticality Aware Virtual Deadline (LAVD) scheduler based on sched_ext, which is a Linux kernel feature which enables implementing kernel thread schedulers in BPF and dynamically loading them…

v1.0.12 160 app #sched #scx #ext #com #tree #sched-ext #case
levenshtein_automata

Creates Levenshtein Automata in an efficient manner

v0.2.1 384K #automata #levenshtein-automata #levenshtein #fuzzy
llguidance

Super-fast Structured Outputs

v0.7.23 15K #output #llguidance #grammar #outputs
htop

HTML to PDF converter

v0.2.0 app #headless-chrome #pdf #converter #html
ewts-cli

Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (cli)

v0.1.3 app #converter #ewts #tibetan #symbols #localization #cli
bashdoc

generating documentation/help menu for user defined bash functions

v0.6.0 1.0K app #documentation #bash #bashdoc #docs #output #color #zshrc #delimiter #below #void
htmd-cli

The command line tool for htmd

v0.4.1 270 app #markdown-converter #html #html-markdown-converter #markdown #converter
ohos-drawing-sys

Bindings to the native_drawing API of OpenHarmony OS

v0.2.2 5.6K #harmony-os #arguments #open-harmony #drawing #ffi
probly-search

A lightweight full-text search engine with a fully customizable scoring function

v2.0.1 550 #search-query #search-index #bm25 #search #query
fasttext

binding

v0.7.8 23K #fasttext #classify #bindings #text #api-bindings
open-lark

Lark/Feishu Open API SDK(WIP)

v0.3.6 #sdk #lark #feishu
COXave

Instruments for codings

v1.0.8 1.4K #utf-16 #utf-32 #utf-8 #ascii #encoding
whitespace-sifter

Sift duplicate whitespaces away!

v2.3.5 150 #white-space #duplicates #sifter #string #text
sf-api

API to send commands to the Shakes & Fidget servers and parse their responses into characters

v0.2.1 #character #calendar #authentication #state #now
portrait

Fills an impl with the associated items required by the trait

v0.3.1 120 #portrait #default #text #delegation #import #statement
mdbook-mermaid

mdbook preprocessor to add mermaid support

v0.15.0 8.7K bin+lib #mdbook-mermaid #mermaid #html #js #editor #td #b-d #c-d #a-b #mdbook-plugins
unicode_titlecase

add Unicode titlecase and Turkish and Azeri locale upper/lowercase utilities to chars and strings

v2.4.0 210 #title-case #string #unicode #casing #locale #to-titlecase
asmfmt

A formatter designed for programs in assembly language with AT&T syntax

v1.0.2 app #assembly #gpl #fmt #file #asm
cicero-sophia

High-performance NLU (natural language understanding) engine built in Rust for speed, accuracy, and privacy

v0.6.3 550 #nlp #chat #nlu #store #word
zhconv

Traditional/Simplified and regional Chinese variants converter based on MediaWiki & OpenCC rulesets and powered by AC automata 轉換简体、繁體及兩岸、新馬中文地區詞，基於MediaWiki和OpenCC之字詞轉…

v0.3.1 420 #chinese #mediawiki #localization #conversion #open-cc #variant #ruleset
nlpo3

Thai natural language processing library, with Python and Node bindings

v1.4.0 1.2K #tokenize #nlp #thai #word-segmentation
codepage

Mapping between Windows code page numbers and encoding_rs character encodings

v0.1.2 168K #winapi #codepage #unicode #windows
mcat

a powerfull extended cat command, to cat all the things you couldn't before

v0.2.8 2.3K app #cat #inline #terminal #file-converter #markitdown
rzozowski

A regex crate using Brzozowski derivatives

v0.2.0 200 #regex #derivative #brzozowski
diesel_full_text_search

Adds support for PostgreSQL full text search to Diesel

v2.2.0 10K #full-text-search #text #diesel
newline-converter

Newline byte converter library

v0.3.0 331K #line-break #newlines #crlf #convert #unix2dos #newline #methods
instant-segment

Fast English word segmentation

v0.11.1 #segmentation #segment #instant-segment
affinidi-messaging-text-client

Affinidi Messaging SDK

v0.10.7 440 app #affinidi #ssi #client
unidown

Convert Markdown to Unicode

v0.8.3 bin+lib #unicode #unidown #string #table #heading
speedreader

A command-line speed reading tool

v0.1.2 310 app #text #speed-reading #cli #config
mdmodels

generate models, code and schemas from markdown files

v0.2.3 180 bin+lib #define #model #object #python #markdown #golang #typescript #template
lindera-ko-dic-builder

A Korean morphological dictionary builder for ko-dic

v0.32.3 12K #builder #morphological #ko-dic #dictionary #korean
stfu8

Sorta Text Format in UTF-8

v0.2.7 66K #binary #repr #unicode #unicode-text #invalid #text
grok

popular java & ruby grok library which allows easy text and log file processing with composable patterns

v2.0.0 22K #grok #alias #compilation #processing
latkerlo-jvotci

Tools for creating and decomposing Lojban lujvo

v2.3.1 230 bin+lib #lujvo #lojban #latkerlo-jvotci #31s #6s #1m27s
wrapr

wrap your code for ai

v0.1.8 110 app #tui #clipboard #file-utility #file #artificial-intelligence #config #terminal
symbolic-cfi

process call frame information

v12.15.5 13K #symbolic #information #symbolic-cfi #background
aki-xcat

concatenate files that are plain, gzip, xz and zstd

v0.1.36 bin+lib #text #filter #aki-xcat #abcdefg #hijklmn #gz #lz4 #txt
quagga

CLI tool that combines multiple text files into a single prompt suitable for Large Language Models

v0.1.4 150 bin+lib #llm #text #quagga #cli #txt #directory #node-modules #size #quagga-ignore #part
glimpse

A blazingly fast tool for peeking at codebases. Perfect for loading your codebase into an LLM's context.

v0.7.5 460 app #directory #depth #detect #processing #back-end #tokenize #pdf #repository #markdown #exit
wit_owo

interacting with the Wit.ai API

v1.0.2 230 #nlp #nlu #wit-ai #api #api-bindings
mdlib

A beautiful markdown note-taking application

v0.1.1 100 app #notes #web-apps #markdown #knowledge-base #wiki
lsprotocol

Rust types for Language Server Protocol generated from LSP specification

v1.0.0-alpha.2 #lsp #text-document #proposed
line-numbers

Find line numbers in strings by byte offsets, quickly

v0.4.0 1.4K #line-numbers #numbers #line #quickly
mupdf

Safe Rust wrapper to MuPDF

v0.5.0 2.1K #pdf #mupdf #mu-pdf #progress
create_broken_files

Create broken files from other ones

v3.1.0 500 app #broken #character #ones #crash #data #fuzzer
mdbook-theme

A preprocessor and a backend to config theme for mdbook, especially creating a pagetoc on the right and setting full color themes from the offical ace editor

v0.1.6 bin+lib #themes #rust-book #markdown #ace #book #pre-processor
datafusion-functions

Function packages for the DataFusion query engine

v47.0.0 450K #data-fusion #logical #plan #expression #expressions
tmenu

TUI fuzzy finder

v0.1.0 140 app #ratatui #fuzzy-finder #tmenu #tui
character_converter

Turn Traditional Chinese script ot Simplified Chinese script and vice-versa and tokenize

v2.1.5 1.0K #chinese #hanzi #traditional #simplified #localization #convert
mdbook-alerts

mdBook preprocessor to add GitHub Flavored Markdown's Alerts to your book

v0.7.0 3.0K bin+lib #mdbook #mdbook-preprocessor #markdown #github #mdbook-pre-processor
roman-numerals-rs

Manipulate well-formed Roman numerals

v3.1.0 2.0K no-std #roman-numeral #roman-numerals #numeral #roman
patto

🐙 Yet another plain text format for quick note taking and task management

v0.1.8 bin+lib #management #patto #parser #markdown #properties #renderer #note #define #plugin #diagnostics
tabprinter

creating and printing formatted tables in the terminal. It supports various table styles and offers both color and non-color output options.

v0.2.1 #formatting #alignment #table #amiga #cell #color #style #stdout #character #term-color
autotex

Continuously compile TeX and LaTeX

v1.4.1 390 app #latex #autotex #tex-engine #engine #pdf #manual
pyo3-filelike

Rust access to Python file-like objects

v0.4.2 4.0K #object #pyo3-filelike #python #objects
vmks-exam-generator

CLI program for pseudo-randomly generating different variants of an embedded programming exam

v1.3.1 bin+lib #generator #exam #vmks #questions #bank #group #segment
purlu

A full-text search engine

v2.0.0 250 #english #purlu #text #index #query #object
junit-report

Create JUnit compatible XML reports

v0.8.3 115K #junit #report #xunit #test-suite #xml
seshat-unicode

A Unicode Library for Rust. Unicode 16.0.0 ready. XID_Start and XID_Continue are also available.

v0.3.1 1.5K #unicode #unicode-version #normalization #properties #break #version #standard #𓋇𓏏𓁐
text2num

Parse and convert numbers written in English, Dutch, Spanish, Portuguese, German, Italian or French into their digit representation

v2.6.1 #nlp #words-to-numbers #text2digits
unic-char-property

UNIC — Unicode Character Tools — Character Property taxonomy, contracts and build macros

v0.9.0 991K #character-property #unic #unicode #unicode-text #macro #text
clipcount

Counting words from the clipboard content

v1.0.7 270 app #word-count #clipboard #text #count #words
codebook_config

Configuration handling for the Codebook spell checker

v0.1.0 160 bin+lib #spell-check #config #settings
mdbook-environment

A preprocessor for MdBook for working with environment variables

v0.0.4 app #mdbook #environment #pre-processor #environments
chonkier

🦛 Chonkie, now in Rust 🦀: No-nonsense, ultra-fast, ultra-light chunking library

v0.0.2 160 #chonkier #chunks #sentence #recursive-chunker #recursive-rules #character-tokenizer #ever #crab-sparkles
plsfix

Text cleaner upper

v0.1.8 #upper #plsfix #print #fix-text #â-œ
mdbook-pagebreaks

A mdbook preprocessor to insert page breaks when rendering to HTML

v0.3.1 120 app #html #mdbook #pagebreaks
cronus_generator

The generators for cronus API spec

v0.4.4 230 #typescript #cronus #string #axum #openapi #async #async-trait #eq #documentation
rapidfuzz

rapid fuzzy string matching library

v0.5.0 12K #levenshtein #string-similarity #hamming #jaro
capitalize

Change first character to upper case and the rest to lower case, and other common alternatives

v0.3.4 1.2K #capitalize #string #title #case #change #alternative
weasel-gen

Random ascii generation animation until target string is met

v1.0.0 app #weasel-gen #met #gen #gif #0-9 #world
herring-automata

Automata construction for Herring

v0.1.3 170 #dfa-automata #nfa-automata #herring #dfa #automata
quickmd

Quickly preview a markdown file

v0.7.1 bin+lib #markdown #gtk #file
mut-str

A toolkit for working with mutable string slices (&mut str)

v1.1.0-alpha.2 no-std #string #slice #mutability #str-ext
jetstream_9p

Jetstream is a RPC framework for Rust, based on the 9P protocol and QUIC

v8.1.5 #jetstream #jetstream-9p #tags #streaming #mtls #echo #documentation #0-rtt
esri_ascii_grid

reading ESRI Ascii Grid .asc files

v0.4.6 #ascii #grid #raster #esri #asc
boreal-cli

CLI utility to run boreal, a YARA rules engine

v1.0.0 130 app #yara #scan #string-matching #engine
vader-sentimental

A faster Rust version from the original Python VaderSentiment analysis tool

v0.1.2 bin+lib #nlp #sentiment-analysis #vader-sentimental #text-analysis #analyse #lol #content #vader-sentiment-analysis #words-phrases #sux
mandown

Markdown to groff (man page) converter

v1.1.0 4.8K bin+lib #troff #manpage #markdown #roff
whitespacesv

parser/writer for the Whitespace-Separated Value format, as defined by Stenway. See https://dev.stenway.com/WSV/. WSV offers an unambiguous alternative to CSV.

v1.0.2 #separated #value #white-space #wsv #reliable-txt #array
galm

pattern matching library

v0.3.1 100 #matching #sorting #galm #cli #command #start
kelp

A convert tool for Japanese

v0.6.0 500 bin+lib #cli #convert #hiragana #half-width #full-width #katakana
peppi

Parser for Slippi replay files

v2.1.0 200 #peppi #position #state #format
unicode-security

Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules

v0.1.2 209K #security #unicode #unicode-text #text
text-to-ascii-art

program to convert text to ASCII art

v0.1.10 300 bin+lib #ascii-art #art #text-to-ascii-art #text #string #rectangle #assets #com
rustdoc-stripper

manipulate rustdoc comments

v0.1.19 550 bin+lib #documentation #rustdoc #tool #strip #docs
ipset_lookup

ipset is a command-line tool that takes networks or IPs and searches through a lot of different threat feeds quickly. It can also download the feed data necessary to perform the queries…

v0.4.8 bin+lib #blocklists #threat-feed #cti #threat-intel
unicode-properties

Query character Unicode properties according to UAX #44 and UTR #51

v0.1.3 2.9M no-std #unicode-properties #unicode #unicode-text #no-alloc #text #emoji #unicode-emoji #unicode-general-category #general-category
unbom

Remove UTF-8 BOM from files

v0.2.2 app #unbom #utf-8 #txt
jx

An interactive JSON explorer for the command line

v0.5.0 app #interactive-cli #interactive #json #cli #explorer
sapling-streampager

streampager is a pager for command output or large files

v0.11.0 13K #pager #less #more #sapling
makepad-fonts-emoji

Makepad emoji fonts

v1.0.0 550 #font #makepad #makepad-fonts-emoji #build #fractals #makepad-example-ironfish
rust-persian-tools

Official Rust implementation of Persian Tools

v1.1.4 #persian #iran #tool #farsi #localization #text-processing
inkjet

A batteries-included syntax highlighting library for Rust, based on tree-sitter

v0.11.1 310 #syntax-highlighting #highlight #tree-sitter
asimov-module-cli

ASIMOV Module Command-Line Interface (CLI)

v25.0.0-dev.3 400 bin+lib #asimov #artificial-intelligence #module #cli
grok2

popular java & ruby grok library which allows easy text and log file processing with composable patterns. A fork of the grok crate.

v2.0.1 #grok2 #alias #grok #processing
jetscii

A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes

v0.5.3 62K #ascii #string-search #ascii-text #ascii-string #search #byte #simd #string
svgbob_cli

Transform your ascii diagrams into happy little SVG

v0.7.6 1.2K app #svg #ascii #bob #convert
zet

zet finds the union, intersection, set difference, etc of files considered as sets of lines

v2.0.1 bin+lib #set-operations #uniq #zet #union #set #intersection #command
patchkit

parsing and manipulating patch files

v0.2.1 3.2K #patch #hunk-line #parse-patch #content-patch #vec
uast

Unicode Aware Saṃskṛta Transliteration in Rust 🦀

v6.0.1 bin+lib #uast #iast #devanāgarī #transliteration #देवनाग #ગુજરાત
twars-url2md

A powerful CLI tool that fetches web pages and converts them to clean Markdown format using Monolith for content extraction and htmd for conversion

v1.4.2 bin+lib #render-markdown #converter #web #html #html-converter #markdown #convert
case_insensitive_hashmap

A HashMap that uses case-insensitive strings as keys

v1.0.1 1.4K #case-insensitive #hash-map #case-folding #unicase
fuzzy-aho-corasick

Aho–Corasick automaton with fuzzy matching

v0.2.2 350 #aho-corasick #case-insensitive #fuzzy #matching #replacer
rust_string_utils

String utilities for rust based on org.apache.commons.lang3

v0.1.20 #ignore-case #byte-array #delimiter #lang3 #status #utilities
vlazba

Lojban words generator and analyzer

v0.7.14 bin+lib #nlp #lojban #conlang #jvozba #analyzer #generator
tesseract-rs

Rust bindings for Tesseract OCR with optional built-in compilation

v0.1.19 sys #tesseract #ocr #computer-vision #text-recognition #compilation
harper-ls

The language checker for developers

v0.38.0 1.8K app #harper-ls #harper #english #development-tools
whatwg-infra

Tiny Rust-based implementation of the WHATWG Infra Standard

v1.1.0 no-std #specification #standard #whatwg #infra #string
nu_plugin_emoji

a nushell plugin called emoji

v0.13.0 190 app #emoji #plugin #nu-plugin-emoji #unicode #nushell-plugin #nu-shell #unicode-version #code-point #utf-8 #shortcodes
asciimath-unicode

Convert asciimath to unicode

v0.1.4 300 bin+lib #unicode #asciimath-unicode #asciimath #binary
bogrep

Full-text search for bookmarks from multiple browsers

v0.10.1 1.3K bin+lib #full-text-search #bookmarks #grep #source #browser #cli
fuzzt

Implementations of string similarity metrics. Includes Hamming, Levenshtein, OSA, Damerau-Levenshtein, Jaro, Jaro-Winkler, and Sørensen-Dice.

v0.3.1 9.6K #levenshtein #string-similarity #hamming #jaro #string #similarity
askalono-cli

detect the contents of license files

v0.5.0 app #askalono #askalono-cli #text #line #notice
rascii_art

Advanced ASCII Art Generator

v0.4.5 390 bin+lib #ascii #generator #image #art #filename #charset #ascii-art
newline_normalizer

Zero-copy newline normalization to \n or \r\n with SIMD acceleration

v0.1.6 420 #newlines #unix-windows #unix #windows #normalize #crlf #text
paswitch-rs

List and swap to pulse sinks by name

v0.3.15 140 app #name #paswitch-rs #paswitch #interactive #git #list
tkrar

Count frequency of words in a file or a directory

v0.3.0 app #cli #directory #word-count #word #format #character #stdin #stop-words #pattern #count
ticker-sniffer

extracting multiple stock ticker symbols from a text document

v0.1.0-alpha9 bin+lib #ticker #extract #sniffer #progress
presenterm

A terminal slideshow presentation tool

v0.14.0 750 app #presentation #slide #tool #terminal #markdown #action #slideshow
xpath-cli

Evaluate XPath selectors on XML or HTML documents

v1.2.0 app #html #xml #xpath-cli #document #documents #bat
oxford_join

Join string slices with Oxford Commas!

v0.5.0 no-std #join #grammar #string #list #comma
evcxr

An Evaluation Context for Rust

v0.20.0 30K bin+lib #evcxr #notes #compilation
precis-tools

Tools and parsers to generate PRECIS tables from the Unicode Character Database (UCD)

v0.1.9 12K #compare #precis #internationalization #enforcement #preparation #comparison
async-utf8-decoder

Convert AsyncRead to incremental UTF8 string stream

v1.0.0 190 #async-stream #utf-8 #utf8-decoder #mpsc #await
say-rust

command-line tool which is an alternative to echo

v1.0.1 app #say #say-rust #text
furigana

Map furigana to a word given its reading

v0.1.11 160 #japanese #furigana #物ものの
substudy

Language-learning tools for working with parallel, bilingual subtitles and media files

v0.5.2 bin+lib #text #substudy #srt #progress
llm-tui

A Terminal User Interface (TUI) for interacting with Language Learning Models (LLM) using llm-cli

v0.1.1 170 app #artificial-intelligence #tui #chat-bot #llm
anycase

a case conversion library for Rust

v0.1.0 170 no-std #pascal-case #snake-case #camel-case #unicode
emojic

Emoji constants

v0.4.1 1.0K no-std #gender #pair #tone #emoji
rustic_print

A versatile Rust library for enhancing console output. It offers a range of features to create a more engaging and informative command-line interface.

v0.2.1 #text-styling #rust-library #formatted-output #console-printing #cli-enhancement
the_rock

A command line King James bible viewer

v0.9.2 app #rock #viewer #bookmarks #screenshot
unidoc

Unite all Markdown

v0.7.8 app #markdown #unidoc #text #block #context #table #emoji #hyperlink #heading #hr
textalyzer

Analyze key metrics like number of words, readability, and complexity of any kind of text

v0.3.0 bin+lib #text #nlp #analysis
thoth-note

note-taking app written in Rust

v0.1.1 app #markdown #note-taking #tui #rust #note
mdbook-cmdrun

mdbook preprocessor to run arbitrary commands

v0.7.1 550 bin+lib #mdbook #mdbook-preprocessor #mdbook-pre-processor #runcmd #cmdrun #file #command
what-rs

Identify what something is! A pyWhat reimplementation in Rust

v0.4.1 app #regex #nlp #identifier
mdbook-d2

D2 diagram generator plugin for MdBook

v0.3.4 370 bin+lib #mdbook #common-mark #markdown #d2
codetypo-dict

Source Code Spelling Correction

v0.12.7 #spelling #codetypo #codetypo-dict #correction #spell-check #development #development-tools #monorepo #pr
mdbook-tailor

mdbook preprocessor for image-tailor

v0.8.2 bin+lib #image #tailor #mdbook
rake

Rapid Automatic Keyword Extraction (RAKE) algorithm

v0.3.6 180 #algorithm #keyword #rake
gemini-map

A command-line tool to run files in parallel through Google Gemini

v0.1.2 app #gemini-map #gemini #pdf #split-pdf #html
mktoc

Generate Table of Contents from Markdown files

v4.0.0 bin+lib #mktoc #markdown #toc #min-depth #generator #command-line-tool
kaff_sso

Small-buffer-optimized generic buffer and UTF-8 string type

v0.2.2 600 #sso #kaff-sso #napi
diffside

A CLI tool for side-by-side file diffs with themed highlighting

v0.1.0 160 app #diff #diffside #detect #highlighting #numbers #alignment #txt #tooling #terminal #side-by-side
figlet-comment

quickly create banner to use as comments

v0.4.0 app #text #figlet-comment #comments #clipboard #stdout
madato

command line tool for reading and writing tabular data (XLS, ODS, CSV, YAML), and Markdown

v0.7.0 2.4K bin+lib #markdown #yaml #csv #excel
data-streams

Extension traits for reading and writing data with streams

v2.0.0-pre.3 no-std #io #stream #data-stream
mdbook-open-on-gh

mdbook preprocessor to add a open-on-github link on every page

v2.4.3 170 bin+lib #page #mdbook-open-on-gh #mdbook
nu-utils

Nushell utility functions

v0.104.1 20K bin+lib #nu-utils #array #default #nu-shell
timug

It has been created for personal blog creation purpose. Timus has its limits, but it fulfills the purposes for which it was created.

v0.1.3 110 app #markdown #blog #page-generator #static-page #generator #social-media #statistics #code-block #project
mdbook-private

An mdbook preprocessor that controls visibility of private chapters and sections within them

v0.2.3 bin+lib #private #mdbook-private #section
gh-emoji

Convert :emoji: to Unicode using GitHub’s emoji names

v1.0.8 6.3K #emoji #unicode #markdown #github #convert
unicode-bidi-mirroring

Unicode Bidi Mirroring property detection

v0.4.0 248K #unicode #detect #mirroring #detection
scanix

search a text or pattern in files. A fast and lightwight text tool.

v0.5.1 app #scanix #config #production
rusty-tesseract

wrapper for Google Tesseract

v1.1.10 1.5K bin+lib #tesseract #rusty-tesseract #parameters #image
secular

No Diacr!

v1.0.1 2.5K #unicode-normalization #diacritics #secular #normalization #diacr #unicode
indent

Functions for indenting multiline strings

v0.1.1 120K #multi-line #indentation #string
kas-text

Text layout and font management

v0.7.0 180 #shaping #bidi #glyph #navigation #harfbuzz #management #sub-range #text
pukram-formatting

A type to represent the formatting of the pukram markup language

v0.2.1 #formatting #text #markup #pukram
mdbook-angular

mdbook renderer to run angular code samples

v0.4.0 bin+lib #angular #mdbook #sample #ts #action #block #flags #config #tags #input
topiary-cli

CLI app for Topiary, the universal code formatter

v0.6.0 190 app #code-formatter #tree-sitter #text #cli
libharu_ng

Easily generate PDFs from your Rust app

v1.0.10 120 sys #libharu-ng #api-bindings #libharu #pdf #haru
langram

Natural language detection library

v0.5.1 550 #nlp #detect #detect-language #recognise #detector #language #language-detect
vesti

A preprocessor that compiles into LaTeX

v0.15.0 app #latex #vesti #tectonic #transpiler #end #geometry #begin #figure #document #section
sk-skimmer

Fuzzy Finder in rust!

v0.13.6 250 bin+lib #menu #fuzzy #skim #utilities #mode #fzf #sk
yara-x-fmt

A code-formatting library for YARA rules

v0.15.0 120 #yara #fmt #yara-x-fmt #yara-x
yeslogic-ucd-generate

A program for generating packed representations of the Unicode character database that can be efficiently searched with support for additional tables

v0.7.0 app #generate #unicode #fst #character #table
charx

A replacement for char::is_ascii*

v1.1.0 #charx #is-ascii #build
pulldown-cmark-toc

Generate a table of contents from a Markdown document

v0.7.0 #markdown #pulldown-cmark #github #toc #common-mark
iotext_rs

IoText data protocol

v0.5.0 bin+lib #data #iot #protocols #com #crc #bieli #timestamp
ib-pinyin

一个高性能拼音匹配库

v0.2.5 #pinyin #cjk #ib-pinyin #unicode #匹配库 #个高性能拼 #ib-pinyin-lib #py
html-query

jq, but for HTML

v1.2.2 650 app #html-query #html #parser #hq
lipsum

lorem ipsum text generation library. It generates pseudo-random Latin text. Use this if you need filler or dummy text for your application. The text is generated using a simple Markov chain…

v0.9.1 45K #markov-chain #text #typography #random
krafna

terminal-based alternative to Obsidian's Dataview plugin, allowing you to query your Markdown files using standard SQL syntax

v0.5.6 bin+lib #markdown #sql #obsidian #json #query #execute #field
in_definite

Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.

v1.0.0 5.4K #nlp #english #grammar #text
pathmut

Command line utility for manipulating path strings

v0.7.0 nightly bin+lib #file-path #file-extension #string #component #file #path #prefix #stem
see-cat

A cute cat(1)

v0.8.1 app #cat #viewer #syntax-highlighting #terminal #markdown
reggy

friendly, resumable regular expressions for text analytics

v0.0.6 #analytics #reggy #buf-reader #nlp #regex #file
rust-ai

A collection of 3rd-party AI APIs for Rust

v0.1.22 bin+lib #openai #voice #openai-api #azure #com #ai-api #sk #xxxxxxxxxxxxxxxx #westus #xxxxxxxxxx
csvpeek-rs

A CLI tool to quickly peek into, list, and filter CSV data

v0.1.0 app #csv #filter #peek #data #cli
hebrew_unicode_script

A low-level library designed to ascertain whether a character belongs to the Hebrew Unicode script. It supports checks for individual characters as well as for membership within collections

v2.0.0 no-std #apf #hebrew #unicode-text #unicode-characters #utf-8 #no-std #collection
taos-query

Driver for TDengine - a timeseries database and analysis platform

v0.12.4 170 #query #platform #client
epcmanager

EPC text tool for RFID

v0.1.0 app #rfid #ascii #epc
fontconfig

Safe, higher-level wrapper around the Fontconfig library

v0.10.0 1.9K #fontconfig #wrapper #font #search
stam

powerful library for dealing with stand-off annotations on text. This is the Rust library.

v0.16.5 600 #annotations #nlp #linguistics #standoff #text-processing #annotation
doxygen-bindgen

Converts Doxygen comments into Rustdoc markdown

v0.1.3 2.6K #markdown #doxygen-bindgen #bindgen #build-dependencies #arguments #debugging #derive
string-auto-indent

Normalizes multi-line string indentation while preserving platform-specific line endings

v0.1.2 #auto-indent #auto #string-auto-indent
mdbook-pikchr

A mdbook preprocessor to render pikchr code blocks as images in your book

v0.1.9 app #mdbook #pic #pikchr #md #markdown
forbidden-bands

8-bit string handling library

v0.2.3 #c64 #ascii #8-bit #unicode #string #ascii-text
kathoey

text feminization using open corpus linguistics data

v1.1.5 #russian #nlp #kathoey #data #org #performance #up #regular #format #feminization
cskk

C ABIから使う事を目的とした SKK(Simple Kana Kanji henkan)方式のかな漢字変換ライブラリ

v3.1.4 1.0K #henkan #cskk #dictionary #version #html #skk #方式のかな #input-methods #lib-cskk
harfbuzz_rs

A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust

v2.0.1 600 #harfbuzz #shaping #textlayout #font #api-bindings
ponsic-winsafe

The dependency of the ponsic crate

v1.3.0 100 #winapi #ponsic-winsafe #ponsic #风格封装 #包含了 #编程的 #工具函数和宏
mdbook-aquascope

Interactive Aquascope editor for your mdBook

v0.3.5 bin+lib #aquascope #mdbook #mdbook-aquascope #run-time #test-tube #toml #interpreter #test-tube-warning
repvar

A tiny CLI tool that replaces variables of the style ${KEY} in text with their respective value. It can also be used as a rust library

v0.14.1 bin+lib #command-line-tool #variables #replace #cli
ascii_help

help you quickly convert ASCII codes

v1.2.1 230 app #ascii #codes #help #character
mdbook-callouts

mdBook preprocessor to add Obsidian Flavored Markdown's Callouts to your book

v0.2.2 110 bin+lib #mdbook-preprocessor #mdbook #mdbook-pre-processor #markdown #obsidian #callouts
wit_ai_rs

An unofficial Rust crate for interacting with the wit.ai API

v0.2.1 #wit #nlp #witai #api #traits #entities #audio #utterances
djotters

Djot (Markdown) parser that runs at hyper speeds!

v0.1.17 #djotters #html #list #speeds
tree-sitter-stack-graphs-javascript

Stack graphs definition for JavaScript using tree-sitter-javascript

v0.3.0 220 bin+lib #stack-graphs #tree-sitter #javascript
mds

A skim-based *.md explore and surf note-taking tool

v0.19.2 app #note-taking #markdown #skim #pkm
inlet_manifold

A general purpose highlighting library

v0.2.0 #regex #highlighting #tailspin #default
rustyink

Blazing fast static site generator

v0.2.1 app #generator #site #static-site-generator #themes #sitemap
html-compare

compare html files

v0.1.4 #mrml #compare #html-compare #mjml #extension
mnm

Mnemonic sentences for BitTorrent info-hashes

v1.0.3 260 app #info-hashes #mnm #mnemonic #word #rationale #glick #complaisantly #definissent #tuilleries #pilotin
basalt-tui

Basalt TUI application for Obsidian notes

v0.3.7 850 bin+lib #tui #basalt #markdown
our-string

Customizable shared strings with inlining

v0.1.4 #string #our-string #inlining
sugarloaf

Rio rendering engine, designed to be multiplatform. It is based on WebGPU, Rust library for Desktops and WebAssembly for Web (JavaScript). This project is created and maintained for…

v0.2.16 700 #sugarloaf #language #font #testing #rio-terminal
okkhor

English to Bangla phonetic conversion following the 'Avro' rules

v0.7.0 #rules #okkhor
mdbook-tocjs

A mdbook preprocessor which adds extra js and css file for ToC hydration

v0.1.4 bin+lib #mdbook #mdbook-tocjs #tocjs #save-dir #theme-dir #wing
catalog-of-markdown

Generate the catalog of markdown file

v0.1.6 350 bin+lib #catalog-of-markdown #title #markdown #sub-sub-title
flowquad

that helps you build UI stuff with Macroquad

v1.1.2 #text-input #toggle #button #label #container #macroquad
vi

An input method library for vietnamese IME

v0.7.0 #vi #ime #vietnamese #vni
ised

An interactive tool for find-and-replace across many files

v0.3.1 500 bin+lib #ised #config #command-line-tool
creature_feature

Composable n-gram combinators that are ergonomic and bare-metal fast

v0.1.7 bin+lib #book #bag #nlp #hash #hashed-a #tokenize #ization #featur #derive #ml
make87_messages

Message Types for Rust SDK for make87 platform

v0.2.6 1.1K #make87-messages #message #make87
puppet-fmt

Automatic code formatter for puppet manifests

v0.1.2 app #manifest #puppet-fmt #string #alignment #white-space #manifests #output
armnod

random string generator

v0.10.0 bin+lib #armnod #random #api
ankinase

A parser which generates Anki cards from CommonMark

v0.1.2 450 app #anki #ankinase #tabs
dprint-plugin-markdown

Markdown formatter for dprint

v0.18.0 4.3K #plugin #markdown #dprint-plugin
shell2batch

Coverts simple basic shell scripts to windows batch scripts

v0.4.5 37K #shell #scripting #batch #convert
case

A set of letter case string helpers

v1.0.0 60K #ascii-text #ascii #ascii-string #string #camel #snake #alphabet #helper
jedi

Juggernaut Electronic Data Interchange package. This library provides a data exchange layer extended through the holy crate.

v0.1.14 140 #jedi #monorepo
string_more

Extension traits for String and &str types

v0.3.0 130 #string #in-place #edit-distance #extension
regex-charclass

Manipulate and convert regex character classes

v1.0.3 #regex #complement #difference #intersection #union
dcsv

Dyanmic csv reader,writer,editor

v0.3.4-beta.2 #csv #cli #editor #value #text-processing
wikibase

access Wikibase

v0.7.3 170 #wikidata #mediawiki #php #instance
mdbook-chess

An mdbook preprocessing plugin to generate chess boards

v0.2.2 app #chess #mdbook #chess-board #markdown
rustkorean

processing Korean characters. It provides functionalities to check if a character is Korean, classify Korean characters, verify if a character is a leading consonant (choseong), a medial vowel (jungseong)…

v1.1.2 #korean #rustkorean #compose-korean #character #rust-korean
addbib

An app to add linked bibliographies to markdown files

v0.1.0 app #markdown #bibliography #citation #automation #documentation
mdbook-linkcheck2

A backend for mdbook which will check your links for you

v0.9.1 bin+lib #mdbook #http-header #link #link-check2
tbll

tbll outputs data in tabular format

v1.1.0 app #cli #table #format
strloin

copy on write slices of a string

v0.3.0 #copy-on-write #slice #strloin #string #cow
aneubeck-daachorse

Daachorse: Double-Array Aho-Corasick

v1.1.1 4.1K no-std #aho-corasick #double-array #search #multi #text-search #text
easy_reader

easily navigating forward, backward or randomly through the lines of huge files

v0.5.2 5.4K #reader #line #reverse #backward #random
rustclock

a stopwatch or timer cli made in rust

v0.2.2 app #minutes #rustclock #clock #minuttes
strs_tools

Tools to manipulate strings

v0.19.0 390 no-std #general-purpose #wtools #strs-tools
advent-ocr

Converts ASCII-art representations of letters generated by Advent of Code puzzles into a String containing those letters

v0.1.5 #advent-of-code #ascii #ocr
gazetta-render-ext

A static site generator framework. Extra render code.

v0.4.0 110 #static-site #blog #gazetta #website
lancelot

binary analysis framework for x32/x64 PE files

v0.9.7 #graph #flow #lancelot #reverse-engineering #malware-analysis
nucleo-matcher

plug and play high performance fuzzy matcher

v0.3.1 30K #nucleo #matcher #fuzzy-matching
notion2html

Convert Notion pages to HTML

v1.0.1 app #notion #html #markdown #render-markdown
arf-strings

Encoding and decoding for ARF strings

v0.7.3 1.6K #string #arf-strings #arf #nul-escaped-portion
mdbook-tabs

mdBook plugin for rendering content in tabs

v0.2.3 1.8K bin+lib #mdbook-tabs #tabs #mdbook
unicount

Alphabetic counter supporting unicode

v0.1.4 170 app #unicode #unicount #separator #english-lower #ct #cv #ac
malachi

A domain specific pattern matching language made for defining bot commands

v0.9.8 #discord-bot #regex #bot #dsl #discord #pattern-matching
hlight

dedicated to delivering exceptional syntax highlighting capabilities

v0.0.11 210 #syntax-highlighting #syntax-set #hlight #theme-set #highlighting #syntax #file #start
replaxe

A command-line tool to replace text in files with easy patterns

v0.1.1 app #text #command-line #replace #pattern #text-replacement
polars-compute

Private compute kernels for the Polars DataFrame library

v0.48.1 93K #polars #compute #polars-compute #arrow
enma

serving anime and manga information 📦

v0.9.2 #web-scraping #manga #anime #rust #otaku
indent_write

Write adapters to add line indentation

v2.2.0 328K no-std #indent-write #write #indent #indentation
mdbook-toc

mdbook preprocessor to add Table of Contents

v0.14.2 3.5K bin+lib #toc #mdbook-toc #content #marker #level #contents
pandoc

API that wraps calls to the pandoc 2.x executable

v0.8.11 1.6K #markdown #latex #pandoc #executable
utf8_iter

Iterator by char over potentially-invalid UTF-8 in &[u8]

v1.0.4 9.6M #utf-8 #iterator #unicode
picodiff

Tiny GUI app to compare text easily

v0.9.4 app #diff #text #productivity #compare
to_markdown_table

An easy way to format any data structure into a Markdown table

v0.1.5 9.5K #table-row #markdown-tables #markdown #u32
regexnight

Command-line tool to print syntax-highlighted versions of regular expressions and spot errors

v0.2.0 bin+lib #error #regex #regexnight #right #light
cesu8

Convert to and from CESU-8 encoding (similar to UTF-8)

v1.1.0 2.4M #utf-8 #cesu8 #valid
encoding-next

Character encoding support for Rust

v0.3.0 1.2K #unicode #charset #ascii #iso-8859-1
bpetok

CLI for tokenizing text input using Byte Pair Encoding (BPE)

v0.1.2 app #tokenize #bpe #cli #text #tokenizer
utf64

encode utf-8 strings into utf-64, and decode them back

v1.0.2 #string #unicode #traits #utility #unicode-text #text
rush

shell

v0.1.14 app #shell #rush #highlighting #experience #mode #multiplexer #startup #s-hell
kalosm-language

A set of pretrained language models

v0.4.1 250 #artificial-intelligence #llama #llm #mistral #nlp
cocomo

(Constructive Cost Model) CLI utility and library

v0.10.3 bin+lib #tokei #loc #cloc #scc #sloc #month #arguments
nib-cli

A cli for a yet another static site generator Nib

v0.0.5 app #cli #text #nib #config
hyperscan

bindings for Rust with Multiple Pattern and Streaming Scan

v0.3.2 3.8K #hyperscan #streaming #regex #scan #run-time
jpush

集成极光App推送

v0.3.0 #jpush #channel #notifications #集成极光app推 #toml
symbolic_expressions

A symbolic-expression parser/writer

v5.0.3 56K #s-exp #symbolic-expression #ki-cad
clima

A minimal Markdown reader in the terminal

v1.1.1 800 app #markdown #clima #md #terminal #skin
tergo-formatter

Formatter for tergo

v0.2.10 #tergo #formatter #tergo-formatter #project
simple-string-patterns

Makes it easier to match, split and extract strings in Rust without regular expressions. The parallel string-patterns crate provides extensions to work with regular expressions via the Regex library

v0.3.17 #pattern #expression #character #set #string #bounds-builder #enums #floats
epson

support for communicating with Epson brand thermal POS printers

v0.1.12 #printing #epson #model #write #scheme #async-write
badascii

Backend rendering library for BadASCII diagrams. Block diagrams in ASCII.

v0.2.0 380 #ascii #mdbook #block #diagram #plugin #mdbook-plugins #jobs
code_generator

A code generator (Currently only targets C)

v0.2.0 #codegen #generator #generation
wikidot-normalize

provide Wikidot-compatible string normalization

v0.12.0 10K #normalization #normal #slug #wikidot
droid-wrap

用于Rust的Android API的高级封装

v0.4.0 #java-jni #sdk #android #java #jni
advancedresearch-translate

translation or reading ancient texts in their original language

v0.1.1 #text-translation #research #language #text #ancient
utf16string

String types to work directly with UTF-16 encoded strings

v0.2.0 154K #utf-16 #wstring #string
superfold

A multilingual Rust library and CLI to process UTF-8 strings to exclude diacritics and fold non-phonetic graphemes into their phonetic ASCII representation

v0.1.1 280 bin+lib #fold #ascii #unicode #transliteration #cli
hat-splitter

HAT splitter

v0.1.10 150 #splitter #hat #hat-splitter
ean-rs

generating and validating EAN barcodes

v0.2.2 #barcode #ean #ean-rs #codes
tu

CLI tool to convert a natural language date/time string to UTC

v0.3.0 bin+lib #date-time #utc #nlp #time
md-tui

A terminal markdown viewer

v0.8.7 bin+lib #tui-viewer #viewer #tui #markdown
drova_sdk

Sdk for absolute converter of formats for dalet

v3.0.1 1.0K #drova #sdk #dalet
filenamify

Convert a string to a valid filename

v0.1.2 2.0K #filename #normalize #filenamify
xee-ir

Xee intermediate representation and compilation to bytecode

v0.1.4 1.4K #xslt #xpath #xml #xee #bytecode
share-clipboard-rs

A lightweight, cross-platform utility written in Rust to seamlessly share your clipboard content across multiple devices on your local network

v0.1.1 270 app #share-clipboard-rs #clipboard #share #compatibility
substring

method for string types

v1.4.5 113K no-std #substring #string #slice
repr

The regular-expression-as-linear-logic interpretation and its implementation

v0.7.0 nightly no-std #regex #repr #complete #index #interval #kinds
eliza

natural language processing program developed by Joseph Weizenbaum in 1966

v2.0.1 bin+lib #chat-bot #nlp #linguistics #weizenbaum
rewrite

Safely rewrite file contents from stdin, even when file is open as an input

v1.0.0 app #rewrite #redirect #in-place #sponge
mdbook-external-links

Open external links inside your mdBooks in a different tab

v0.1.2 app #mdbook #link #external #mdbook-plugins #tabs
kalosm-streams

A set of streams for pretrained models in Kalosm

v0.4.0 270 #kalosm #kalosm-streams #stream #whisper
mdsh

Markdown shell pre-processor

v0.7.0 bin+lib #shell #markdown #pre-processor
stellar-axelar-std

Standard libraries for Axelar contracts

v1.1.2 200 no-std #stellar #stellar-axelar-std #std
mdbook-codeblocks

A mdbook preprocessor to prepend customizable vignette to code blocks

v0.1.21 app #mdbook-preprocessor #mdbook #mdbook-pre-processor #code-block
screeps-body-utils

Adds calculation functionality related to creep bodies in Screeps: World

v0.1.1 280 #world #body #part
libarc2

Low-level interface library for ArC TWO™

v0.6.0 #libarc2 #instructions #channel #ar-c2
gen-mdbook-summary

generate SUMMARY.md for mdbook project

v0.0.5 app #gen-mdbook-summary #summary #ignore #file
lister-cli

Lister: Navigate Markdown Lists

v0.1.4 bin+lib #list #ui #markdown
pdf

PDF reader

v0.9.0 12K #pdf #reader #color-space
unleash-types

API types for Unleash (https://github.com/Unleash/unleash) client features API response

v0.15.14 1.0K #response #unleash #unleash-types #metrics #api
html-auto-p

function like wpautop in Wordpress. It uses a group of regex replaces used to identify text formatted with newlines and replace double line-breaks with HTML paragraph tags.

v0.2.4 #html #br #paragraph #wpautop #autop
trxx

pack and unpack text files

v0.1.5 200 app #trxx #svg #locking #jpeg #png #个用于文本 #包和还原的 #件和图片文件 #功能特点 #将目录下的
fmtt

A diff-friendly text formatter that breaks lines on sensible punctuations and words to fit a line width

v0.8.0 190 bin+lib #fmtt #paragraph #text #body #figure #content #begin #pdf #end #lorem
px-wsdom-ts-convert

wsdom crate

v0.0.3 bin+lib #wsdom #convert #px-wsdom-ts-convert #demo #live-everything
md-ulb-pwrap

Markdown paragraph wrapper using Unicode Line Breaking Algorithm

v0.1.3 130 #ulb #pwrap #markdown #python #unicode
mkwebsite

build websites using markdown

v0.6.0 app #mkwebsite #markdown
prompt-input

lightweight library for user input prompts in Rust, designed to make input handling straightforward

v1.0.0 #user-input #prompt #cli #input #user
pulldown-cmark-escape

An escape library for HTML created in the pulldown-cmark project

v0.11.0 442K #common-mark #escaping #markdown #html #render-markdown
jom

convert JSON data to markdown by replacing placeholders with JSON values

v0.1.4 270 bin+lib #jom #detail #kubernetes #json-to-markdown
pangu2

Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)

v0.2.0 130 #spacing #pangu #pangu2 #objective-c #clojure #elixir #go #java #browser #php
float-pretty-print

Format f64 for showing to user, not for serialisation

v0.1.1 524K #pretty-print #float #format #pretty #human #serialization
simstring_rust

A native Rust implementation of the SimString algorithm

v0.1.2 #string-matching #nlp #simstring #cpmerge #algorithm #hash-db #ngrams #cosine
str_inflector

Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…

v0.12.0 22K #inflection #pluralize #foo-bar #snake #camel #inflector
minimo

terminal ui library combining alot of things from here and there and making it slightly easier to play with

v0.5.42 190 #printing #terminal #color #cli
tre-regex

Rust safe bindings to the TRE regex module

v0.4.2 150 #regex #safe-bindings #tre #api-bindings
rwkv-tokenizer

A fast RWKV Tokenizer

v0.9.1 350 #tokenize #rwkv-tokenizer #tokenizer #world-tokenizer
vidyut-kosha

A Sanskrit key-value store

v0.2.0 #sanskrit #kosha #store #lexicon #nlp
erebus

A CLI message generation library

v0.1.8 #erebus #panic
text-editing

string with utilities for editing

v0.2.2 #text-editing #editing #text #text-line
poriborton

Interconversion between Unicode and various Bengali ANSI encodings

v0.2.3 #unicode #ascii #ansi #bijoy #bengali
mdast_util_to_markdown

Markdown to AST

v0.0.2 420 #markdown #ast #extension #common-mark #extensions #markdown-rs
quranize

Encoding transliterations into Quran forms

v1.0.0 #quranize #suffix-tree #text #quran #wasm
fast_symspell

Spelling correction & Fuzzy search

v0.1.10 bin+lib #spell-check #symspell #edit-distance #verbosity #sym-spell #strategy #spell-checking #spellcheck
makepad-rustybuzz

A complete harfbuzz shaping algorithm port to Rust

v0.8.0 350 no-std #true-type #opentype #text-shaping #shaping
flat_string

FlatString is fixed allocated size String that that can be created direcly on the stack

v1.0.1 #string #stack-allocated #stack-string #local-string #flat-structure
renamer-rs

process and rename files or text

v0.3.0 500 #delimiter #selector #input-type #text
n_gram

training n-gram language models

v0.1.12 #ngrams #simple #lm #eos #corpus #model
mdbook-repl

based mdbook preprocessor that allows you to execute code in your mdbook without any server. Python, Typescript, Javascript etc.

v0.2.6 app #mdbook-preprocessor #mdbook #repl #plugin #mdbook-pre-processor
pulldown-html-ext-cli

CLI tool for extended HTML rendering of Markdown with pulldown-cmark

v0.5.0 app #html #pulldown-html-ext-cli #pulldown-cmark
tiny-clean

A lightweight, high-performance string sanitizer with configurable rules

v0.1.0 110 #encoder #sanitizer #string #utility #text
keyphrases

Rapid Automatic Keyword Extraction (RAKE) implementation in Rust

v0.3.3 410 #extract #keyphrases #nlp #keyword #rake
tiefdownlib

manage and convert TiefDown projects

v0.8.1 290 #pandoc #document-conversion #markdown #latex #converter #filter #template
maybe-regex

Wrapper for strings that may be either a regex or a plain-text string

v0.2.1 #string #utility #regex
utf16_lit

macro_rules to make utf-16 literals

v2.0.2 110K #utf-16 #lit #utf16-lit #utf16-null
sgrep

grep util for those lazy to remember many command line options

v1.0.9 420 app #sgrep #directory #insensitive #bash
textpod

Local, web-based notetaking app inspired by 'One Big Text File' idea

v0.1.5 app #file #textpod #attachment #markdown #copy
wecom-agent

企业微信API的轻封装，让消息发送更加便捷。

v0.1.16 #wecom #企业微信api的 #agent #便捷 #让消息发送更 #content
somedoc

A very simple document model and markup generator

v0.2.10 800 #somedoc #model #markdown-flavor #writer
spongebob

convert text to spongebob case a.k.a tHe MoCkInG sPoNgEbOb MeMe

v2.0.1 bin+lib #spongebob #world #wo-rld #wo-rl-d
morse_n_s

Test program that plays Morse code "N"s using Rust and CPAL, inspired by its use in historical aviation communications, including transmissions by Amelia Earhart

v0.1.0 app #morse #morse-n-s #character
iregex

Intermediate representation for Regular Expressions

v0.1.3 #iregex
mylibrary_

my personal library

v1.2.7 #mylibrary #regex #algorithm
huozi

typography engine for CJK languages, especially designed for game rich-text

v0.8.0 #huozi #text-section #true-type #field #otf #总览 #活huó字zì-rust #开发中 #删除线 #输出为图片或
bubble-bath

Small and quick HTML sanitizer

v0.2.1 130 #input-validation #html #xss #security
domrs

Document builder and serializer

v0.0.17 190 #serialization #css #svg #html #builder #web-page #serializer
hubble

Official Hubble plugin SDK for Rust

v0.1.2 420 #hubble #assume #safety
owned_str

Provide a stack allocated String for no-std or const environement

v0.1.2 #owned #string #owned-str #push-str #unsized-str #environement #hello #buff #world
duvet

A requirements traceability tool

v0.4.1 500 bin+lib #tool #duvet #report #start #phase #repository
mdbook_fork4ls

Fork of mdBook for mdBook_LS

v0.4.48 2.4K bin+lib #rust-book #mdbook #gitbook #book #markdown
fetch-catnip

fetch displaying system information and a cute cat

v0.2.3 app #fetch #fetch-catnip #distro #color #customization #cat #come #system-information #command-line #statistics
moto

motivated automation

v0.2.29 bin+lib #automation #moto #run-time #variables #task #block #system
readability

Port of arc90's readability project to rust

v0.3.0 17K #readability #extractor #readability-rs #toml
tantivy-stemmers

A collection of Tantivy stemmer tokenizers

v0.4.0 100 #tokenize #stemmer #tantivy #tokenizer #algorithm
quake_text

Utils for Quake strings and characters

v0.3.0 #quake-world #quake #string #text
frawk

an efficient Awk-like language

v0.4.8 app #tsv #csv-tsv #awk #csv #language
deindent

A command line utility and Rust library to format overly-indented text

v1.0.1 bin+lib #indentation #formatter #deindent #indent #clipboard
asciidoctor-client

A kludge to improve the performance of static site generators that use asciidoc through its cli

v0.4.3 bin+lib #client #asciidoctor-client #asciidoctor #cli #server
cedarwood

efficiently-updatable double-array trie in Rust (ported from cedar)

v0.4.6 40K #cedar #trie #string #string-search #search #text-search #text
tectonic

A modernized, complete, embeddable TeX/LaTeX engine. Tectonic is forked from the XeTeX extension to the classic “Web2C” implementation of TeX and uses the TeXLive distribution of support files.

v0.15.0 700 bin+lib #typesetting #latex #tectonic #tex #font
swift-check

High-performance, robust, and expressive searching and validation (uses SIMD on x86_64, aarch64, and WASM)

v0.2.1 600 no-std #search #validation #simd #no-alloc
giff

Visualizes the differences between the current HEAD and a specified branch in a git repository using a formatted table output in your terminal. The differences are displayed with color-coded…

v0.2.1 app #cmd #git #diff #head
filename-refactor

Command to refactor file names

v0.2.1 app #name #translation #filename #subcommand #names #f2h #character #command
mdbook-curly-quotes

mdBook preprocessor that replaces straight quotes with curlyquotes, except within code blocks or code spans

v0.4.37 app #markdown #mdbook #quote #mdbook-plugins
text_utils_s

edit array. Example delete duplicate in array. Clear string

v0.1.5 #string #regex #unique #deduplicate #collection
colored_text

adding colors and styles to terminal text

v0.3.0 #ansi-term #terminal-colors #ansi-terminal #text-formatting #terminal
overlap-chunk

splitting text into chunks of specified size with adjustable overlap percentage

v0.0.3 bin+lib #overlap #text #chunking #size
inflections

High performance inflection transformation library for changing properties of words like the case

v1.1.1 797K #camel-case #inflection #traits #inflect #camel #case
re_ui

Rerun GUI theme and helpers, built around egui

v0.23.2 24K #re-run #egui #icons #multimodal
lnk

parse and write Windows shortcut files (.lnk)

v0.6.1 750 bin+lib #lnk #shell-link #unicode
mdbook-merjong

A preprocessor for mdbook to add merjong support

v0.1.1 bin+lib #mdbook #merjong #mdbook-merjong #mdbook-plugins
wdl-ast

An abstract syntax tree for Workflow Description Language (WDL) documents

v0.12.1 1.0K #document #wdl #ast #documents
libchai

汉字编码优化算法

v0.2.6 310 bin+lib #libchai #汉字编码输入 #汉字编码优化 #案优化算法 #的图形界面来 #赖来使用 #项目中安装为 #字自动拆分系 #后者可以通过 #以及基于退火
editdistancek

Fast algorithm for computing edit distance

v1.0.2 9.8K #edit-distance #text #editdistancek
vds

Visibly distinguishable string types for identifiers and codes

v1.0.3 200 no-std #serde #identifier #string #code #no-std
aki-mcycle

mark up text with cycling color

v0.1.29 bin+lib #text #filter #color
yore

decoding/encoding character sets according to OEM code pages

v1.1.0 7.1K #charset #page #cp864 #api #pages #encoding
vidyut-lipi

A Sanskrit transliterator

v0.2.0 bin+lib #sanskrit #transliterator #vidyut-lipi #scheme
notion2md

converting Notion pages to Markdown

v0.1.0-alpha.3 400 #markdown-converter #markdown #documentation #notion #converter
agentai

designed to simplify the creation of AI agents

v0.1.4 110 #chatgpt #agent #generative-ai #gemini
quicksilverx

easy to use grep clone

v0.1.0 app #quicksilverx #clone
uwurs

UwUify your strings with uwurs!

v0.3.4 130 #uwurs #mapping #emoji #probability #emoticon #interjections #text
ADA_Standards

help you handle checks on your ADA projects, especially good to build scripts to check coding standards conformity

v0.3.0 170 #string-parser #analysis #code #ada #string
xfont

font query

v0.3.0 180 #xfont #macos-ios #linux #a-font-matcher-match #windows
array_tool

Helper methods for processing collections

v1.0.3 22K #string #grapheme #vector #unique #substitution #collection
corlib

A various ideas library

v0.4.1 #events #idea #various #general #non-option
svgbob

Transform your ascii diagrams into happy little SVG

v0.7.6 3.6K #ascii #svg #diagram #bob #text
fontspector-profile-universal

Fontspector checks for OpenType font best practices

v1.0.2 420 #profile #fontspector #universal #component #practice #version #below
asdi

Simplistic Datalog Implementation (in Rust)

v0.2.5 #asdi #predicate #variables #datalog #inference #logic-programming
tform

format plain text into well-structured Markdown or HTML

v0.1.1 #markdown #tform #formatter #config #io
cobweb_asset_format

COB definition with parsing and ser/de

v0.2.0 240 #container #key #alias #scene #character #manifest #import #global #defs #comments
mdbook-hints

mdBook preprocessor to add hover hints to your book

v0.1.5 bin+lib #mdbook #mdbook-preprocessor #hint #tooltip #mdbook-pre-processor
mdopen

Preview markdown files in a browser

v0.5.0 app #markdown #browser #mdopen
rustfmt-nightly

find and fix Rust formatting issues

v1.4.21 500 bin+lib #rustfmt #issue #rustfmt-nightly
scx_rustland

BPF component (dispatcher) that implements the low level sched-ext functionalities and a user-space counterpart (scheduler), written in Rust, that implements the actual scheduling policy…

v1.0.12 150 app #tree #sched-ext #sched #scx #com #case
unicode-display-width

Unicode 15.1.0 compliant utility for determining the number of columns required to display an arbitrary string

v0.3.0 1.5K #unicode-width #unicode #east-asian-width #wcwidth #wcswidth #width
iregex-syntax

Common syntax for regular expressions

v0.1.3 #regex #iregex-syntax #syntax
notmecab

tokenizing text with mecab dictionaries. Not a mecab wrapper.

v0.5.1 #notmecab #持っ #これ
swamp-vm-instr-build

builds opcodes for the swamp vm

v0.1.16 #swamp-vm-instr-build #vm #build #issue #terms #engine #embedding
svgdx-pandoc

pandoc filter for svgdx codeblocks in Markdown

v0.5.0 120 app #pandoc #svg #diagram #svgdx
enc-check

inspect utf-8 and utf-16 character encodings

v0.2.1 app #unicode #utf-8 #enc-check #encoding #inspect
mdlink

Auto-convert HTTP links for your favorite services into nice Markdown links

v0.2.12 app #link #mdlink #links
cbfr

A buffer that run on stack, focusing on performance and speed

v0.1.6 bin+lib #text #buffer #byte #string
linebreak

breaking a given text into lines within a specified width

v0.3.1 #line-break #wrap #break #line
lexical-sort

Sort Unicode strings lexically

v0.3.1 137K no-std #sorting #transliteration #unicode #lexicographical #no-std
rust-regex-dsl-creator

Regular expression DSL derive macros

v0.1.8 bin+lib #dsl #regex #derive
cai

The fastest CLI tool for prompting LLMs

v0.10.0 bin+lib #artificial-intelligence #llm #ml #gpt #cli
syllabize-es

Syllabize Spanish text, and much more

v0.5.2 nightly bin+lib #syllable #spanish #text #syllabize
subtitler

parsing and generating subtitles

v0.0.4 bin+lib #vtt #srt #subtitle
gte-rs

Text embedding and re-ranking pipelines

v0.9.1 #nlp #text-embeddings #reranking #pipeline #model
colonnade

format tabular data for display

v1.3.3 #text #table #alignment #wrap #justify
ripsecrets

A command-line tool to prevent committing secret keys into your source code

v0.1.9 bin+lib #secret #security #ripsecrets #search
lgtmeow

🐾 —— 「本喵觉得很不错～」

v0.6.1 app #meow #lgtm #cli #emoji-kitchen
rustpython-parser-vendored

RustPython parser vendored third-party crates

v0.4.0 61K #vendored #python #rustpython
crankshaft-config

Configuration facilities for Crankshaft

v0.2.0 3.1K #crankshaft #crankshaft-config #task
asimov-brightdata-module

ASIMOV module for data import powered by the Bright Data web data platform

v0.0.2 230 no-std bin+lib #artificial-intelligence #asimov-module #asimov #api-bindings #dataset
cloc

Count, or compute differences of, lines of source code and comments

v0.6.2 app #cloc #e-g #secs #multi #这个语言的 #没有就不填
extract-strings

Extract ascii strings from files

v0.4.0 bin+lib #string #ascii #ascii-text
raylib_interactive

An interactive library for Raylib

v0.1.5 #raylib #interactive #button
chord3

Create pdf songbooks from chopro source

v0.3.4 app #lyrics #music #guitar #chopro #mandolin
mdfried

A markdown viewer for the terminal that renders images and big headers

v0.12.1 220 app #header #markdown #mdfried #sixel #ratatui #kitty
tag2upload-service-manager

Debian tag2upload service manager

v0.1.1 bin+lib #manager #service-manager #service
jawk

JSON AWK

v0.1.15 bin+lib #jawk #array #arguments #awk
aho-corasick-unsafe

Fast multiple substring searching

v0.0.4 no-std #aho-corasick #multi #search-pattern #string-search #string #text-search #pattern #text
lemmeknow

Identify any mysterious text or analyze strings from a file

v0.8.0 950 bin+lib #cryptography #regex #security #identify #forensics
aristech-nlp-client

client library for the Aristech Natrual Language Processing API

v1.0.2 440 #real-time-streaming #nlp #client-library #api-bindings
pragmatic-segmenter

Rust port of pySBD v3.1.0

v0.1.3 #sentence #boundary #nlp #segmentation #sbd
lyon_extra

Various optional utilities for the lyon crate

v1.0.3 53K #lyon #lyon-extra #numbers
ahtml-from-markdown

Convert Markdown to ahtml HTML element trees

v0.1.0 #markdown #tree #ahtml-from-markdown #website
rust_file_encode_mode_convert

这是一个rust的库，用于检测文件的编码格式。支持GBK,GBK2312 , UTF8, UTF16LE, UTF16BE, UTF8+BOM,UTF32 等多种编码格式。

v11.45.14 bin+lib #charset #unicode #convert #等多种编码格
mchr

Lenient implementations of encodings. Zero allocations, zero dependencies!

v0.1.7 130 #mchr
tectonic_bridge_core

Exposing core backend APIs to the Tectonic C/C++ code

v0.4.1 500 sys #tectonic #tectonic-bridge-core #bridge #path #xetex #typesetting #single #component #unused-imports
httpwg

Test cases for RFC 9113 (HTTP/2)

v0.2.7 #http2 #ascii #httpwg #loona
unaccent

remove accents from strings, inspired by PostgreSQL's unaccent extension

v0.1.1 850 #unicode-normalization #diacritics #string-utils #text-processing #normalization #unicode
mdbook-nice

A mdbook plugin to add nice css to your book

v0.1.0 app #mdbook-nice #nice #book
lindera-unidic-builder

A Japanese morphological dictionary builder for UniDic

v0.32.3 12K #morphological #japanese #builder #dictionary #unidic
string-replace-all

String replacement utility inspired by JavaScript, allowing pattern-based substitutions with support for both exact matches and regex patterns

v0.2.1 #regex #string #string-replace-all
furze

finite state transducers (fst) writen in rust

v0.1.1 #search-engine #fst #builder
alphabet_detector

Natural language alphabet detection library

v0.7.0 460 bin+lib #language #word #split #match #unicode
libannict

Annict API のクライアントライブラリ

v0.3.0 #anime #annict #ライブラリ #のクライアン #作品の
soft-ascii-string

char/str/string wrappers which add a "is-ascii" soft constraint

v1.1.0 #ascii #ascii-text #string #safe #ascii-string #constraints #bug
scraps_libs

Scraps is a static site generator based on Markdown files written with simple Wiki-link notation, designed for personal and team knowledge management

v0.23.1 1.2K #scraps #tags #libs #markdown #wiki #static-site-generator #personal-knowledge-management
caseless

Unicode caseless matching

v0.2.2 76K #matching #caseless #rust-caseless
nuhound

Improve error handling capability

v0.2.0 160 #error #debugging #result #options
lingua-german-language-model

The German language model for Lingua, an accurate natural language detection library

v1.2.0 18K #language-recognition #lingua #language-detection #nlp
wai-parser

Parser for WAI syntax

v0.2.3 64K #wai #syntax #interface
runiq

An efficient way to filter duplicate lines from input, à la uniq

v2.0.0 bin+lib #unique #filtering #logging
cronus_spec

The definitions for cronus API spec

v0.4.4 240 #typescript #specification #cronus #axum #openapi #async-trait
diacritics

Remove diacritics from letters, for example when standardizing input for a search

v0.2.2 1.1K #text-search #diacritics #search #text #normalize
deliminator

Universal code documentation generator

v0.3.1 bin+lib #deliminator #list #tags #versatile #repl #bash #md #n-a #txt
rins_markdown_parser

markdown parser written on Rust

v0.1.2 bin+lib #markdown-parser #parser #rins-markdown-parser #paragraph #grammar #heading #rules #image #link #console
reason-shell

Reason: A Shell for Research Papers

v0.3.10 app #paper #reason #title #academic-paper #config #command-line #research #chowdhury #printf
expunge

redact and transform struct fields declaratively

v0.3.4 34K #sensitive #zeroize #redact #secret #pii
streplace

A tiny library for matching and replacing in strings and slices with user-defined functions

v1.0.0 100 #streplace #matchable #testing
chamkho

Khmer, Lao, Myanmar, and Thai word segmentation/breaking library and command line

v1.4.3 110 app #thai #nlp #library #lao #text
unicode-intervals

Search for Unicode code points intervals by including/excluding categories, ranges, and custom characters sets

v0.2.0 #code-point #interval #unicode #unicode-category #lowercase-letter #include-characters #max-codepoint #codepoint
facebookexperimental/hgproto

A Scalable, User-Friendly Source Control System

GitHub 0.1.0 260K #hgproto #parser #complete #scm #protocols
bfom-lib

Brendan's Flavor of Markdown: I'll build my own markdown format, what could go wrong?

v0.1.52 #markdown #bfom #bfom-lib #wrong
cli_app_capo

CLI application with Unix-like tools

v0.1.2 app #command-line-tool #unix #cli
analiticcl

approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation

v0.4.8 bin+lib #spelling-correction #nlp #linguistics #spell-check #levenshtein #text-processing
iroh-test

Internal utilities to support testing of iroh

v0.31.0 #iroh #iroh-test #testing
bwrap

A fast, lightweight, embedded systems-friendly library for wrapping text

v1.3.0 5.4K no-std #wrap #formatting #line-feed #80-column #no-std
maelstrom-plot

Fork of egui_plot with added stacked line graph functionality

v0.14.0 #plot #maelstrom #maelstrom-plot #cargo-subcommand #maelstrom-web #egui-plot #golang #distributed-systems #python #pytest
extstd

intended as an extension of the standard library

v0.6.0 130 bin+lib #extstd
poppler

Wrapper for the GPL-licensed Poppler PDF rendering library

v0.6.0 440 #poppler #pdf #bindings #cairo #libpoppler
textwrap-macros

procedural macros to use textwrap utilities at compile time

v0.3.0 4.3K no-std #text-formatting #macro #typesetting #wrap
typeshare-cli

Command Line Tool for generating language files with typeshare

v1.13.2 3.0K app #typeshare #typeshare-cli #target-os #cli
ere

A compile-time alternative for POSIX extended regular expressions

v0.1.0 120 #ere #work-in-progress #in-progress
wit-bindgen-markdown

Markdown generator for WIT and the component model, typically used through the wit-bindgen-cli crate

v0.42.1 3.0K #wit-bindgen #wasi #markdown
emoji

Every emoji, their metadata, and localized annotations

v0.2.1 1.6K #man #woman #kiss #annotations #variant #glyph #name #language #version
vader_sentiment

Bindings for Rust from the original Python VaderSentiment analysis tool

v0.1.1 650 bin+lib #sentiment #vader-sentiment #demo #analyse #vader-sentiment-analysis #lol #content #words-phrases #sux #kinda
vibrato

viterbi-based accelerated tokenizer

v0.5.2 1.3K #japanese #tokenize #analyzer #morphological #tokenizer
src2md

Turn source code into a Markdown document with syntax highlighting, or extract it back

v0.1.4 bin+lib #markdown #extract #documentation #code
unic-ucd-ident

UNIC — Unicode Character Database — Identifier Properties

v0.9.0 384K #character-property #unic #unicode #unicode-text #text
linkcheck2

extracting and validating links

v0.8.0 #link-checker #linkcheck #link #check #links
turn-uppercase

Small command to uppercase text in command line and copy to clipboard

v0.1.1 app #turn-uppercase #clipboard #upper-case
asimov-config-cli

ASIMOV Configuration Command-Line Interface (CLI)

v25.0.0-dev.0 app #asimov #cli #artificial-intelligence
ascii-img-cli

Command-line tool for using ascii-img

v0.1.4 410 app #ascii-img #cli #ascii-img-cli #ascii
pullup

Convert between markup formats

v0.3.8 #format #pullup #events #formats
lowcharts

draw low-resolution graphs in terminal

v0.5.8 17K bin+lib #graph #grep #plot #troubleshooting #text #console
vyder_std

Standard library for vyder

v0.3.4 #vyder #vyder-std #std
ctreg

Compile-time regular expressions the way they were always meant to be

v1.0.3 7.3K #regex #ctreg #greeting
uniquewords-rs

Count the frequencies of words in text file(s) or stdin

v0.9.1 app #stdin #uniquewords-rs #file #pre-processor
gregex

Regex solver utilizing NFA

v0.7.2 #regex-automata #nfa-automata #regex #nfa
str-utils

some traits to extend types which implement AsRef<[u8]> or AsRef<str>

v0.1.7 600 no-std #ascii-text #ascii-string #string #ascii #starts-with #caseless #ends-with
named_entity_parsing

Named entity parser. Used in Rusev to parse a list of tokens into a list of entities.

v0.4.0 #nlp #ner #seq-eval
context-notation

Featherweight semantic notation for text

v0.1.4 130 #text #context #notation
notan_draw

2D API for Notan

v0.13.0 500 #notan #draw #notan-draw
hidden_watermark

Hidden Watermark in Rust

v0.1.8 #watermark #blind-watermark #text #image
mdbook-kroki-preprocessor

render kroki diagrams from files or code blocks in mdbook

v0.2.0 100 app #mdbook #kroki #diagram #proprocessor
fast_whitespace_collapse

Collapse consecutive spaces and tabs into a single space using SIMD

v0.1.0 #white-space #collapse #simd
sydney

Vim-like, Command-line Gemini Client

v0.1.11 app #gemini-client #client #gemini #ui #command
pingmoji

Useless CLI utility that parses chains of emojis and bitwise operations as ipv4 addresses and pings the result

v1.0.0 140 app #pingmoji
iconv-native

A lightweight text encoding converter based on platform native API or libiconv

v0.1.0 #unicode #iconv #wasm
minigrep_jeck

minigrep is a grep clone that takes a query and searches for the query in the file; with added support for regex

v0.1.1 bin+lib #mini-grep #minigrep-jeck #jeck
dmos-cli

Djot HTML renderer with advanced features - CLI

v0.6.1 280 app #syntax-highlighting #djot #dmos #cli #file #stdin #highlighting #handler #syntax #anchor
casile

The command line interface to the CaSILE toolkit, a book publishing workflow employing SILE and other wizardry

v0.14.4 bin+lib #casile #ci #pdf #sile #status #setup #wizardry #toolkit #pdf-generation #typesetting
ascii-canvas

canvas for drawing lines and styled text and emitting to the terminal

v4.0.0 1.4M #terminal #ascii #canvas
stylish-ansi

stylish helpers for writing styles as ANSI escape codes

v0.1.2 4.8K no-std #stylish #ansi #stylish-ansi #codes #string
eddie

Fast and well-tested implementations of edit distance/string similarity metrics: Levenshtein, Damerau-Levenshtein, Hamming, Jaro, and Jaro-Winkler

v0.4.2 130 #levenshtein #hamming #text #jaro
appendlist

An append-only list that preserves references to its elements

v1.4.0 12K #element #appendlist #borrowing #list #elements #case #structure
bump-bin

Increments version with semver specification

v0.4.3 bin+lib #version-bump #semver #specification #cli #bump
ens-normalize-rs

Ethereum Name Service (ENS) name normalization

v0.1.1 #normalization #ens-normalize-rs #ens
mdbook-trunk

mdBook plugin which bundles packages using Trunk and includes them as iframes

v0.2.3 270 bin+lib #trunk #mdbook-trunk #mdbook #web
bobo_html_parser

parser of html markdown

v0.1.1 bin+lib #html-parser #pest-parser #pest #bobo #grammar #markdown #rules
santoka

Translations of 668 of Taneda Santoka's free-verse haiku

v1.0.2 #haiku-poetry #dataset #literature #japan #poetry #haiku #translator
uklatn

Ukrainian Cyrillic transliteration to Latin script

v1.20.0 #transliteration #ukraine #romanization #script
egg-mode-text

Text parsing for Twitter: character counting, hashtag/mention extraction

v1.15.1 #twitter #extract #egg-mode-text #entities #length #character-count #individually #23 #twitter-text
meddl_translate

Translate German to Meddlfrängisch

v1.2.1 #translation #ignored #meddl-translate #hello #long-text
portmanteau

create portmanteaux

v0.2.2 #portmanteau #vowel #word #portmanteaux #lower-case #long #cases #offensive
dicexp

A Dice Expression Interpreter program and library for parsing (and rolling) role-playing game style dice notations (e.g. "2d8+5")

v1.1.1 110 bin+lib #dice #roll-dice #ttrpg #2d8-5 #dice-bag
tailcall-valid

validating multiple inputs, collecting all possible errors instead of failing at the first error. Useful for scenarios where comprehensive feedback is required for user inputs or configuration settings.

v0.1.3 270 #validation #tailcall-valid #valid #trace
abbreviation_extractor

extracting abbreviations from text

v0.1.4 #abbreviation #extractor #nlp #biomedical #text-processing #extract
spc-core

A command-line tool for processing and analyzing data from SPC files

v0.1.0 #reserved #spc #units
pandoc_types

Rust port of pandoc-types

v0.6.0 110 #pandoc #pandoc-types #inline
sqdj

sqdj shortens delimited data

v0.2.3 app #shortener #delimited #cli #data
cnv

Command-line tool to convert between units of measurement

v0.8.0 bin+lib #measurement #cnv #convert #km
rustdoc-md

Convert Rust documentation JSON into clean, organized Markdown files

v0.1.0 700 bin+lib #converter #documentation #api #rustdoc #markdown #item
crustdown

A static site generator for markdown content

v0.1.0 150 app #content #crustdown #directory #metadata
codetypo-vars

Source Code Spelling Correction

v0.9.1 #spelling #codetypo #variables #pr #correction #spell-check #development #development-tools
regex_generate

Use regular expressions to generate text

v0.2.3 500 #regex #text-generation #generation
widget-forge

A Widget Based Application Engine for Ascii-Forge

v0.1.0 240 #forge #widgets #ascii-forge
inflector-plus

Adds String based inflections for Rust. Snake, kebab, camel, word, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…

v0.11.7 #inflection #pluralize #foo-bar #snake #camel
hanconv

Convert between Chinese characters variants

v0.3.4 bin+lib #simplified-chinese #traditional-chinese #chinese #utf-8
html-linter

An HTML linting library for checking HTML structure and semantics

v0.1.1 #linter #semantic #html-linter #text-content #rules #pattern #compound
docket

markdown to HTML documentation rendering

v0.7.1 app #rendering #docket #markdown #static-site-generator
b2c2-jis-x-201

UTF-8とJIS-X-201を雑に変換処理する

v0.1.1 #b2c2 #のソースコー #basic言語風の #のプログラミ #ファイルから #ファイルを #グ言語 #utf #8とjis #201を
fmtm

A diff-friendly Markdown formatter that breaks lines on sensible punctuations and words to fit a line width

v0.0.3 170 bin+lib #fmtm #markdown #finally #extension
ngrammatic

Character-oriented ngram generator and fuzzy matching library

v0.4.0 220 #ngrams #fuzzy #shingles #pad
gst-plugin-regex

GStreamer Regular Expression Plugin

v0.13.0 #plugin #regex #gst-plugin-regex
idna-cli

Encode/decode Unicode domain names to/from IDNA ASCII

v0.2.3 bin+lib #ascii #domain #idna-cli #csv #json
gaoya

Locality Sensitive Hashing Data Structures

v0.2.0 950 #dedup #lsh #min-hash #neardup #simhash #similarity #structures #whitespace-split #document
csml_interpreter

The CSML Interpreter is the official interpreter for the CSML programming language, a DSL designed to make it extremely easy to create rich and powerful chatbots

v1.11.2 170 #chat-bot #programming-language #csml #interpreter
formatjson

Formats JSON files

v0.3.1 2.0K bin+lib #json #formatting #formatting-json #formatter
scx_rusty

multi-domain, BPF / user space hybrid scheduler used within sched_ext, which is a Linux kernel feature which enables implementing kernel thread schedulers in BPF and dynamically loading them…

v1.0.12 150 app #sched #scx #load-balancing #ext #com #tree #sched-ext #case
zp

Copy the contents of the source file or the standard output buffer to the clipboard, with support for maintaining a history of copied content, allowing users to easily paste into another file or program

v1.2.1 bin+lib #cmd #copy #copy-to-clipboard #daemon #file
mdi

markdown include

v0.0.39 bin+lib #markdown #mdi #中嵌入版本号
harfbuzz

Rust bindings to the HarfBuzz text shaping engine

v0.6.0 1.6K no-std #font-shaping #opentype #unicode #font #shaping #unicode-text
like

A SQL like style pattern matching

v0.3.1 3.7K #pattern-matching #like #escaping #pattern #matching
ncase

Enforce a case style

v0.3.2 bin+lib #style #convert #case #convert-text #word
mdbook-fs-summary

Summary generator for mdbook

v0.2.1 120 app #mdbook #summary #markdown #static
biometrics

provide the vitals of a process in the form of counters, gauges, moments, and T-digests

v0.11.0 110 #biometrics #biometrics-pb #updating
ascii-hangman

customizable Hangman game with ASCII-art rewarding for children (desktop version)

v5.7.2 app #ascii-hangman #version #applications #true #licence #ascii-art #getreu #kids
natural

Pure rust library for natural language processing

v0.5.0 24K #natural #soundex #tokenize #classification #tf-idf #padding #ngrams #distance #phonetic #slow
rs3a

Lib for reading and writing 3a format

v1.0.6 #format #rs3a #header #io #rendering
pavex_miette

A custom Miette theme for Pavex CLI errors

v0.1.80 #pavex #miette #pavex-miette
gulagcleaner_rs

Ad removal tool for PDFs

v0.16.0 160 #pdf #gulagcleaner #wuolah #studocu #stucleaner
stringmatch

Allow the use of regular expressions or strings wherever you need string comparison

v0.4.0 14K #compare #regex #string #comparison
treebender

An HDPSG inspired symbolic NLP library for Rust

v0.1.1 bin+lib #nlp #earley #parser #syntax #hdpsg
unicodeit

Converts LaTeX to Unicode (rust port)

v0.2.0 #unicodeit #port #latex
monument_cli

CLI interface to Monument, a fast and flexible composition generator

v0.14.5 bin+lib #monument #music #composition
nu_plugin_regex

nu plugin to search text with regex

v0.13.0 210 app #regex #plugin #groups #nushell-plugin #flatten #nu-shell #num-d #word-w #a-c
unicode-canonical-combining-class

Fast lookup of the Canonical Combining Class property

v1.0.0 5.1K no-std #class #canonical #combining #unicode #no-std #unicode-properties
pulumi_gestalt_core

Core Pulumi Gestalt implementation

v0.0.2 #pulumi-gestalt #language #pulumi-gestalt-core
html_to_epub

A command line converts .html file to .epub file

v0.1.4 bin+lib #epub #html #html-to-epub #title #author #cover
rs-tool

A command-line tool to perform reservoir sampling on a file or a stream

v0.1.1 app #stream #reservoir #logging #statistics #sample
pukram2html

converting Pukram-formatted text to HTML

v0.3.0 bin+lib #markup #html #pukram #text-processing
greek_number

Convert numbers to Greek number strings

v0.1.2 #numbers #greek #string #to-greek-uppercase #numeber #format
tfidf-text-summarizer

extractive text summarization system which uses TF-IDF scores of words present in the text to rank sentences and generate a summary

v0.0.3 #tf-idf #text-summarization #nlp #summary #android
gspell

Rust bindings for gspell

v0.7.0 #gspell #gnome #gtk
kbnf-regex-automata

A forked version of regex-automata for kbnf

v0.4.10 390 no-std #nfa-automata #dfa-automata #regex-automata #regex #dfa
rust_readability

A package to assess the complexity of texts using a variety of readability formulas

v0.2.0 #readability #nlp #lix #flesch-kincaid #coleman-liau #write #rix #txt #string #ari
lindera-cc-cedict-builder

A Chinese morphological dictionary builder for CC-CEDICT

v0.32.3 12K #cc-cedict #builder #morphological #dictionary #chinese
json-predicate

JSON Predicate lib based on draft-snell-json-07

v0.1.16 #predicate #json-predicate #abc #string #draft-snell-json-07 #xyz
hydroperx-utf16

Work with UTF-16 in Rust

v1.0.0 110 #utf-16 #string #hydroperx-utf16
glean

SDK Rust language bindings

v64.3.1 1.5K #telemetry #glean #initialization #bindings #book #documentation #cfg
mdbook-presentation-preprocessor

A preprocessor for utilizing an MDBook as slides for a presentation

v0.3.1 app #pre-processor #mdbook #rust-book #markdown #gitbook
cqtool

converting between CQ strings and message segment arrays

v0.1.0 #cqtool #array #可以完成cq字 #串与消息段数 #之间的 #arrays #将消息转为消 #段数组格式 #将消息转为cq #符串格式
natord-plus-plus

Natural ordering for Rust

v2.0.0 8.6K #sorting-order #natural #natord-plus-plus #sorting #order
rvpacker-txt-rs-lib

providing functions for rvpacker-txt-rs

v5.1.6 230 #processing #txt #lib
simple-logging

logger for the log facade

v2.0.2 15K #logging #simple #facade
fr_alebref_libbrefdata

BrefData library

v0.4.1 #fr-alebref-libbrefdata #libbrefdata #alebref
enum-ts

TypeScript Enum pattern matcher codegen

v0.2.6 app #pattern-match #typescript #mvvm #modeling #match
rawcode

Implements a simple as-is encoding format

v0.3.2 no-std #format #rawcode #utf-8 #struct
trust_pdf

Verifies signed PDFs against the originals, checking for sneaky modifications

v3.0.1 #pdf #trust #added #problem #right #wrong
ferret

A trigram-based tool for detecting similarity in groups of text documents or program code

v1.1.1 bin+lib #similarity #code #plagiarism #text #collusion #document #count
scanlex

lexical scanner for parsing text into tokens

v0.1.4 4.3K #input #scan #tokenize #text
toster

A simple-as-toast tester for C++ solutions to competitive programming exercises

v1.2.2 app #exercise #testing #toster #filename #cpp #directory #once #io #compile-command #character
wikipedia_prosesize

Count Wikipedia prose size

v0.3.0-rc.2 #wikipedia #mediawiki #prosesize #size
iregex-automata

Finite automata definitions for the iregex crate

v0.1.3 #nfa-automata #regex-automata #dfa-automata #regex
mdbook-plantuml

A preprocessor for mdbook which will convert plantuml code blocks into inline SVG diagrams

v0.8.0 950 bin+lib #plant-uml #mdbook #markdown #common-mark #diagram
owlz

"Owlz" ascii emojis, created randomly or by design

v0.1.2 #emoji #owls #owlz #generation
libanubhav

management system written in Rust

v0.2.1 #book #libanubhav #books #id #system #exit #language #nichols #martin
naming_utils

generating naming conventions, pluralizing words, and rest api paths in Rust

v0.1.1 #naming #pluralize #utility #case-conversion #path
jumpcut

CLI for converting Fountain-formatted text files into FDX and HTML formats

v0.7.2 bin+lib #screenwriting #fountain #jumpcut
markdown-extract

Extract sections of a markdown file

v2.0.0 340 bin+lib #extract #markdown #markdown-extract #md #welcome
xml_magic

A reasonably fast XML formatter

v1.0.0 app #xml #cli #formatter #file #style
fimdoc

Firendship is Magic Document, converts Markdown into FIMFiction BBCode

v0.6.1 bin+lib #fimdoc #bbcode #document #obsidian-vault #fiction #mlp #pony #mylittlepony #fanfiction
text-tokenizer

Custom text tokenizer

v0.6.5 260 #tokenize #text-tokenizer #tokenizer
fyi_ansi

Compile-time ANSI formatting macros for FYI

v2.1.1 380 #ansi #csi #ansi-csi
firm_netter

测试，请勿使用！

v0.1.10 380 #firm-netter #测试 #请勿使用 #函数 #sql模块暂未测 #本项目使用的
CLI_Project_Scott_Coakley

CLI Project in Rust

v0.1.0 app #cli #scott #coakley
cozo

A general-purpose, transactional, relational database that uses Datalog and focuses on graph data and algorithms

v0.7.6 2.4K #graph #cozo #token-stream #graph-database #documentation #artificial-intelligence #embedded-database #graph-algorithms #cross-platform #client-server
mdbook-github-authors

mdbook preprocessor to display Github profiles of authors of a page

v0.1.0 120 bin+lib #mdbook-github-authors #author #github #contributors #page #chapter #github-authors #user-name
prescript

parsing and executing Prescript scripts

v0.1.1 #prescript #font #comments #structure #ni-pdf
tgrep

Toy grep that honors .gitignore

v1.6.10 bin+lib #search-pattern #grep #gitignore #pattern
mcp-spec

Core types for Model Context Protocol

v0.1.0 6.0K #protocols #mcp #specification #rust-sdk
clone-spl-token-metadata-interface

Solana Program Library Token Metadata Interface

v0.7.0 170 #interface #spl #metadata #field #emit #authority #key #initialization #state #case
check_build

verify a VCF file against hg19 and hg38 references using a streaming, low-memory approach

v0.1.0 app #check-build #vcf #mismatches
modularize_imports

AST Transforms for import modularizer

v0.86.0 4.3K #import #modularize-imports #modularize #modularizer #plugin
thesaurus

An offline thesaurus library for Rust

v0.5.2 #synonyms #thesaurus #thesaurus-rs #process
noctisroll

Text-based TRPG dice rolling system

v0.1.5 300 #text-based #roll #trpg #roll-dice #system
ascii-izer

converting an image into ASCII art

v0.3.1 #ascii-izer #art #color
timeblok

A language for event scheduling in plain text

v0.5.0 170 #events #ics #timeblok #calendar #text #dsl #compiler #productivity #shorthand #filter
fast-str

A flexible, easy-to-use, immutable, efficient String replacement for Rust

v1.0.0 #string #serialize #serde #serialization #serde-serialize
ps-str

String transcoding library

v0.1.0-2 #ps-str #string
safe-string

safe interface for interacting with multi-byte strings in Rust, namely IndexedStr, IndexedString, and IndexedSlice

v0.2.0 1.0K #safe-strings #indexed-slice #string
markdown-toc

Markdown Table of Contents generator

v0.2.0 bin+lib #markdown #toc #header #table-of-contents #link #generator
lib_tsshow

A visualiser for template-switch alignments

v0.18.0 380 #alignment #switch #lib #aligner
yahv

hex viewer

v0.2.1 app #viewer #yahv
stringer

An easy way to turn an Unsafe *const c_char into a Rust String type and return a pointer

v0.2.0 #c-strings #ffi #string #ffistrings
kproc

Knowledge Processing library

v0.2.1 320 #kproc #processor #codec
hemoglobin

Bloodless

v0.9.2 800 #bloodless #hemoglobin #properties
hephae

A personalized, opinionated Bevy plugin that adds support for drawing and batching arbitrary vertices and indices

v0.7.2 #hephae #risk
kdump

A small utility that disassembles and reads KSM and KO files for use with KerbalOS

v2.0.1 bin+lib #disassembly #output #kdump #text
yozuk

Chatbot for Programmers

v0.22.11 #yozuk #programmers #telegram-bot #chat-bot #text-based #command-line-tool #development-tools #nlp
crowbook

Render a Markdown book in HTML, PDF or Epub

v0.16.1 bin+lib #markdown #book #epub #pdf #latex #html
spezilinter

spezifisch's linter for different file formats, linting for weirdly specific stuff

v1.1.2 bin+lib #spezilinter #once #install
cglue-bindgen

cleanup cbindgen headers for CGlue

v0.3.0 app #c-glue #abi #cbindgen #ffi
crowbook-text-processing

some utilities functions for escaping text (HTML/LaTeX) and formatting it according to typographic rules (smart quotes, ellipsis, french typograhic rules)

v1.1.1 120 bin+lib #text #escaping #latex #rules #meaning
parse2csv

parse log-file and output to stdout as csv file by regex

v0.2.0 app #regex #parse2csv #csv #run #ua
mdbook-langtabs

An mdbook preprocessor that adds language tabs for code blocks

v0.1.1 bin+lib #mdbook-preprocessor #mdbook #mdbook-pre-processor #tabs #markdown #language #block
crud

CLI generator for your favorite CRUD REST API

v0.1.7 #crud #cli #string
slugify-rs

generate slugs from strings

v0.0.3 15K #slug #slugify #macro #string
bitutils2

A package of tools for bit manipulations, including bit indexing, bitfields, and a variation of regular expressions for binary data

v0.1.4 430 #bit-field #regex #bit-fields #bitfields #binary
substring-replace

developer-friendly methods to manipulate strings with character indices

v0.2.2 #substring #substring-replace #string
meme_generator_memes

Meme generator built-in memes

v0.2.1 180 #meme #generator #meme-generator-memes #memes #meme-generator-rs #表情列表 #查看 #用于制作各种 #雕表情包 #表情包生成器
treegrep

A pattern matcher frontend or backend which displays results in a tree

v0.1.4 270 app #tree #grep #regex #search-tree #search
textgridde-rs

dealing with Praat TextGrid files. MIT licensed.

v0.1.5 #file-format #phonetic #linguistics #textgrid #praat
keyvalues-parser

A parser/renderer for vdf text

v0.2.0 7.0K #key-value #vdf #steam #parser #key-values
swon-parol

SWON parser implementation using Parol

v0.1.0 130 #bind #swon #parol
modeling

tools to analysis different languages by Ctags

v0.6.2 bin+lib #ctags #modeling #golang #java #opt #optional #regex #cpp #typescript #result
fmri

IPS package identifier - FMRI

v1.0.3 #fmri #oi #lib #pkg #publisher #version
pray

A tui tool for preparing a prompt to the llms

v1.5.0 app #tui #llm #clipboard #text-processing
zspell-cli

Command line interface for the ZSpell spellchecking library

v0.5.5 app #spell-check #spelling #cli #spell-checking #spellcheck
markdown-to-html

Markdown parser that runs at hyper speeds!

v0.1.3 #markdown-rendering #html #markdown #speeds #prose
mdbook-linkcheck

A backend for mdbook which will check your links for you

v0.7.7 4.1K bin+lib #mdbook #linkcheck #http-header #book #link-check
typope

Pedantic source code checker for orthotypography mistakes and other typographical errors

v0.4.0 bin+lib #spelling #typography #pedantic #error
diffy-imara

Tools for finding and manipulating differences between files

v0.3.2 350 #patch #diff #merge
mitex

TeX2Typst converter

v0.2.4 #tex #io-mitex #converter #typst #math #tool #package #solution #latex
unicode-matching

match Unicode open/close brackets

v0.5.6 160 #brackets #txt #unicode #find-matching
pink_accents

Replacement of patterns in string to simulate speech accents

v0.0.6 bin+lib #replace #accents #rules #text #format
trillium-prometheus

Trillium handler for Prometheus metrics scrapes

v0.3.0 850 #scrape #trillium-prometheus #prometheus #handler #scrapes #run
cynic-querygen

Generates code for using cynic from GraphQL query input

v3.11.0 220 #cynic #input #object
easy-regex

Make long regular expressions like pseudocodes

v0.11.7 #multi-language #meta #readable #easy #regex
mdbook-quiz-validate

Input validation for quizzes used in mdbook-quiz

v0.3.12 370 #markdown #mdbook-quiz #validation #tracing
codebook_downloader

Dictionary downloading utility for the Codebook spell checker

v0.1.0 160 #spell-check #dictionary #download #codebook
samvadsetu

LLM API for commonly used LLM services including Gemini, ChatGPT, and Ollama. The name implies a bridge for dialogue since the library facilitates communication and interaction between…

v0.1.2 #llm #ollama #large-language-model #gemini
jpreprocess

Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)

v0.12.0 210 bin+lib #open-j-talk #text-to-speech #library
rmosh

R6RS & R7RS Scheme Interpreter

v0.0.13 bin+lib #rmosh #interpreter #instructions
asciidork-tck

Asciidork TCK Adapter

v0.20.2 160 bin+lib #asciidork #asciidork-tck #tck #adapter
wdict

Create dictionaries by scraping webpages or crawling local files

v0.1.19 210 bin+lib #web-crawler #resume #wdict #dictionary #web-scraping #ce-wl #ce-w-le-r #word-list #dictionary-attack #wordlist-generator
curlicue

Helix keybinding utilities

v0.1.0 #utilities #curlicue
ascii-webcam

A webcam that visualizes its output as ASCII art directly in the terminal

v0.1.2 bin+lib #ratatui #opencv #webcam #ascii
nipdf-reader

iced pdf GUI reader

v0.1.1 app #nipdf-reader #reader #viewer #structure #ni-pdf
tracery

Text-expansion library

v0.2.1 #text #tracery #rules #execute #string #flatten #macro
markov_str

Markov Chain implementation optimized for text generation

v0.3.0 #markov-chain #text-generation #string #markov
streampager

pager for command output or large files

v0.10.3 800 bin+lib #pager #less #more #config
skyspell_kak

skyspell - kakoune integration

v3.0.1 bin+lib #spell-check #kakoune #skyspell
zipcodes

Query US zipcodes without SQLite

v0.3.4 1.3K #sqlite #state #filter #zipcode #us
runi

a CLI tool to generate unicode fonts

v0.1.4 app #font #generator #unicode #cli #abcdef
sre-engine

A low-level implementation of Python's SRE regex engine

v0.4.3 650 #regex #engine #sre
csmlinterpreter

The CSML (Conversational Standard Meta Language) is a Domain-Specific Language developed for creating conversational experiences easily

v0.3.2 #language-interpreter #chat-bot #csml #events #interpreter
zz-data

Data structures for Zanzarah apis

v0.2.6 #api #zz-data #magic
rust-tfidf

calculate TF-IDF (Term Frequency - Inverse Document Frequency) for generic documents

v1.1.1 #text-document #tf-idf #document #statistics
merge-whitespace

Procedural macros for merging whitespace in const contexts

v1.1.0 macro #white-space #graphql #proc-macro #context
piet-cosmic-text

A text layout engine for piet based on cosmic-text

v0.3.4 no-std #cosmic-text #piet #piet-cosmic-text
utf8toipv4

Convert UTF-8 to ipv4 addresses and vice versa

v1.0.0 150 bin+lib #versa #utf8toipv4 #code-point #package
pulldown_mdbook

A pull parser for mdBook

v0.3.2 #mdbook #pulldown-mdbook #pull-down
ogam

A markup language for story writers

v1.3.0 290 #parser #writer #markup #paragraph
mdbook-variables

mdBook proprocessor for risolve variables configured from book.toml

v0.2.4 1.1K bin+lib #mdbook #variables #markdown #toml #pre-processor
obmrs

As a participant, you will create a structure to receive and hold the exchange-distributed order book. This structure will be called the OrderBoard, and will hold the order book's bids and asks as a price-sorted map…

v0.1.2 #order-book #market-order-book #exchange #market #market-orderbook #book
annotate_celeste_map

rendering celeste maps, and overlaying recorded paths, lobby entrances etc

v0.4.0 bin+lib #map #annotate-celeste-map #lobby-entrances #recording #celestetools #output
linurgy

Manipulate the output of multiple newlines. Replace/Insert/Append newlines with text. Input and output from stdio/files/buffers

v0.6.0 #newlines #text #ending #crlf #stream #newline
sublime_fuzzy

Fuzzy matching algorithm based on Sublime Text's string search

v0.7.0 52K bin+lib #fuzzy-search #match #text-search #search #text
binatime

A binary clock in the terminal

v1.0.1 app #binatime
draw_triangle

A CLI tool to draw equilateral triangles in the terminal

v0.1.2 app #triangle #draw #draw-triangle
writings

The Bahá’í Sacred Writings for use in Rust projects and APIs

v0.1.0 #writings #api #section #source
phonetisaurus-g2p

Phonemization in Rust using a finite state transducer (FST) trained with Phonetisaurus

v0.1.1 #fst #linguistics #phonetisaurus #g2p #phonemizer
utf16_iter

Iterator by char over potentially-invalid UTF-16 in &[u16]

v1.0.5 8.2M #utf-16 #iterator #unicode
onig_sys

onig_sys crate contains raw rust bindings to the oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma…

v69.9.1 709K sys #onig #onig-sys #regex #oniguruma
divvunspell-bin

Spellchecker for ZHFST/BHFST spellers, with case handling and tokenization support

v1.0.0 app #divvunspell #divvunspell-bin #bin #hfst-ospell #accuracy #spell-check
bilingual

A cmdline tool used for markdown translation via calling Chinese translation api cloud services

v0.1.3 bin+lib #bilingual #文本 #文件的 #tags #腾讯 #百度 #小牛 #使用翻译云服 #样式 #文件也包含很
antex

Styled text and tree in terminal

v0.0.8 bin+lib #text #styled #ansi #tree #ascii
litua

Read a text document, receive its tree in Lua and manipulate it before representing it as string

v2.0.0 bin+lib #markup #lua #content-tree #document-generation
rslint_parser

An extremely fast ECMAScript parser made for the rslint project

v0.3.1 6.2K #parser #typescript #events #node #javascript-linter #parallel
aho-corasick

Fast multiple substring searching

v1.1.3 18.0M no-std #multi #aho-corasick #search-pattern #string-search #string #text-search #pattern #text
aki-xtee

copy standard input to each files and standard output

v0.1.25 bin+lib #text #filter #aki-xtee
format-bytes

A macro to format bytestrings

v0.3.0 15K #format-bytes #byte-string #display-bytes
man

Generate structured man pages

v0.3.0 3.8K #page #man #flags #com #author #short #long #output #note #name
regexml

XPath compatible regex engine

v0.2.1 1.5K #xpath #xml-schema #engine #regex #xml
franz

friendly, and blazingly fast alternative to Apache Kafka

v0.7.5 bin+lib #ring-buffer #kafka #franz #queue #protocols #key #instance #netcat
interactive-clap

Interactive mode extension crate to Command Line Arguments Parser (https://crates.io/crates/clap)

v0.3.2 2.4K #interactive-clap #clap #interactive #cli #io-crates-clap
nlf

A CLI to append newline characters (LF) at the end of text file

v0.2.0 app #nlf #简体中文 #txt
orion_cfmt

Format output without Rust code segment in binary to reduce the ultimate binary size

v0.1.2 #size #orion-cfmt #orion #printf #lld #llu #llx
simple-ssg

Plain and simple static site generator for Djot and Markdown light markup languages

v4.1.0 app #ssg #simple-ssg #html #djot #language #site #static-site-generator #effort #goals #resources
acorns

Generate an AsciiDoc release notes document from tracking tickets

v1.0.0 bin+lib #documentation #release-notes #asciidoc #redhat
mdbook-pagetoc

A mdbook plugin that provides a table of contents for each page

v0.2.0 1.3K bin+lib #mdbook #content #toc #table #pagetoc #contents
symspell

Spelling correction & Fuzzy search

v0.4.3 1.9K #spell-check #symspell #edit-distance #verbosity #strategy #spell-checking #spellcheck #spelling-correction
goofy-animals

Generate a name in adjective-adjective-animal form

v0.0.2 no-std bin+lib #random #no-std #naming #random-generator #forms #generator
regex-chunker

Iterate over the data in a Read type in a regular-expression-delimited way

v0.3.0 bin+lib #regex #read #chunking #iterator
choco

markup language for dialogue systems

v0.2.2 #text #system #choco #syntax #contain
mdbook_header_footer

mdBook preprocessor to prepend header and append footer to certain chapters

v0.0.3 180 bin+lib #chapter #mdbook #header
block-list

A minimalist hosts-based tool for managing block lists and ad-blocking

v1.1.4 app #privacy #host #ads #block #privacy-tools
pulldown-html-ext

Extended HTML rendering capabilities for pulldown-cmark

v0.5.0 #element #block #html #class #pulldown-cmark #rendering #writer #config #highlighting #mapping
sbert

Sentence Bert (SBert)

v0.4.1 bin+lib #bert #nlp #transformer #embedding
metatron

core library

v1.1.1 #pdf #template-engine #report-generation #text-report #data-reporting
pprint

Flexible and lightweight pretty printing library for Rust

v0.2.2 #pretty-print #pretty #rust #printing #documentation
mdbook-ai-pocket-reference

mdbook preprocessor for the ai-pocket-reference project

v0.1.3 bin+lib #reference #artificial-intelligence #pocket #ai-pocket-reference #tabs
gstring

String with support for Unicode graphemes

v0.11.0 470 #grapheme #gstring #g-string #string #index #301
swc-neuron

CLI utility for interacting with SWC neuronal morphology files

v0.4.1 bin+lib #swc #neuron #swc-neuron #header
mdbook-ocirun

mdbook preprocessor to run arbitrary commands and code snippets inside containers

v0.2.1 bin+lib #mdbook #mdbook-preprocessor #ocirun #mdbook-pre-processor #snippets #container
freesia

some string operators

v0.1.2 #freesia #trim-whitespace #upper-case
hns

Human numeric sorting program — does what sort -h is supposed to do!

v0.2.0 app #stdin #stdout #coreutils #stdio #numeric-sorting #human-numeric-sort
fhe

Fully Homomorphic Encryption in Rust

v0.1.0-beta.8 #encryption-key #fhe #thread-rng #secret-key
ainconv

Converts Ainu words between different scripts (Katakana, Latin, Cyrillic)

v0.2.0 220 #converter #script #scripting-language #text #ainu
limace

Slugify some strings

v0.1.1 230 #limace #slugifier #string #separator #character #lower-case
matcher_py

A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust

v0.5.7 #multi #string-matching #search-pattern #string-search #text #text-search #string #pattern
eqlog-eqlog

Datalog with equality

v0.8.0 #eqlog #el #equality #rules
regexy

lightweight Rust library for working with regular expressions. The regexy crate provides an easy-to-use interface for matching patterns in strings using regex

v0.2.0 #regex #regexy #text #is-match
broken-md-links

A command-line tool and library to detect broken links in Markdown files

v2.1.1 bin+lib #broken-links #link #broken-md-links #md
ipynb-to-md

Convert Jupyter Notebooks to Markdown files

v0.2.0 app #jupyter-notebook #markdown #convert #notebook #jupyter
schemaorg-rs

Generated Rust types from Schema.org JSON-LD vocabulary

v0.1.0 140 #org #encoding-format #schema-version
pulldown-cmark-mdcat

Render pulldown-cmark events to TTY

v2.7.1 900 #cmark #markdown #cat #less
codepage-strings

encode / decode strings for Windows code pages

v1.0.2 200 #page #codepage #codepage-strings #pages
rustfmt_lib

Rustfmt as a library

v2.0.0-rc.2 nightly #rustfmt #attributes #newlines #formatter #code-formatter
rust-regex-dsl

Regular expression DSL

v0.1.8 #dsl #regex #rust-regex-dsl #why
i_shape_js

Boolean Operations for 2D Polygons. Supported operations: intersection, union, difference, XOR, and self-intersections for all polygon varieties.

v0.7.1 #shape #js #i-shape-js #union #boolean-operations #demo #rotation #editor #self-intersection #polygon-clipping
untanglr

Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies

v1.1.0 bin+lib #split #string #untanglr #dog
recase

Changes the convention case of input text

v0.3.0 500 #text #conventions #recase #case
jfmt

command-line tool for formatting json files in both readable and compact formats. It supports stdin/stdout shell usage, as well as working on files directly.

v1.2.1 app #cli #formatter #json
stam-tools

Command-line tools for working with stand-off annotations on text (STAM)

v0.9.2 420 bin+lib #annotations #nlp #linguistics #standoff #text-processing #annotation #alignment
mdbook-typst-pdf

mdbook typst pdf backend

v0.6.2 250 app #pdf #mdbook #typst #back-end #book
rsnltk

Rust-based Natural Language Toolkit

v0.1.3 bin+lib #nlp #semantic #stanza #nltk #text-analysis
viterbi_pos_tagger

A part-of-speech (POS) tagger using the Viterbi algorithm

v0.1.0 bin+lib #pos #tagger #nlp #part-of-speech
loe

Very fast and yet another line ending (CRLF <-> LF) converter written in Rust

v0.3.0 bin+lib #newlines #lf #eol #crlf
lookbook

Component preview framework for Dioxus

v0.2.0-alpha.1 #dioxus #lookbook #preview #component
ruby_inflector

Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…

v0.0.10 1.5K #inflection #pluralize #snake-case #snake #camel
shaperglot

Test font files for OpenType language support

v1.0.0 #provider #shaperglot #shaping #coverage #interface
vndb_tags_get

convert VNDB tag list (JSON to markdown)

v1.2.1 app #markdown #tags #vndb-tags-get #list
csv-groupby

execute a sql-like group-by on arbitrary text or csv files

v0.10.0 app #csv #regex #report #sql #text
surt-rs

Sort-friendly URI Reordering Transform (SURT)

v0.1.3 bin+lib #normalization #surt #archive #web-archiving #generate-surt #url
less

pager utility for displaying file contents or piped input, with dynamic scrolling and search functionality

v0.1.0 app #pager #viewer #text #cli
stylin

Convert markdown to pandoc markdown with custom styles

v0.9.4 bin+lib #style #paragraph #block #caption #spans #name #title #webp #com-qtfkwk-stylin #pipeline
lformat

Clone of Lua string.format in Rust based on C s(n)printf

v0.2.2 490 #format #string #format-text #text #lua
nfa_regex

NFA regex engine for text processing

v1.0.1 #nfa #regex #nfa-regex
mdbook-auto-gen-summary

A preprocessor and cli tool for mdbook to auto generate summary

v0.1.10 230 app #mdbook #summary #markdown #md #auto-gen-summary
esl01-minibytes

Shared reference-counted bytes with zero-copy slicing support

v0.3.0 1.4K #esl01-minibytes #esl01 #minibytes #scm
r4d

Text oriented macro processor

v3.1.0 210 bin+lib #processor #macro #rad #text-processing #cli
java_string

Java strings, tolerant of invalid UTF-16 encoding

v0.1.2 #java #utf-16 #string #encoding
fast_trie

A memory efficient trie library

v0.1.4 #string-matching #library #efficient #serde #trie
loc

Count lines of code (cloc) fast

v0.5.0 350 bin+lib #loc #language #seconds #ci #bench #src #count #inspect #sh #32
rust_jsc_sys

Low-level bindings to JavaScriptCore

v0.2.2 sys #javascriptcore #javascript #character-set #libs
mdbook-treesitter

mdBook preprocessor for html adding tree-sitter highlighting support

v1.0.0 bin+lib #mdbook #tree-sitter #mdbook-treesitter #javascript #scm
twas

A text substitution application for using random look-up tables to generate text in a manner similar to the Mad Libs game

v1.0.0 bin+lib #substitution #random #mad-lib #text #reference
wtf8-rs

WTF-8 encoding

v1.1.0 #unicode #wtf8-rs #wtf8
odict

A blazingly-fast dictionary file format for human languages

v2.8.0 550 #language #dictionary #odict #file-format #language-learning #linguistics
lsp-ty

type definitons for LSP

v0.2.2 #lsp #lsp-ty #position #response #lsp-io
typos-cli

Source Code Spelling Correction

v1.32.0 19K bin+lib #spelling #typos-cli #correction #typo #checker #cli #development #spell-check
markdown-it-footnotes

Creates footnotes and lists of footnotes in Markdown documents

v0.1.0 #footnotes #markdown-it-footnotes #reference #foo
sanitizer

A collection of methods and macros to sanitize struct fields

v0.1.6 5.5K #sanitizer #trim #validation #e164 #case #field
model2vec-rs

Official Rust Implementation of Model2Vec

v0.1.0 130 bin+lib #embedding #nlp #rust #model
string-overlap

A helper crate for "layering" ASCII art

v1.0.0 #overlap #string #layer #ascii
m2h

Convert Markdown to HTML with syntax highlighting

v0.1.0 app #render-markdown #syntax-highlighting #html
owned_chars

Owned iterators with the same output as Chars and CharIndices

v0.3.2 2.7K #owned #string #iterator #char
stylish-style

Internal implementation details of stylish-core

v0.1.1 4.8K no-std #style #stylish #stylish-style #stylish-core
lindera-decompress

A morphological analysis library

v0.32.3 12K #morphological-analysis #library #decompression #morphological #analysis
verba

working with Latin words

v0.5.1 #noun #verba #ending #verb #adjective
mdbook-plugin-utils

mdBook plugins

v0.2.3 1.1K #mdbook-plugin-utils #plugin #mdbook-plugins
py-regex

A small wrapper around the Python regex module via PyO3

v0.1.1 270 #py-regex #regex #substitution #position #matching
ironcalc_base

Open source spreadsheet engine

v0.5.0 #base #ironcalc #engine #ecosystem #api
ipa-translate

translating between IPA and ASCII text

v0.2.0 #translation #ipa-translate #ipa #text
aki-unbody

output first or last n lines, like a head and tail of linux command

v0.1.19 bin+lib #text #filter #aki-unbody #inverse #txt
malvolio

programmatically generating HTML

v0.4.1 #html #web-apps #text #child #post #submit
easy_io

Fast and dead-simple IO for competitive programming in Rust

v0.3.0 #io #competitive-programming #input-reader #output-writer
zen-rs

generating non-interactive content like cards or files

v0.1.6 #pdf #svg-pdf #svg #non-interactive #html #html-rendering #page #card
mdzk

Plain text Zettelkasten based on mdBook

v0.5.2 bin+lib #zettelkasten #mdzk #mdbook #markdown #notes
ultra-nlp

A NLP library

v0.8.0 #ultra #nlp #ngrams #word #char #extract #chinese #letter #consecutive
anslatortray

translate from English to Pig Latin!

v0.5.0 bin+lib #text-translation #translator #latin #localization #pig #text
merge3

merge tool for three-way merges

v0.2.0 2.2K bin+lib #merge3 #merge #text #base #merged
heart-strings

Quickly get random heart emojis to copy!

v1.0.0 app #emoji #fun #heart #copy #revolving-hearts-cupid #gift-heart-heartpulse #cupid-heartpulse
fancy-regex-fork-pb

A custom fork of the fancy-regex crate. You probably don't want to use this.

v0.3.2 #regex #fork #fancy-regex-fork-pb #fancy-regex #re #ac #backtracking
rsonpath-lib

Blazing fast JSONPath query engine powered by SIMD. Core library of rsonpath.

v0.9.4 #json-path #simd #search #query #json
english

language decliner

v0.0.3 #english #decliner #linguistics #nlp #conjugator #inflector
diff-man

diff utility lib

v0.1.7 bin+lib #diff #lib #diff-man
regex_quote_fixer

Rewrites grep regexpressions for the use in the regex crate

v0.2.1 #regex #regex-quote-fixer #quote #character-class
myanmar_util

A collection of tools for processing Myanmar text including syllable breaking and other utilities

v0.1.0 150 bin+lib #nlp #myanmar #burmese #text-processing
f-tree

the cli app to execute the commands from forester

v0.2.5 app #f-tree #forester #tree #sim
file-expert

Expert system for recognizing source code files, similar to GitHub/lingust

v1.1.0 bin+lib #source-code #expert-system #linguist #linguist-heuristics
rmw-utf8

Short text compression algorithm for utf-8 (optimized for Chinese , developed based on rust programming language). 面向utf-8的短文本压缩算法（为中文压缩优化，基于rust编程语言开发）。

v0.0.6 #8的短文本压缩 #rmw-utf8 #开发 #基于rust编程语 #面向utf #为中文压缩优 #utf
lammps-analyser

A CLI tool and language server for LAMMPS simulation input scripts

v0.1.0-pre-release-3 bin+lib #analyser #diagnostics #script #line
libretranslate

A wrapper for the LibreTranslate web API

v0.5.2 #translation #language #libretranslate #api
readable-regex

Regex made for humans. Wrapper to build regexes in a verbose style.

v0.1.0-alpha1 #regex #readable #lazy-evaluation #12 #01 #31 #why #human
typeline_ext_utils

operators for typeline

v0.1.0 #pipeline #shell #tl #stream
unescape

Unescapes strings with escape sequences written out as literal characters

v0.1.0 170K #escaping #unicode #string
mle

The markup link extractor (mle) extracts links from markup files (Markdown and HTML)

v0.25.2 bin+lib #render-markdown #link #markup #html #link-extractor #markdown
leptos-markdown

A component which can render markdown as html element in leptos

v0.1.0 app #leptos #leptos-markdown #markdown
random_ascii

A random [rainbow] ascii-art picker which matches your current terminal size

v0.1.5 app #ascii #random #file #size #window #checking #better
mecab

Safe Rust wrapper for mecab a japanese language part-of-speech and morphological analyzer library

v0.1.6 180 #japanese #morphological #japanese-morphological #analyzer #libmecab
afrim-translator

Manage the predication system of the afrim input method

v0.2.1 #ime #autocomplete #translator #predication #afrim #predicate
tengwar

Transliterate text into J.R.R. Tolkien's Tengwar.

v1.1.0 bin+lib #unicode #tengwar #quenya #sindarin #unicode-text #text
byte_string

Wrapper types for outputting byte strings (b"Hello") using the Debug ({:?}) format

v1.0.0 62K #debugging #ascii #ascii-text #ascii-string #string #debug
crawdad

ChaRActer-Wise Double-Array Dictionary

v0.4.0 3.6K no-std #double-array #double-array-trie #trie #text #text-search #search
mdbook-llms-txt-tools

convert mdbook to llmstxt.org format

v0.1.1 app #mdbook #llm #documentation #converter
uclanr

A random word picker that gives you actually useful words

v2.1.0 app #word #uclanr #amount
image-to-ascii

Converts images and gifs to ascii art

v0.7.0 bin+lib #art #image #image-to-ascii #character #gif #image-path #font #ascii-image #jpeg #base
record-query

doing record analysis and transformation

v1.0.4 bin+lib #query #javascript #record #command-line-tool
uo_rst_parser

fork of rst_parser with fixes for upstream-ontologist

v0.4.3 2.2K #parser #restructuredtext #upstream-ontologist #renderer
gliclass-rs

Inference engine for GLiClass models

v0.9.0 #classification #nlp #model
pithy

Ultra-fast, spookily accurate text summarizer that works on any language

v0.1.7 bin+lib #nlp #summarization #summarize #text-summarization #text
cali

A terminal calculator with real-time evaluation, unit conversions, and natural language expressions

v0.9.0 500 app #cali #line #variables #convert #operation #conversion
simplers

Simplification of too complex stuff in rust

v1.0.10 #simplers #color #input #input-color
skyline

helping patch and modify Nintendo Switch games

v0.2.1 2.1K nightly no-std #skyline #games #hex-dump
texcore

Create LaTeX documents using native Rust types

v0.7.2 nightly #latex #element #texcore
rustsay

CLI tool in Rust that mimics the classic cowsay program, allowing a cow to speak your text in the terminal

v0.2.0 110 app #terminal #cowsay #cli #ascii
rrename

" Opinionated tool to rename files in batch. Match regular expression, replace some characters I consider noise to kebab case

v0.1.2 150 bin+lib #case #rrename #regex #directory #txt #ls #run #mocking #co
rsrusl

A really simple useful library ported to Rust

v0.1.5 bin+lib #standard #simple #rusl #useful
split-identifier

Rust package that provides functions to split programmatic identifiers according to case conventions

v0.1.0 #split #identifier #split-identifier #package
analyse-json

CLI tool for inspecting (Newline Delimited) NDJSON or JSON to understand the contents

v0.6.1 bin+lib #ndjson #content #json #cli #file-path #object #processing #glob #explode-arrays #inspect-arrays
soup

Inspired by the python library BeautifulSoup, this is a layer on top of html5ever that adds a different API for querying and manipulating HTML

v0.5.1 2.4K #html #soup #tags #error #id #ul #body #story-title-head
loki_text

advanced string manipulation with pattern searching and replacement capabilities

v0.1.4 #text #search #base64
vec-embed-store

thin wrapper around LanceDb (VectorDb) meant to provide a means to create/store/query embeddings in a LanceDb without the need to grok the lower level Arrow/ColumnarDb tech

v0.3.2 #embed #store #vec #similarity-search #embeddings-db #text-search
rustc-demangle-capi

C API for the rustc-demangle crate

v0.1.1 280 #rustc-demangle #rustc-demangle-capi #demangle
bep-core

An opinionated library for building LLM powered applications

v0.5.0 #bep #embedding-model #openai
mrdirector

A narrative game development package for the Turbo Game Engine

v0.2.0 #mrdirector #risk #integration #roll #directory #narratives #choice #action #comments
interslavic

in rust

v0.2.1 #interslavic #gender #stems #language #com #csv
rosie

Interface for the Rosie Pattern Language, for efficient and maintainable text pattern matching and search

v0.1.1 #regex #matching #rosie #fsa #pattern-matching
plagiarismbasic_lib

Basic plagiarism checker written in Rust

v1.2.0 #lib #plagiarism #wip
dysql-tpl

Experimental Mustache-like templating engine

v2.0.0 bin+lib #content #dysql-tpl #section #以及分页
rk-utils

A collection of utility functions and data structures for rust

v0.2.2 #topological-sorting #string-processing #trie #longest-match #node
cesu8-str

CESU-8 and Java CESU-8 string validation and manipulation

v1.2.1 no-std #string #cesu8 #cesu8-str #utf-8
unicode-ellipsis

truncate Unicode strings to a certain width, automatically adding an ellipsis if the string is too long

v0.3.0 3.1K #unicode #string #word #unicode-text #text
harfbuzz-sys

Rust bindings to the HarfBuzz text shaping engine

v0.6.1 15K sys #opentype #font-shaping #layout #shaping #font #unicode #unicode-text
ttaw

talking to a wall, a piecemeal natural language processing library

v0.3.0 #nlp #cmudict #rhyme #alliteration #double-metahone
hexroll3-scroll

HEXROLL3 Scroll - the sandbox content generator

v0.1.1 #generator #scroll #hexroll3-scroll #included #model #rpg
unicode-casing

Titlecase helper function on characters

v0.1.0 7.4K #unicode-casing #unicode #casing
katex

Rust bindings to KaTeX

v0.4.6 5.8K #katex #latex #math #api-bindings
annotated-string

String with ability to annotate (format) its individual fragments

v0.2.1 #fragment #hi-doc #annotated
regex-split

split_inclusive for the regex crate

v0.1.0 3.0K #regex-split #regex #split-inclusive #day
tpt

Pure Rust implementation of the Unix concatenate (cat), word-count (wc) and echo command

v0.3.0 bin+lib #cat #cli #wc #command
ewts

Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (lib)

v0.1.3 #converter #ewts #tibetan #lib #localization #symbols #transliteration #ewts-converter
mdbook_ls

mdBook Language Server

v0.0.5 170 bin+lib #mdbook #mdbook-ls #server #ls #live #preview #assets #com
slicestring

slicing Strings

v0.3.3 #substring #slice #string #cut #string-slicing
hyper-static-server

friendly library to build static servers with hyper HTTP server

v0.5.1 #server #static #hyper
langlang_value

langlang is a parser generator based on Parsing Expression Grammars (library)

v0.1.2 #value #langlang-value #langlang #vm #grammar #parser-generator #parser-library #pattern-matching #toolkit #parsing-expression-grammar
atlaspathwaysai

An opinionated library for building LLM powered applications

v1.1.3 #embedding-model #documentation #atlaspathwaysai #repo #nbsp
text-transliterate

transliterate texts using the SO iconv from POSIX

v2.0.0 #posix #text #text-transliterate
tree-sitter-stack-graphs-python

Stack graphs definition for Python using tree-sitter-python

v0.3.0 210 bin+lib #stack-graphs #tree-sitter #python
google-book-scraper

downloading the contents of books hosted on books.google.com for offline viewing

v0.3.5 bin+lib #web-scraping #book #download
rusty_regex

A minimalistic regex engine in Rust using the pipeline: Regex -> AST -> NFA -> DFA -> Match(String)

v0.2.0 bin+lib #regex #digits #string #character #underscore #default #position #alternative
mask-text

mask text with multiple masking options

v0.1.2 5.1K #mask #mask-text #regex #thanks
unified-diff

GNU unified diff format

v0.2.1 30K bin+lib #unified-diff #format #package #toml #00s
cursive-async-view

A loading-screen wrapper for gyscos/cursive views

v0.8.0 900 #cursive-views #loading #progress #terminal #cursive
ob

A Blog and RSS system written in Rust

v1.0.10 app #rss #blog #ob #script #static-site-generator
tuilet

A textual user interface for Toilet, the ANSI-art text generator

v0.3.1 bin+lib #ascii #figlet #ansi #toilet
escrit

learning languages by reading texts

v0.3.0 160 app #language-learning #language #text #input
basalt-core

core functionality for Basalt TUI application

v0.4.3 470 #obsidian #markdown #basalt-core #applications #text #ratatui
lspt

Language Server Protocol (LSP) types made easy

v0.2.0 100 #lsp #proposed #documentation
unveil-rs

Unveil Rs is a tool to create presentations from markdown files

v0.1.2-alpha1 bin+lib #css #unveil #js #markdown #slide #reveal
jira-clean

clean up Jira task description that is an output of jira-cli tool

v0.1.2 app #clean #jira #issue #description
datatroll

a robust and user-friendly Rust library for efficiently loading, manipulating, and exporting data stored in CSV files

v0.1.3 #csv #data #datatroll #data-science #pagination
rnltk

Natural Language Toolkit for Rust

v0.4.0 #nlp #sentiment #stemming
webspeeddial

A dial system for websites

v1.0.0 bin+lib #website #webspeeddial #bookmarks #com #gentoo #forms #fzf #dmenu #lars-zauberer #wofi
textspan

Text span utility

v0.5.2 #nlp #algorithm #python #text #utility #align-spans #remove-span-overlaps #remove-span-overlaps-idx
gds21

Integrated Circuit Layout Parser & Writer

v3.0.0-pre.2 #writer #date-time #text
epub2mdbook

convert EPUB files to MDBook format

v0.15.0 bin+lib #ebook #converter #epub #mdbook
ornn

Gen const for smart contract

v0.1.8 bin+lib #ornn #contract #codes #orn
ammonia

HTML Sanitization

v4.1.0 235K #input-validation #html #security #xss
bukvalno

A cli tool for converting images to ascii art

v0.3.0 bin+lib #ascii-art #image #art
re_view_text_log

A view that shows text entries in a table and scrolls with the active time

v0.23.2 21K #view #logging #text
chars_data

Build-dependency for chars, the unicode character information CLI

v0.7.0 #unicode #build-dependencies #codegen
quickner-core

A fast and simple NER tool

v0.0.1-alpha.20 #nlp #ner #annotations #named-entity
cli_app_capo15

CLI application with Unix-like tools

v0.1.1 app #command-line-tool #unix #cli
rst_parser

a reStructuredText parser

v0.4.2 280 #restructuredtext #parser #rst-parser
story-dl

Story web scraping

v0.6.0 bin+lib #story #scraping #epub #site #note
emojicon

Find Emoji by using Emoticons and GitHub's, Bengali emoji names

v0.4.0 #emoji #emoticon #unicode #bengali #gemoji
unic-emoji-char

UNIC — Unicode Emoji — Emoji Character Properties

v0.9.0 104K #emoji #character-property #unic #unicode #unicode-text #text
render_readme

Render Markdown or reStructuredText with syntax highlighting and image filtering similar to GitHub's

v0.14.0 160 #github #markdown #convert #html #readme
rust-texas

generate latex documents

v0.3.6 #pdf-generation #latex #texas #pdf
tagsearch

Filter plaintext files based on @keyword tags

v0.37.0 bin+lib #tags #tagsearch #service #model
avatarsay

Beautiful quotes from Avatar: The Last Airbender

v0.1.3 app #airbender #avatarsay #terminal #kitty
royal_road_archiver

An archival program and library for the webnovel site RoyalRoad

v1.0.3 bin+lib #road #archiver #royal-road-archiver
nib

static site generator

v0.0.10 #text #cli #generator
lindera-ipadic-builder

A Japanese morphological dictionary builder for IPADIC

v0.32.3 12K #morphological #japanese #builder #dictionary #ipadic
geoipsed

Inline decoration of IPv4 and IPv6 address geolocations

v0.1.3 app #ip-geolocation #dfir #regex
utf8streamreader

lookahead iterator on an utf8 byte stream

v0.1.0 #utf8streamreader #utf8-reader #stream
committable

Keccak256-based structured commitments

v0.2.4 10K #commitment #committable #default
kakasi

Romanize hiragana, katakana and kanji (Japanese text)

v0.1.0 500 bin+lib #kakasi #hiragana #text #is-japanese #alphabet
markdown2pdf

Create PDF with Markdown files (a md to pdf transpiler)

v0.1.3 bin+lib #pdf #markdown #md #markdown-to-pdf
asimov-account-cli

ASIMOV Account Command-Line Interface (CLI)

v25.0.0-dev.0 app #asimov #cli #artificial-intelligence
fsays

flavored replacement for the classic cowsay

v0.3.0 app #cowsay #rustaceans #fsays #print #ferris
xml_writer

writes xml, not pretty, but faaast

v0.4.0 2.8K #xml-writer #xml #faaast #text #root #node #abc #begin-elem
glyph-names

Mapping of characters to glyph names according to the Adobe Glyph List Specification

v0.2.0 4.4K #font-glyph #name #glyph #font #specification
kicad-text-injector

A tiny CLI tool that replaces variables of the style ${KEY} within KiCad PCB (pcbnew) files

v0.3.1 bin+lib #ki-cad #ci #cli #injector
matchpick

Find and replace multi-lines using a match-case

v0.2.1 bin+lib #match-case #pattern #file #stdin
wildcard_ex

extended wildcards that allows VB-like specifications

v0.1.2 bin+lib #wildcard #pattern #wildcard-ex #ex
warrah

command-line utility and Rust library that sloppily removes code comments from a text file, supporting 60+ programming languages

v0.1.1 230 bin+lib #llm #cli #warrah #text #language #py
md-bakery

Markdown Bakery CLI app

v1.2.0 app #bakery #md-bakery #derive #debugging #source #hash-map #end #snippet-b #snippet-a
cargo-markdown

Local crates.io readme development server with ultra-fast hot reloading goodness

v1.0.3 app #cargo-subcommand #mockups #readme
mdbook-najan

Preprocessor for the Najan mdBook

v0.3.1 bin+lib #mdbook-najan #mdbook #najan
mdbook-skill-tree

mdbook plugin to show roadmaps

v3.0.0 app #skill-tree #mdbook-skill-tree #tree #roadmaps
scie

research about how to build simple code identify engine for different languages

v0.1.0 app #scie #testing #fs
commit_crafter

AI powered tool for Git commit message generator

v0.1.5 bin+lib #artificial-intelligence #commit-message #nlp #productivity #git
dnd-character

A Dungeons and Dragons character generator

v0.16.0 900 #character #dnd-character #generator
crustword

Crusty Crosswords

v0.1.0 app #crosswords #crustword #crossword #output #mode #list #language #game #terminal #crossword-generator
zummi

fun lib that produces spoonerisms

v0.1.2 bin+lib #zummi #spoonerisms #horld #world #foonerisms
lingua-english-language-model

The English language model for Lingua, an accurate natural language detection library

v1.2.0 19K #language-recognition #lingua #language-detection #nlp
baste64

A base64 codec

v0.1.0 #baste64 #encode #v128 #js-value #wasm-bindgen #based64 #codec
opstr

‘Operate on strings’ command line utility

v1.1.0 bin+lib #utility #unicode #default #letter #string #output #archive #platform #link
neo-mime

Strongly Typed Mimes

v0.1.1 28K #media-type #mime #media-extensions #media-range
rusty-axml

A parser for Android AXML files

v0.2.0 bin+lib #axml #name #rusty-axml #status
advanced_string_generator

A command-line tool for generating strings based on customizable regex patterns

v0.1.2 bin+lib #rust #generator #regex #cli
maudit

Framework for generating static websites

v0.2.0 #maudit #string
perm-text

curling straight/dumb quotation marks ("") and apostrophes (') into their curly/smart (“”’) equivalents

v1.0.4 app #quote #text #apostrophes #curly
schmfy

Schmfication library

v0.3.0 #schmfy #everything #string
latex-packer

CLI that goes though the file and subsequent \input, \include and packs all the content to the single output file

v0.1.1 170 app #packer #latex-packer #file #document #tex #article #world #end #begin #documentclass
marktask

A CLI tool for parsing and manipulating Markdown tasks

v0.2.0 bin+lib #task #todo #markdown
roxy_markdown_parser

Roxy plugin for parsing Markdown

v0.1.2 #markdown-parser #roxy-markdown-parser #markdown
imperative

Check for imperative mood in text

v1.0.6 16K #imperative #word #text #description #contribute
pattern-generator

solving Sudoku puzzles. It takes a Sudoku puzzle input and provides the solved grid.

v0.1.2 bin+lib #generator #pattern #pattern-generator
grepster

command-line tool for searching text in files

v0.1.2 280 bin+lib #grep #search #text-search #cli #text #result
c6o-obsidian-export

associated CLI program to export an Obsidian vault to regular Markdown

v21.9.0 bin+lib #obsidian #markdown #export #front-matter #groenen #embed #notes #bot #commit #version
markdown-tables

generating markdown-formatted tables

v0.1.0 #markdown-tables #escaping #character #content #as-table
mdbook-metadata

mdBook preprocessor to parse markdown metadata

v0.1.1 app #metadata #mdbook-metadata #mdbook #pre-processor
convert_string

A trait to convert Strings to safe non-keywords and/or convert a Strings case (snake_case, PascalCase, ...)

v0.2.0 180 #reserved #formatting #keyword #string-formatting #case
incredimo

just another font for your terminal

v0.1.17 bin+lib #banner #terminal #art #ascii-art
tabiew

A lightweight TUI application to view and query tabular data files, such as CSV, TSV, and parquet

v0.9.4 750 bin+lib #search #tabiew #excel #themes #command #key-bindings
text_lines

Information about lines of text in a string

v0.6.0 69K #text-lines #text #line
lucide-rs

Provide lucide icon for rust

v0.1.0 bin+lib #lucide-rs #lucide #points
topfew

CLI to find high frequency occurrences in structured text files

v0.2.3 bin+lib #field #cli #topfew
komga

REST API Client generated from OpenAPI specification

v1.9.2 #komga #thumbnail #specification #openapi #model #bindings
ascii-img

Convert images to ASCII

v0.3.0 410 #ascii #ascii-img #image #ascii-art
mairs

was created for simply programing CLI programs, with the simplest console graphical interface

v0.1.6 #mairs
nesty

Generate code with with human readable indentation

v0.2.0 1.0K #nesty #indentation #world #newlines #produce #if-expr #crlf #newline
char-ranges

Iterate chars and their start and end byte positions

v0.1.2 no-std #range #text #char #no-std #position
generic_symbolic_expressions

fork of symbolic-expressions, which tweaks it to be more normal. The original crate had weird rules around putting extra double quotes.

v5.0.4 180 #s-exp #symbolic-expression #generic #symbolic #sexp
pygmentize

wrapper for syntax highlighting

v0.2.0 #syntax-highlighting #highlighter-coloring #html #highlighter #syntax-coloring #syntax-highlighter #coloring
tagalyzer

A CLI tool to gather statistics on collections of plaintext-adjacent files

v0.3.0 bin+lib #tags #word-analysis #writing-analysis #processing
bstr

A string type that is not required to be valid UTF-8

v1.12.0 7.4M no-std #string #byte #unicode #text
just-enough-emojis

text to emoji cli

v2.0.0 app #emoji #cli #text
subject-classifier

classifying a commit by it's subject

v0.4.2 #changelog #subject #classification
rtss

A command-line tool to annotate stdout/stderr with elapsed times

v0.6.2 bin+lib #timestamp #filter #command-line-tool
clarifai_grpc

The official Clarifai gRPC Rust client

v8.0.0 #deep-learning #artificial-intelligence #grpc-client #image-recognition #neural-network #clarifai #computer-vision
markdown-it-heading-anchors

A markdown-it plugin for parsing GFM tasklists

v0.3.0 290 #markdown-it #markdown #heading #tasklists
markdown-formatter

Flavored Markdown (ZH) content formatter

v0.0.13 bin+lib #formatter #markdown #markdown-formatter
aconv

Converts texts from the auto-detected encoding to UTF-8 or a specified encoding

v0.1.4 bin+lib #internationalization #unicode #cjk #iconv
akiaki

A good old fashioned wiki engine with a flat-file database

v0.0.3 app #networking #fast-cgi #wiki #server
mdplayscript

An extension of Markdown for play scripts

v0.6.0 #play #pulldown-cmark #markdown #script
path2regex

Express style path to RegExp utility

v0.0.4 #routing #express #regex #routes
reedy

A terminal-based RSS reader with a clean TUI interface

v0.1.4 bin+lib #rss #rss-feed #rss-reader #tui #reader
knowledge

The launcher app for the interacive book

v0.4.5 bin+lib #launcher #book #knowledge
fip

Field Parser, roughly emulating "awk '{print $<field-number>}'"

v1.0.2 app #fip #field-number #find-nth-field
fluxus

lightweight stream processing engine written in Rust, designed for efficient real-time data processing and analysis

v0.2.0 450 #fluxus #engine #api #stream-processing #aggregate #data-processing #windows #extend
rust_cascade

bloom filter cascade implementation in Rust

v1.5.0 1.1K #cascade #rust-cascade #salt #pdf #github
garde-fr

Validation library

v0.18.1 #validation #garde #ascii #rules #valid
st7789_rs

A driver and graphics library for st7789 displays, primarily used on a Raspberry Pi

v0.1.5 #st7789-rs #display #st7789 #pi #lcd #foundation #eventually #computer-microcontroller #devices
edit

Open a file in the default text editor

v0.1.5 22K #edit #editor #editing #cli
pi_ucd

unicode字符函数，获得字符的语言区间段；及根据文字排版的需要，判断字符是否为单字字符或字母字符

v0.1.0 #pi #unicode #unicode字符函 #判断字符是否 #字符 #单字字符或字 #获得字符的 #extension #symbols #forms
runestr-pancjkv

rune-based Pan-CJKV support

v0.1.1 #runestr-pancjkv #pan-cjkv-region #pancjkv #rune-string
supercat

A syntax highlighting alternative to cat

v0.1.0 app #cat #syntax-highlighting #cli #numbers
nanoid-dictionary

Popular alphabets for use with nanoid

v0.4.3 800 #nano-id #nanoid-dictionary #nolookalikes
flashtext2

The FlashText algorithm implemented in Rust

v0.2.0 #flashtext2 #keyword-processor #token #insensitive #string-matching #nlp #pyo3 #flashtext #extracting-keywords
rfsee-tf-idf

TF-IDF implementation for rfsee

v0.1.0 #tf-idf #rfsee #nvim #index #regex #neovim-plugin #simd-json #run-time #book #client
mastodon-async-entities

Types for (de)serializing entities from the Mastodon API; part of mastodon-async

v1.1.0 200 #mastodon #mastodon-async #entities
select-html

Extract HTML using CSS selectors in the command-line

v0.1.2 app #html #select-html #text #title #content
fiberplane-markdown

convert Fiberplane Notebooks to and from Markdown

v1.0.0-beta.14 #markdown #fiberplane #convert #notebook #conversion
unicode-normalization-alignments

functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15

v0.1.12 411K #unicode-normalization #alignment #recomposition #unicode-text #text #decomposition #normalization #unicode
bdf2surface

convert bdf font to sdl2 surface

v0.0.2 #surface #bdf2surface #text #bmp
comment-strip

Remove comments out of text files

v0.1.3 420 bin+lib #remove #command-line #comments #delete #strip
nlprule

A fast, low-resource Natural Language Processing and Error Correction library

v0.6.4 5.2K bin+lib #nlp #grammar #spelling #text
ripmors

encoding and decoding international Morse code and several variants

v0.1.0 bin+lib #ripmors #variant #greek #to-standard #mi-b-s #encoding-decoding #gi-b-s
text-utils

Text utils for unescaping and align

v0.4.3 1.2K #utilities #escaping #text #alignment #interface
text_unit

Newtypes for text offsets

v0.1.10 2.1K #text #unit #text-unit #offset
chinese

language nlp tools

v0.0.2 #chinese #nlp
radix_trie

Generic radix trie data-structure

v0.2.1 1.8M #trie #prefix #radix-trie #patricia #generic
himmelblau_kerberos_keytab

parse keytab kerberos files

v0.4.1 #keytab #kerberos #himmelblau #ascii #fs #file #content #arguments
sms_splitter

An SMS message splitter and part calculator with support for GSM and Unicode

v0.1.9 #unicode #splitter #sms #gsm #split-sms
cutters

Rule based sentence segmentation library

v0.1.4 #cutters #nlp #standard
commonregex

Rust port for CommonRegex. Find all times, dates, links, phone numbers, emails, ip addresses, prices, hex colors, and credit card numbers in a string. We did the hard work so you don't have to.

v0.2.0 650 #regex #commonregex #port #welcome
roman_numerals_fn

A function to convert integers to their roman numeral representation as strings. Values from 1 to 3999 are possible, otherwise it returns an OutOfRangeError. Zero has no representation in roman numerals.

v1.0.0 #roman-numeral #roman-numerals #roman #numeral
libefi-sys

Bindings for libefi on illumos

v0.1.0 6.5K sys #libefi-sys #version #illumos #construction #html #upper-case #camel-case #non-upper-case-globals #non-camel-case-types #non-snake-case
mdbook-embed

A preprocessor that simplifies embedded URL

v0.2.0 app #markdown #mdbook #url #mdbook-plugins
spider_scraper

A css scraper using html5ever

v0.1.2 1.2K #web-scraping #selector #html #element #text #attributes #document #fragment
knock-knock

CLI tool for obtaining and outputting domain name information in an easy-to-read format

v0.3.2 app #knock #information #knock-knock #format #cross-platform #source
hitori

Generic compile-time regular expressions

v0.2.3 no-std #regex #hitori #expression
grammalecte_client

Grammalecte HTTP client

v0.1.5 #spell-check #grammalecte #client #server #suggestions #spell-checking #spellcheck
rep-grep

wgrep/write-grep CLI

v0.0.7 app #find-replace #regex #grep #sed
rupantor

A Bengali Phonetic Parser which is very flexible and supports Avro Phonetic

v0.3.0 #avro #avro-phonetic #bangla #bengali
wai-bindgen-gen-markdown

Generate Markdown API docs for a WAI interface

v0.2.3 #interface #wai #markdown
apple-notes-exporter

CLI tool for exporting Apple Notes to Markdown

v0.1.0 bin+lib #notes #markdown #apple #export #attachment
esoteric-vm

An esoteric virtual machine

v0.2.1 #esoteric #machine #stack #world #text #numbers #unused #ωtheendisnear #ωskiptothechase #n-0
wordfreq

port of wordfreq for looking up the frequencies of words in many languages

v0.2.3 #nlp #wordfreq #word-freq #freq1 #freq2 #assert-relative-eq
xmlwriter

streaming XML writer

v0.1.0 271K #xml-writer #xml #svg
mdbook-spec

An mdBook preprocessor to help with the Rust specification

v0.1.1 bin+lib #specification #mdbook-spec #mdbook
ironsmith-parser

Transforms Smithy 2.0 IDL files into an abstract syntax tree

v0.1.0 #text #ironsmith #tree
simple_peg

A command-line peg parser implemented by Rust

v0.3.0 app #peg #compile #simple-peg #peg-parser
ropey

A fast and robust text rope for Rust

v2.0.0-alpha.2 155K #rope #edit #buffer #text-edit #text
correct_word

A No brainer 'did you mean' library for Rust

v0.2.0 #levenshtein #correct-word #did-you-mean #word #algorithm
chunk_norris

splitting large text into smaller batches for LLM input

v0.2.1 #batching #tokenize #nlp #llm #text
popgetter-core

Core library for popgetter

v0.2.2 160 #popgetter #popgetter-core #name #package #interface
compact_str

A memory efficient string type that transparently stores strings on the stack, when possible

v0.9.0 2.4M no-std #string #compact #mutable #small #memory
hunspell-rs

Rust bindings to the Hunspell library

v0.4.0 7.4K #spell-check #hunspell #hunspell-rs #spell-checking #spellcheck
pillar

small tool to format lines into columns

v0.1.2 app #table-column #column #tabs #pad #padding #table
crate-starter

starter

v1.1.0 app #library #example #rust #starter
realhydroper-utf16

Work with UTF-16 in Rust

v1.1.0 #utf-16 #string #realhydroper-utf16
xml1

sane, non compliant xml parser without allocations

v0.1.6 #allocation #xml1 #xml-event #dbg #evn #sequeces #allocations
uci-parser

Universal Chess Interface parser

v1.1.0 150 #parser #chess #uci #uci-command
squishyid

Shorten and obfuscate IDs

v0.1.1 #squishyid #squishy-id #php #sv1f-js3-zi-hm #yz-nre7t-i4yq-pv-xm0 #k9oxn-fa-wu-dp-ol-g-ag #2bj-lh-rdu-c6-tb8-q5c #string
udp-logger-rs

Log macro for log's kv-unstable backend and a UDP socket logger

v0.1.4 #key-value #logging #udp #kv
fj-text

creating text in fornjot

v0.2.0 #text #fj-text #welcome
rsmorphy

Morphological analyzer / inflection engine for Russian and Ukrainian (soon) languages (WIP)

v0.4.0 #inflection #russian #pluralize #nlp #ukrainian
nugine-rust-utils

Nugine's personal Rust utilities

v0.3.1 11K no-std #utilities #nugine-rust-utils #nugine #idea
tet_rs

A third-party implementation of Text Entry Throughput (ref. https://doi.org/10.1145/3290605.3300866) for Rust

v0.3.1 #hci #text-entry-benchmark #text-entry #distribution
kana-converter

converter for half-width/full-width Japanese language characters (katakana, hiragana, and ASCII)

v0.1.2 #kana #full-width #half-width #byte-conversion
eml-codec

Email enCOder DECoder in Rust. Support Internet Message Format and MIME (RFC 822, 5322, 2045, 2046, 2047, 2048, 2049).

v0.1.2 bin+lib #ascii #line #com #charset-us-ascii #message #bug #reference
slicedisplay

Simplistic Display implementation for Vecs and slices

v0.2.2 130 #string #display #text #slice
refac

Transform some text given a generic natural language prompt

v0.1.2 app #refac #transform #top-p #editor
auto-regex

Automagically finds a regex that best matches an example and a sample list

v0.1.3 #string #regex #dataset #text #filter
libflagup

Display a country's flag as an emoji

v0.0.8 #libflagup #country #quiz #flagup #homebrew
pretty

Wadler-style pretty-printing combinators in Rust

v0.12.4 363K #pretty-print #functional #console #testing
mdbook-svgbob2

Alternative mdbook preprocessor for svgbob

v0.3.0 bin+lib #svg #markdown #mdbook #svgbob
enpsrlib

English Phrase Structure Rules library

v0.1.0 #phrase #structure #english #linguistics #psr
mdbook-dtmo

Creates a book from markdown files with added plugins

v0.15.2 app #rust-book #markdown #gitbook #book
qpprint

console printing/formatting

v0.3.0 130 #terminal #console #terminal-console #format
inslice

A command-line utility for filtering text input by columns and rows

v1.1.0 bin+lib #row #inslice #path #colslc #rowslc
reg_match

A match style regex tool

v0.1.0 #regex #reg-match #macro #match
archive-pdf-urls

Extract all links from a PDF and archive the URLs in the Internet Archive's Wayback Machine

v0.5.1 app #url #machine #pdf
bottomify

Fantastic (maybe) CLI for translating between bottom and human-readable text

v1.2.0 bin+lib #bottomify #bottom #unicode #text #why
cow-rewrite

Rewrite copy-on-write types copying only when it's neccessary

v0.1.0 #rewrite #cow #neccessary
forestrie-builder

Build a trie and convert it TokenStream

v0.3.1 #forestrie #builder #forestrie-builder #token-stream
lightning-path

Route Recognizer library for lightning-fast matching

v1.0.2 #routes #router #lightning-path #matching
texting

string helpers

v0.0.7 #string #nlp #texting #text #helper #str
committed

Nitpicking commit history since beabf39

v1.1.7 180 bin+lib #development #beabf39 #committed #logging #git #styling #processing
hexstring

handling hexadecimal string

v0.1.4 nightly #hex-string #utility #lower-hex #byte #hex #string #literals
ptero-cli

A text steganography CLI tool for Social Media

v0.4.2 bin+lib #steganography #encoding-decoding #text #media
is_printable

Determine whether a given text-based value is printable

v0.1.1 #printable #utf-8 #ascii #char
regexgrep

ripgrep tool that suports regular expressions

v1.0.3 app #regex #regexgrep #expression #sensativity
pra

Print Random ASCII

v0.0.2 130 bin+lib #ascii #pra
inboxbot

A telegram bot to save messages to a file

v0.1.15 app #file #inboxbot #folder
enso-lazy-reader

An efficient buffered reader

v0.2.0 nightly #reader #utf #enso-lazy-reader #read
font-map

Macros and utilities for parsing font files

v0.2.9 #font #svg #macro #api-bindings
notegraf

Core library for building a graph-oriented notebook

v0.1.1 #note-taking #notebook #notegraf #markdown
acridotheres-3ds

Nintendo 3DS-specific file formats for Acridotheres

v0.2.1 #acridotheres #acridotheres-3ds #text #format #package #3ds-formats
sluggify

slug or clean url generator for rust. With default settings, you will get an hyphenized, lowercase, alphanumeric version of any string you please, with any diacritics removed, whitespace and dashes collapsed…

v0.1.0 #sluggify #slug #string #urlcleaner #slug-generator #slug-helper #character #string-utils #url-cleaner
shopping-parser

A Rust-based parser for parsing structured product information and shopping lists, supporting multiple currencies and units

v0.1.1 bin+lib #shopping #parser #cli-parser #rust
techlead

CLI is a command-line interface that enables developers to chat with an AI assistant powered by the OpenAI GPT language model, designed specifically to help with your Rust project

v0.2.0 app #chatgpt #client #openai #openai-api #api-client
stringutils

A collection of various and (hopefully) useful String utility functions

v0.0.3 #stringutils #byte-array #rust-stringutils
altium

processing Altium file types

v0.2.1 #altium #sch-lib #record
mdbook-yml-header

mdBook preprocessor for removing yml header

v0.1.4 app #mdbook-preprocessor #mdbook #rust-book #markdown #book #mdbook-pre-processor
human_regex

A regex library for humans

v0.3.0 #human-readable #regex #end #character #flags #repeat
segtok

Sentence segmentation and word tokenization tools

v0.1.5 #tokenize #split #segmenter #word #tokenizer
ranting

Linguistic formatting placeholder extensions for rust

v0.2.1 #placeholder #noun #inflection #verb #indefinite-article
ufofmt

A fast, flexible UFO source file formatter based on the Norad library

v0.7.1 app #formatter #ufo #normalizer #font #graphics
text_layout

Text layout algorithms

v0.3.0 no-std #text-layout #text #graphics #layout #environment
strange

A static website generator

v0.9.0 app #website-generator #markdown #static-website #static #website #generator #web
twitter-stream-message

Types for Twitter Streaming API's messages

v0.3.0 #twitter #stream #user-profile
scraper

HTML parsing and querying with CSS selectors

v0.23.1 393K bin+lib #css-selectors #selector #css #scraping #web-page #html
twitter-text

in Rust

v0.2.0 260 #twitter-text #text #twitter #objective-c #testing #javascript #conformance #java #ruby #build
tzgrep

grep tar.gz

v0.2.0 180 bin+lib #grep #tar #gz
revstr

Simply reverses strings

v1.0.2 app #string #revstr
fx-mistral

leverage the Mistral API for OCR and data extraction from PDFs

v0.0.2 160 #mistral #ocr #pdf #completion #api
vngineer

Visual Novel game engine

v1.0.3 app #engine #vngineer #vns
name-engine

computing Markov chains to generate random names based on pronunciation

v0.1.0 #name #name-engine #pronunciation #wɛlz
jposta

A fast and intuitive Terminal User Interface (TUI) tool for searching Japanese postal codes and addresses

v0.1.0 app #tui #address #japan #postal
semchunk-rs

A fast and lightweight Rust library for splitting text into semantically meaningful chunks

v0.1.1 #chunking #semantic #nlp #tokenize #text #token
pdf-sign

extract signed date from pdf file

v0.1.1 app #date #pdf-sign #file #path-to-pdf
wdl-doc

Documentation generator for Workflow Description Language (WDL) documents

v0.3.2 1.0K #wdl #documentation #wdl-doc #workflow
mdbook-davids_cooking

A preprocesor for whatever https://davidsotomarchena.gitlab.io/davids-cooking/ needs

v0.1.3 app #mdbook #davids-cooking #preprocesor
betacode

conversion

v1.2.0 #ascii #betacode #validation #converter #convert
kvarn-chute

A Markdown converter designed to use the Kvarn templating engine

v0.4.0 bin+lib #kvarn #markdown #template #kvarn-extension
zhlint

A linting tool for Chinese text content

v0.0.3 bin+lib #zhlint #mark #token
mdbook-check-missing-md

A backend for mdbook which will find Markdowns you forgot on SUMMARY.md

v0.1.1 bin+lib #mdbook-check-missing-md #mdbook #check
langsan

sanitizing language model input and output

v0.0.10 #language-model #input-validation #language #model
ultron-syntaxes-themes

Syntaxes and themes dump for ultron

v0.4.0 bin+lib #syntax-highlighting #syntax-themes #ultron #themes #set #highlighting #syntax #package
dokkoo

Mokk (Macro Output Key Kit) implementation written in Rust

v0.5.0 nightly bin+lib #dokkoo #liquid #mokk #markdown
is-vowel

Heuristically test whether a character is a vowel letter

v0.1.0 150 #vowel #letter #is-vowel
asimov-apify-module

ASIMOV module for data import powered by the Apify web automation platform

v0.0.2 no-std bin+lib #asimov-module #artificial-intelligence #asimov #api-bindings #actor
sixbit

Small packed strings

v0.5.0 #string #unicode #small #unicode-text #text
hoedown

bindings for the Hoedown markdown processor

v6.0.0 500 #markdown #hoedown #html #processor #render
panda-re-sys

The official *-sys library for interfacing with PANDA (Platform for Architecture-Neutral Dynamic Analysis)

v0.8.0 340 sys #properties #analysis #memory-region
utf8-supported

Determine the UTF-8 support of the current locale

v0.1.0 210 #utf-8 #utf8-supported #command #write #terminal #xb2-xb3-xb9-n
oui-data

looking up information from the IEEE OUI database

v0.2.0 470 #oui #mac #oui-data
mdbook-scientific

Enables inline equations for mdbook to set by $..$ signs and $$..$$

v0.5.0-beta.3 bin+lib #mdbook #scientific #scientific-equation #equation
url-pattern

VERY INCOMPLETE implementation of the WhatWG URL Pattern standard https://https://urlpattern.spec.whatwg.org/. Seriously DON’T USE THIS (yet)!

v0.1.1 #url-pattern #regex #url
fuzzy-string-distance

Fuzzy string distance comparisons

v1.0.0 #edit-distance #levenshtein #compare #levenshtein-distance #fuzzy-search #comparison #text-processing
trevordmiller

Personal CLI

v1.1.4 app #trevordmiller #principles #cli
rusticsearch

A lightweight, Elasticsearch-compatible search server (early WIP)

v0.0.2 app #rusticsearch #alias #icsearch #operate #sense
whitespace-conf

Key-value configuration file delimited with whitespaces

v1.0.0 #whitespace-conf #white-space #conf #note #fs
bitflip

functions to generate bitflips of binary and UTF-8 strings

v0.1.0 370 #bitflip #bitflips #blip #bitsquatting
align

aligning text

v1.0.0 app #alignment #text #align-cr
textcat

detect text categories. It can be used to detect the language of a given text

v0.3.2 bin+lib #textcat #text #ngrams
mdbook-gitbook

mdBook preprocessor to properly render GitBook specific syntax

v1.0.3 bin+lib #mdbook-preprocessor #mdbook #gitbook #markdown #mdbook-pre-processor
basic-text-internals

Basic Text string literal implementation details

v0.19.2 #basic-text #basic-text-internals #detail
mdbook-indexing

mdbook preprocessor for index generation

v0.1.2 app #mdbook #indexing #mdbook-indexing #index #name #entries
rustrings

Strings manipulation for Rust

v1.0.2 #rustrings #rings #format-text
chunkr

A fast and quick chunking library for rust

v0.1.17 #chunkr #yourself #cli
ascii-pixel

Convert pixel art into ascii images

v1.0.0 app #ascii-pixel #image #ascii
mojibake

Encode/Decode bytes as emoji base2048

v0.2.1 #base2048 #mojibake #encode #sm #boy #yo-yo #fist-left #sleeping-bed #black-flag #clock930-ok-woman
unicode-width-16

Determine displayed width of char and str types according to Unicode Standard Annex #11 rules

v0.1.0 1.0K no-std #unicode-width #unicode #unicode-text #width #text
vndb-api

Fully Functional Visual Novel Database (VNDB) HTTPS API Wrapper

v1.0.3 #visual-novel #anime #vn #vndb #filter
repa

Peak Performance Pattern Seeker

v0.1.5 app #regex #hyperscan #grep #text-processing
unic-char-range

UNIC — Unicode Character Tools — Character Range and Iteration

v0.9.0 992K no-std #iteration #utilities #unicode #unicode-text #text
csv_to_table

pretty print CSV as a table

v0.8.0 340 #pretty-table #csv #csv-reader #print #format #table
yotasm

Assembler for my 16 bit CPU

v0.1.0 app #yotasm #bit
charclass

define and modify unicode character classes

v0.3.0 700 #charclass #class #character-classes
markov_strings

A simplistic Markov chain text generator

v0.1.5 #markov-chain #procedural-generation #text #markov
bk-tree

A Rust BK-tree implementation

v0.5.0 5.9K #fuzzy-search #bk-tree #search #metrics
google-fonts

Download and cache TTF fonts from Google

v0.1.5 #webp #true-type #font #graphics #api-bindings
outerspace

Methods for prefixing and suffixing the non-whitespace characters in a string

v0.2.1 #outerspace #outerspace-rs #hello
cabocha

Safe Rust wrapper for cabocha a japanese language dependency structure analyzer library

v0.2.0 #structure-analyzer #cabocha #japanese #dependencies #structure #analyzer
cur

that will hunt for your regular expression

v0.5.0 no-std #regex #expression #hunt #catch
minigrep_baolhq

Just getting started with Rust, enjoying it so far 😇

v0.1.0 bin+lib #mini-grep #case-insensitive #case-sensitive
markdown-extract-cli

Extract sections of a markdown file with a regular expression

v2.1.0 app #markdown #extract #markdown-extract-cli #expression #welcome #md
aprilasr

High-level wrapper for the april-asr C api (libaprilasr) using aprilasr-sys

v0.2.0 160 #audio #nlp #neural-network #wrapper
liwe

IWE core library

v0.0.31 #markdown #liwe #para #md #zettelkasten
typship

A cli for typst packages

v0.4.2 170 bin+lib #typship #publish #directory #notice #tool #help
korean_regex

Regex extension for Hangeul analysis

v0.3.0 #regex #korean #korean-regex
lazy-string-replace

A lazy version of String::replace, so that it can be formatted or recursively replaced without intermediate allocations

v0.1.3 #replace #lazy-evaluation #string #allocation
beans

A parser generator library based on the Earley parser

v8.0.0 bin+lib #variant #beans #value #parser #regex #earley-parser #source #parser-generator
detone

Decompose Vietnamese tone marks

v1.0.1 29K #unicode #vietnamese #mark
aws-sdk-cognitoidentityprovider

AWS SDK for Amazon Cognito Identity Provider

v1.83.0 45K #p-li #aws-sdk #enums #provider
lindera-ipadic-neologd-builder

A Japanese morphological dictionary builder for IPADIC NEologd

v0.32.3 11K #builder #japanese #neologd #ipadic #dictionary
base_u256

base-u256 is to utf-8 as base-64 is to ascii

v0.1.1 #ascii #u256 #base #encode-decode
mdbook-bibfile-referencing

An mdBook preprocessor to add bibfile referencing to each page

v0.3.0 app #mdbook #referencing #bibliography #citeproc
fmty

Composable core::fmt utilities

v0.1.1 no-std #display #text #utilities #string-format
strcursor

string cursor type for seeking through a string whilst respecting grapheme cluster and code point boundaries

v0.2.5 2.4K #string #grapheme #cursor #unicode
ascii-hangman-backend

customizable Hangman game with ASCII-art rewarding for children (backend)

v5.7.2 #ascii-hangman #back-end #hangman #version #true #licence #getreu #kids #ascii-art
japhonex

Japanese phone number checker for Rust

v0.1.1 #japhonex #regex #optional #hyphen #phone-number #japanese
searcher_txt

A copy of grep that i made to show that im bad at rust

v1.2.6 bin+lib #grep #search #txt #cli #case
ltxcut

formats a table-like stream into a LaTeX-table

v0.1.1 app #latex-table #ltxcut #field #delimiter #wrap-lines #wrap-fields #escape-fields #line #testing #csv
spigot

parser for valve's keyvalue file format (gameinfo.txt, vmt, etc.)

v0.1.2 #spigot #value
magic_string_rain

magic string

v0.3.5 6.5K #magic #rain #string #magic-string #napi
html2runes

An HTML to Text converter

v1.0.1 950 bin+lib #markdown-converter #plain-text #html #markdown
todo-to-issue

CLI tool that converts forgotten TODO comments into actionable GitHub issues

v0.1.1 app #issue #todo #github #comments #github-issue
pangu

Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)

v0.2.0 #spacing #pangu #objective-c #clojure #elixir #go #java #browser #php #python
roan-engine

The core engine for the Roan project

v0.1.6 #engine #roan-engine #roan
ucd-raw

Uninterpreted access to the unicode UCD

v0.5.0 no-std #ucd-raw #ucd #raw
diagnostic

Pretty diagnostic report

v0.6.4 240 #diagnostics #diagnostics-report #ansi-colors #language
porter-stemmer

Flexible and unicode friendly, Porter stemmer implementation

v0.1.2 130 #stemmer #stem #normalization #text #porter
johalun/module

FreeBSD kernel module in Rust

GitHub 0.1.0 3.6K nightly #module #binary-heap #html #machine
jcc

Convert Juniper configurations to set-style

v0.1.7 #jcc #set-style
finalfrontier

Train/use word embeddings with subword units

v0.9.4 bin+lib #finalfusion #text #binary #word #units #skipgram #fasttext
fuzzywuzzy

A pure-Rust clone of the incredibly useful fuzzy string matching python package, FuzzyWuzzy

v0.0.2 1.0K #string #text #utility #ratio #process
cerpton

A 'double' Caesar Cypher

v0.1.1 bin+lib #alphabet #cerpton #libcerpton-encode #ok #settings
hello_lib

Demonstrate Generics Function

v0.1.6 #demo #function #generics #hello
mdbook-keeper

An improved testing experience for mdbook

v0.5.0 bin+lib #keeper #mdbook-keeper #mdbook #book #done #skeptic
wcounter

Give the word and count the appearance

v0.2.4 app #zsh #fzf #wcounter #appearance #counter
quilltex

open-source Rust library designed to convert LaTeX documents into a Delta format that can be used with Quill.js and vice versa

v0.1.0 #quilltex #standard-package
butterkups-minigrep

Mini grep utility; very weak application, use grep instead

v0.1.1 bin+lib #mini-grep #grep #butterkups-minigrep #line
caser

Change text between PascalCase, camelCase, and snake_case

v1.1.1 bin+lib #snake-case #caser
jp-location-relation

隣接する市区町村の一覧を取得

v0.1.1 bin+lib #relation #jp-location-relation #location #の一覧を #隣接する #html #隣接エリアの #データソース #隣接街名の
invisible_unicode

finding invisible unicode characters

v1.0.0 #invisible #unicode #invisible-unicode #sample #검사
ellipse

Truncate and ellipse strings in a human-friendly way

v0.2.0 800 #ellipse #truncate #string #human
libphonenumber-sys

rust ffi bindings to libphonenumber

v0.1.1 sys #phone-number #libphonenumber #libphonenumber-sys #valid #hand #phone-number-util-error
pdfcr

render a codebase to a pdf

v1.3.0 app #pdf #pdfcr #file #title #required
translitrs

Transliteration utility for Serbian language

v0.2.2 bin+lib #transliteration #pandoc #filter #text
strip_markdown

remove markdown syntax from markdown files

v0.2.0 3.6K #markdown #strip-markdown #strip
xhtmlchardet

Character set detection for XML and HTML

v2.2.0 15K #detect #character-set #html #xml #character
cnpj

Brazilian CNPJ parsing, validating and formatting library

v0.2.2 no-std #cnpj #brazil #brasil #numbers #valid
re2

Wrapper for the re2 C++ regex library

v0.0.10 130 #re2 #syntax #matching #boundary #character #digits
awabi

A morphological analyzer using mecab dictionary

v0.3.0 bin+lib #mecab #dictionary #awabi #token #mecabrc #command #debian-ubuntu
rjoin

joining CSV data on command line

v0.2.0 bin+lib #join #rjoin #right #unlicense #line
glifnames

Mapping of characters to glyph names according to the Adobe Glyph List Specification

v0.2.0 #font-glyph #name #glyph #font #ufo #glif
mdbook-rustviz

An mdbook preprocessor that allows users to embed RustViz visualizations into mdbook projects

v0.2.0 app #pre-processor #rustviz #mdbook-rustviz #project
ascii_tree

generates ascii trees

v0.1.1 12K #tree #ascii #ascii-tree
codetypo-cli

Source Code Spelling Correction

v1.30.2 bin+lib #spelling #codetypo #correction #development
markdown-composer

composing markdown documents

v0.3.0 #markdown #markdown-composer #code-block
encoding8

various 8-bit encodings

v0.3.2 8.1K #encoding8 #encoding #encoding-8
esc

Escape characters in strings

v0.2.2 app #text #cli #string
msr-core

Industrial Automation Toolbox - Common core components

v0.3.7 1.5K #msr #component #msr-core #industrial-automation
delay_writer

Wraps a writer and delays its output after each newline

v0.2.1 #writer #delay #delay-writer
pandoc-ac

pandoc filter for converting acronym codes to LaTeX

v0.3.0 bin+lib #acronym #pandoc #pandoc-filter #latex
rreplace

designed to streamline string replacements. It can handle multiple unique replacements and iterates the string only once.

v0.1.0 #replace #string #substring #multiple #substitution
twie

fast and compact prefix tries

v0.5.0 #text #string #binary #tries
markdown-table

Creating markdown tables with Rust!

v0.2.0 1.6K #table #markdown-tables #markdown #utilities
forgiving-htmlescape

HTML entity encoding and decoding, with support for leaving malformed entities intact

v0.1.0 #forgiving-htmlescape #html-escape #intact
mepple

English dictionary as a library

v0.2.0 #mepple
kspconfigtool

KSP1 ConfigNode parser and block removal tool

v0.1.0 app #confignode #ksp #kerbal #tool
unicode-jp

convert Japanese Half-width-kana[半角ｶﾅ] and Wide-alphanumeric[全角英数] into normal ones

v0.4.0 1.4K bin+lib #unicode #kana #japanese #zenkaku #hankaku
ruSTLa

A reStructuredText → LarST ⊂ LaTeX transpiler

v0.38.0 bin+lib #restructuredtext #latex #transpiler
lindera-py

Python binding for Lindera

v0.42.2 160 #morphological-analysis #python #library
ripgrep

line-oriented search tool that recursively searches the current directory for a regex pattern while respecting gitignore rules. ripgrep has first class support on Windows, macOS and Linux.

v14.1.1 33K app #ripgrep #grep #search-pattern #pattern #regex
dr

Command-line data file processing in Rust

v0.7.0 app #dr #csv #schema #tl #bug #database
jcalendar

Japanese Calendar for Rust

v0.1.2 #calendar-week #calendar #week #console #koyomi #calendar-date
cologne_phonetics

generate phonetic cologne codes for utf8 strings

v0.1.0 no-std #cologne-phonetics #string #phonetic #cologne-code
koelner-phonetik

koelner_phonetik or cologne phonetics is a phonetic algorithm like soundex, but specialized for german words

v0.1.0 #phonetik-algorithm #koelner-phonetik #phonetik
AsgoreCore

A small rust library to manipulate arabic text to fit in non-supporting arabic games or programes

v0.1.2 bin+lib #programes #asgore-core
unicode_types

A mapping of all the unicode characters into convenience types (one enum per block of characters with one variant per character)

v0.2.0 130 #unicode #type #plain-text #convenience #boilerplate
omgwtf8

Optimized-Matching-Generalized Wobbly Transformation Format — 8-bit

v0.1.0 #unicode #surrogate #wtf8 #slice
typing_test

Typing speed test in rust

v0.3.5 bin+lib #typing-test #word #quote #in-progress #speed #done #input
mudders

Generating Lexicographically-Evenly-Spaced Strings, or: Mudder.js in Rust

v0.0.4 #edit-distance #lexicographical #mudders
polars_arrow_rvsry99dx

Apache Arrow

v0.17.1 nightly #arrow #analytics #duration #environment
esperanto-text

Convert Esperanto text between UTF-8, x-system and h-system transliterations

v1.0.0 bin+lib #esperanto #text #transliteration #string
pandoc_ast

deserializes and serializes the markdown ast for writing pandoc filters

v0.8.6 #pandoc #filter #ast #markdown #latex
wtf8

WTF-8 encoding. https://simonsapin.github.io/wtf-8/

v0.1.0 10K #unicode #surrogate #wtf8 #io-wtf-8 #encoding
vape

ｆｕｌｌｗｉｄｔｈａｅｓｔｈｅｔｉｃｓ

v0.4.0 app #full-width #aesthetic #vaporwave #ａｅｓｔｈｅ #ｉｃｓ
zalgo-codec

Convert an ASCII text string into a single unicode grapheme cluster and back. Provides a macro for embedding Rust source code that has been encoded in this way.

v0.13.5 100 no-std bin+lib #unicode #obfuscation #zalgo
testcall

companinon crate to bintest, implements test facilities

v1.3.0 #testing #facilities #testcall #capture
mime_4

Strongly Typed Mimes

v0.4.0-a.0 120 #media-type #mime #media-extensions #media-range
strmatch

Conditionally match strings in Rust using regex without much boilerplate

v0.1.1 #regex #boilerplate #strmatch #debugging
bge

Rust interface for BGE Small English Embedding Library

v0.2.0 #transformer #bert #text-embedding #sentence-similarity
bibutils-sys

Rust bindings for bibutils, a program for bibliography format interconversion

v0.1.1 sys #ffi #bibutils #bibutils-sys
uwubot

discord bot for uwuifying text

v0.3.0 bin+lib #text #uwubot #bot #setup #portal #choice
str_overlap

Methods for finding the overlap between two string slices

v0.4.3 650 no-std #string #overlap #intersection
yeslogic-unicode-script

Fast lookup of the Unicode Script property

v1.0.0 #internationalization #script #unicode #unicode-text #unicode-properties #text
mdbook-footnote

mdbook preprocessor for footnotes

v0.1.1 270 app #mdbook #footnotes #mdbook-footnote
steve

Search Technical Evidence Very Easily

v0.3.1 260 app #steve #audit #search #roast
minigrep_19283712349058

minigrep from The Rust Programming Language book

v0.1.0 130 bin+lib #book #mini-grep #minigrep-19283712349058
romulus

a stream editor like sed

v0.3.0 bin+lib #sed #awk #grep #cli #text #env-var
asciifolding

ascii folding library

v0.1.0 1.2K #ascii #folding #unicode #lucene
mdbook-hide

A preprocessor for mdbook that adds support for hidden chapters

v0.4.0 390 bin+lib #mdbook #hide #mdbook-hide #chapter
carnation

some string operators

v0.1.1 #carnation #trim-whitespace #upper-case
etch

Not just a text formatter, don't mark it down, etch it

v0.4.2 bin+lib #etch #document #word #syntax
uapi-version

Compare versions according to the UAPI Version Format Specification

v0.4.0 #version #uapi #systemd #specification
kvu

The simplest command line tool to manage key-value pair lines

v0.1.3 bin+lib #dotenv #key-value #config #environment #env
besida

Language for defining branching dialogue

v0.1.1 #dialog #besida #node #text #dialogue #format #func
slidedeck

Create an HTML slide deck from Markdown

v0.0.2 app #slidedeck #markdown
runanum

Существительные с правильными окончаниями после чисел

v0.1.1 #cases #runanum #chisel #яблок
glob-match

An extremely fast glob matcher

v0.2.1 426K #glob-match #glob #matcher
group-similar

Group similar values based on Jaro-Winkler distance

v0.2.2 bin+lib #similarity #jaro #group-similar #distance #partial-ord #string
mdbook-twiki

twiki backend for mdbook

v0.1.1 app #twiki #mdbook-twiki #mdbook #filename
allwords

Generate all the words over a given alphabet

v0.1.2 190 #word #alphabet #brute-force #iterator #fuzzy
axum-toml

Axum extractor for TOML

v0.2.0 #toml #axum #axum-toml
spacemod

A easy to understand and powerful text search-and-replace tool

v0.1.1 app #tool #spacemod #refactoring-tools #text-replace #codemod #grep
timfmt

A small utility for formatting code as Tim likes it

v0.2.0 app #fmt #tim #timfmt
string_morph

string case transformations with an emphasis on accuracy and performance. The case conversions are available as functions as well as traits on String types.

v0.1.0 5.3K #snake-case #camel-case #inflect #string
html_to_markdown

Convert HTML to Markdown

v0.1.0 3.4K #html-to-markdown #markdown #html #zed
runiq-lib

An efficient way to filter duplicate lines from input, à la uniq

v1.2.2 bin+lib #unique #filtering #logging #runiq
levenshtein_lite

No-frills implementation of a Levenshtein Automata and the Levenshtein Distance function

v0.1.1 #levenshtein #levenshtein-distance #automata #lite
swc_ecma_compat_common

Commons for compat transforms

v15.0.0 26K #swc #regex #transform #typescript-compiler
beediff

LCS algorithm in various applications

v0.1.2 #beediff #applications #case-sensitive
mdbook-preprocessor-utils

writing mdBook preprocessors

v0.2.0 150 #mdbook #pre-processor
unindenter

unindent text

v0.1.0 app #text #indent #unindent
readable-readability

Really fast readability

v0.4.0 1.2K #dom #extract #text #html #html-text #text-extract
uiua-doc-gen

Documentation generator for Uiua libraries

v0.16.1 170 app #documentation #documentation-generator #uiua #cli
crop

A pretty fast text rope

v0.4.3 4.3K no-std #rope #edit #tree #buffer
ccase

Command line interface to convert strings into any case

v0.4.1 bin+lib #case #string #casing #pattern
address_book

Інструмент командного рядка для парсингу телефонних номерів, ідентифікаторів, дат та неправильних…

v0.3.0 bin+lib #address-book #address #book #книга #полів
vc_8bit

This project is a virtual computer that takes a vector of bytes and runs it as instructions. Also included is a complete assembler and compiler.

v0.1.12 #vc-8bit #assembly #register #compiler
ctrl-z

A composable reader to treat 0x1A as an end-of-file marker

v0.1.0 #ctrl-z #substitution #eof #sub
mdbook-chapter-zero

A mdBook preprocessor that allows 0th (sub-)chapter

v0.1.0 bin+lib #chapter #zero #mdbook-chapter-zero #sub #chapter-zero #pre-processor
thousand_birds_deno

deno executable

v1.46.3 bin+lib #deno #executable #lock-files
summary

Extract the sentences which best summarize a document

v0.1.0 #summary #summarize #summarizer #tf-idf
rvim

A text editor in rust

v0.0.8 app #highlighting #rvim #js #sh #java #go #py #json #cs #rb
guarding

guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.

v0.2.6 bin+lib #guarding #model #tree-sitter #architecture-tests #arch-unit #function-name #guardian #archunit
rst-traverse

A terminal based file manager

v2.0.1 app #traversal #rst-traverse #restructuredtext #binary #rust-traverse #operation
trigram

Trigram-based string similarity for fuzzy matching

v0.4.4 5.1K #string-matching #fuzzy-matching #string #fuzzy
ascii-alphabetic-char

Traits for ASCII alphabetic characters

v0.1.1 #alphabetic #ascii #ascii-alphabetic-char
cursed_strings

Annoyed that Rust has two string types? Well it doesn't any more

v0.1.1 nightly #cursed-strings #cursed #char-indices #deref
line-straddler

Determine how lines through text (underlines, strikethroughs, etc) should be rendered

v0.2.3 no-std #line #line-straddler #cosmic-text #color
todo_r

command line utility that keeps track of your todo comments in code

v0.7.2 bin+lib #todo-r #user3 #style #syntax #found #blumberg #respected #directory #command-line
gnu-echo-rs

A rewrite of the echo GNU core utility in rust

v0.1.0 app #gnu-echo-rs #echo #escaping #baumann
mdrss

generating RSS feeds from markdown files

v0.1.0 #rss #rss-feed #markdown
base16384

Encode binary file to printable utf16be, and vice versa

v0.1.0 no-std #base16384 #safety #slice-as-chunks
g2-unicode-jp

convert Japanese Half-width-kana[半角ｶﾅ] and Wide-alphanumeric[全角英数] into normal ones

v0.4.1 #unicode #kana #japanese #zenkaku #hankaku
asciicast

file format used by Asciinema

v0.2.2 1.5K #asciicast #asciinema #tty #ascii
auk_markdown

Markdown support for Auk

v0.1.0 #auk #markdown #auk-markdown #syntax
mdbook-multicode

Allows you to give multilanguage code examples, toggled by a spinner

v0.1.0 bin+lib #mdbook-multicode #multicode #spinner
html-escaper

HTML escaping wrapper for core::fmt::Formatter

v0.2.0 1.7K #boilerplate #formatter #html #macro
azusa

String index transformer for Rust utf8 to JavaScript utf16

v1.0.1 #javascript #string #string-index #utf8-to-utf16
zalgo-text

A command line tool for generating zalgo text

v0.1.0 app #text #mark #zalgo-text #world
mdbook-grammar

A preprocessor for mdbook that adds grammar code block support

v0.1.0 app #mdbook-grammar #mdbook #grammar
ykoath-protocol

Implementaion of YKOATH Protocol

v0.2.0 #ykoath-protocol #protocols #ykoath #html
latex_snippet

Convert even erroneous LaTeX snippets into HTML

v0.3.3 #latex #html #latex-snippet
hex-utilities

working with hexadecimal numbers

v0.1.5 #utilities #hex #hex-utilities #numbers #text
random-bytes

generate random bytes

v1.0.2 #random-bytes #byte #random
ohos-ime-sys

Bindings to the inputmethod API of OpenHarmony

v0.1.4 5.6K #harmony-os #input-methods #open-harmony #ffi
capnp_conv

capnp write/read traits to convert from structs to readers/builders

v0.3.2 2.7K #capnp-conv #capnp #capnp-enum #enums #readers-builders #write #read #void #list #capnp-struct
case-conv

Faster case conversion crate

v0.1.6 nightly #conv #case-conv #case #result #linux
anystr

An abstraction over string encoding that supports ASCII, UTF-8, UTF-16 and UTF-32

v0.1.1 no-std #ascii-text #wide-string #ascii #ascii-string #any
mdbook-snips

Markers for hidden lines in rust blocks within an mdbook

v0.1.3 bin+lib #mdbook #mdbook-snips #snips #snip
freecut

A cut optimizer gui for cutting rectangular pieces from panels

v0.1.12 app #optimization #bin-packer #cuts #gui
repub

convert markdown documents to epub

v0.4.1 app #ebook #markdown #repub #epub
mdx

in Rust

v0.0.4 bin+lib #mdx #markdown #mdx-ast #anyway
lithe

A Slim template engine by using Pest

v0.0.3 #text #cli #lithe #pest
xim-ctext

compound text en/decoder

v0.3.0 140 no-std bin+lib #xim #ctext #xim-ctext #en-decoder #mode
mdbook-open-git-repo

mdbook preprocessor to add a open-on-git-repo link on every page

v0.0.4 bin+lib #mdbook #markdown #git #page
mathematica-notebook-filter

mathematica-notebook-filter parses Mathematica notebook files and strips them of superfluous information so that they can be committed into version control systems more easily

v0.2.2 app #mathematica #version-control #parser #cli-parser
august

& program for converting HTML to plain text

v2.4.0 100 bin+lib #html #text #converter #text-html #html-converter
arabic-script

An expressive API for the characters of the Arabic script

v0.1.0 #arabic #unicode #script
memchr

extremely fast (uses SIMD on x86_64, aarch64 and wasm32) routines for 1, 2 or 3 byte search and single substring search

v2.7.4 21.4M no-std #substring-search #search #memchr #substring #memmem
cindex

CSV indexing library

v0.6.0-beta.1 120 #indexing #csv #indexer #text-processing #query
trie-match

Fast match macro

v0.2.0 650 macro no-std #double-array #match #macro #text #no-alloc
psfparser

A PSF reader written in Rust

v0.1.2 #psfparser #testing #run
prune

struct

v0.1.6 nightly #prune #struct
inkline

Display colorized ascii art to the terminal

v1.0.0 #terminal #inkline #ansi-colors #dyn-colors #dmmmmmmm #od-mmmmmmm-ny #hmmmmmmmmo #ommmmmmmmh #dmmmmmmmmmmno #smmmmmmmmmmmmmmy
stringedits

Edit trait and associated iterators for small edits to strings

v0.2.0 #stringedits #edit #replace #string #spellcheck-toy
char_reader

Safely read wild streams as chars or lines

v0.1.1 1.3K #reader #unicode #char #line
spongebobizer

Command-line utility that outputs its stdin, converted to 'sPonGeBoB cAsE', and a library to support it

v0.4.1 bin+lib #spongebobizer
sejong

Buffer is a buffer that can receive ASCII bytes different from keyboard and send out UTF-32 Hangul string. This buffer allows deletion by Jamo.

v0.1.5 #korean #hangul #input #localization
freetypegl

Rust build helpers and bindings for freetype-gl

v0.4.0 #bit-set #font #bit-flags
rust_lemmatizer

A lemmatizing package for use with a .csv dictionary of lemmas and their corresponding words

v0.3.0 bin+lib #nlp #lemmatization #rust-lemmatizer #vec
codex

Human-friendly notation for Unicode symbols

v0.1.1 16K #symbols #unicode #codex
kl-hyphenate

Knuth-Liang hyphenation for a variety of languages

v0.7.3 #text #typesetting #kl-hyphenate #language
mdbook-fix-cjk-spacing

mdbook preprocess that fixes CJK line breaks

v0.1.1 bin+lib #mdbook #cjk #spacing #break #space
text-template

Small template engine for use with plain text (e.g. creating text email), not intended for HTML.

v0.1.0 #plain-text #template #text
rep-cli

Replace text file in bulk

v0.1.0 app #productivity #cli #rep-cli #replace #bulk #file
rut

A small UTF-8 parsing library for applications that need to parse individual chars

v0.4.2 #rut #byte #conformance
redpatterns

a list of patterns for scanners 📟

v0.2.0 #regex #secret #pomsky
neardup

near-duplicate matching

v0.1.0 bin+lib #matching #hash #neardup #dataset #10
textgrid

working with PRAAT .TextGrid files with parsing, riting, manipulation, and history tracking modulesfor TextGrid data

v0.1.0 #text-grid #textgrid #interval #format #merge
rblcheck

Checks DNS RBLs

v0.5.1 app #rblcheck #rbl #dabl
remake

writing maintainable regex and managing symbol soup

v0.1.0 #remake #numbers #run-time
mdbook-latex

An mdbook backend for generating LaTeX and PDF documents

v0.1.24 app #latex #mdbook #mdbook-latex #suggestions
gfm-autolinks

Parse GitHub Flavored Markdown autolinks

v0.2.0 100 #markdown-it #markdown #gfm-autolinks #autolinks
ik-rs

chinese segment, ik-analyzer for rust

v0.7.0 #information-retrieval #search #tantivy #ik-analyzer
floem-peniko

Unofficial peniko crate for Floem

v0.1.0 #floem-peniko #floem #peniko
print-positions

providing string segmentation on grapheme clusters and ANSI escape sequences for accurate length arithmetic based on visible print positions

v0.6.1 13K #ansi-escapes #unicode #grapheme #ansi-escaping #unicode-text #text #escaping #ansi
ascii_converter

converting between different ascii representations

v0.3.0 170 #ascii #converter #binary #hex
thesauromatic

command-line thesaurus that returns related words when given a word. The output words are one per line, making it easy to process in shell pipelines.

v0.0.11 bin+lib #nlp #thesaurus #synonyms
catdream

Sleeping cat dreams your text

v0.1.0 app #catdream #text
rustascii

Display Rust in ASCII

v0.1.2 #ascii #donis #rustascii
pager

pipe your output through an external pager

v0.16.1 10K #pager #less #more
tre-regex-sys

Rust bindgen bindings to the TRE regex module

v0.4.1 170 sys #ffi #regex #tre #bindings #ffi-bindings
strip-tags

Strip HTML and PHP tags from strings

v0.1.0 no-std #tags #strip #php #html #sanitize #string
ascii-to-hex

A small, simple library to converting an ASCII text string into its hexadecimal equivalent

v0.1.1 #ascii #string #ascii-to-hex
eaverdeja-minigrep

minigrep from chapter 12 of the Rust lang book

v0.1.1 bin+lib #mini-grep #eaverdeja-minigrep #book
uiuifree-normalize

uiuifree text normalize

v0.1.1 #normalize #uiuifree-normalize #uiuifree
veloci_levenshtein_automata

Creates Levenshtein Automata in an efficient manner

v0.1.0 310 #automata #levenshtein-automata #levenshtein #fuzzy
gpl-memo

Gemachain Program Library Memo

v3.0.1 #memo #gpl-memo #gpl
anthropic-text-editor

A micro-CLI to apply tool calls from Anthropic for their text_editor_20250124 built-in computer use tool

v0.2.0 app #anthropic #claude #text-editors #cli #tool-calls
wordninja

port of the Word Ninja English word splitting library

v0.1.0 bin+lib #wordninja #ninja #string #summary #py #usticeinsuredome #nitedstatesinord #ionfortheuniteds #atesofamerica #rtoformamoreperf
product-os-content

Product OS : Content provides a complete solution for content management for the purpose of serving content via Product OS : Server

v0.0.4 #product-os #content #product-os-content
afrim-memory

handle of sequential codes easier for an input method

v0.4.2 #memory #ime #data-structures #memory-data-structure #afrim #node #rc
indent_tokenizer

Generate tokens based on indentation

v0.4.0 #tokenize #indentation #token #tokenizer
leetcode

solutions in Rust

v0.1.4 #leetcode #leetcode-rs
texoder

A text stream which can encode/decode text in several encoding formats

v0.0.5 #texoder
termwrap

Wrap Unicode text with ANSI color codes

v0.1.4 #color #fold #unicode #wrap #string
sarcasm

tExT creation and validation library

v0.1.0 app #encoding-decoding #sarcasm #text #fun #text-encoding #localization
code-tour

Enhanced example-based learning, i.e. awesome examples user experience

v0.2.0 macro #example #learning #cli #experience #tour #derive
h4x_re

Hacky Regex's

v0.2.4 #regex #h4x-re #h4x
munemo-rs

Turn an integer into a more rememberable word, or vice-versa

v0.1.1 #munemo-rs #munemo #codec #integer
simple_bencode

bencode encoder and decoder, that uses neither rustc-serialize or Serde. Instead, it serializes from / deserializes to a tree using a 4-branch enum.

v0.1.4 #bencode #simple-bencode #array #decode-error #string
mdbook-to-example

Turns an mdbook book into a Rust example

v0.1.0 #mdbook-to-example #mdbook #set-name #package-book
md2gemtext

for converting Markdown into gemtext

v0.1.0 bin+lib #gemini #markdown #md2gemtext #gemtext
bitranslit

Bi-directional transliterator for Rust. Transliterates strings according to the rules specified in the language packs.

v0.3.1 #transliteration #transliterator #bidirectional #greek #russian #iso-8859-1
markdown-linkify

Markdown preprocessor for substiting link shorthands to valid links according to configurable regexes and custom substitution implementations

v0.3.1 bin+lib #linkify #markdown-linkify #markdown
drive-image-searcher

A CLI tool to stream a drive image, and search for one or more byte patterns

v0.2.2 app #pattern #hex #image #ascii #hello #text #false #value
esl01-renderdag

Render a graph into ASCII or Unicode text

v0.3.0 700 #esl01-renderdag #renderdag #esl01 #scm
aki-gsub

substitude text command, replace via regex

v0.1.38 bin+lib #text #aki-gsub #filter
hearthstone

simulator written in Rust

v0.1.0 bin+lib #hearthstone
pig_latin

applying Pig Latin to text

v0.1.0 bin+lib #text #pig-latin #latin
chromalog

A customizable logger with dynamic color coding and file logging

v0.0.2 #logging #colored #console #file
squidge

shortens delimited data

v0.2.3 #shortener #delimited #data #shorten-line #config
genpdf

User-friendly PDF generator written in pure Rust

v0.2.0 6.0K #pdf #text-layout #element #family #page #text #table #file #document #system
uwl

A management stream for bytes and characters

v0.6.0 65K #stream #uwl #character #partial-eq #token #lexer
help_crafter

help message generator without hussle

v0.3.1 #help-message #help #command #crafter #parameters #hussle #start
igpay-atinlay

Translate text to Pig Latin

v0.1.0 #latin #igpay-atinlay #igpay #vowel
mdbook-last-changed

mdbook preprocessor to add the last modification date per page

v0.1.4 bin+lib #last-changed #mdbook-last-changed #page
usage-cli

CLI for working with usage-based CLIs

v2.1.1 3.9K bin+lib #script #cli #sh #page #language #documentation
uwu-rs

uwuifying library

v1.0.0 #owo #uwu #web
zoitei

alphabet conversions

v0.1.0 #synthesis #zoitei #convert #conversion #conversions
html-to-pulldown-cmark-events

Parse HTML to pulldown-cmark's events

v0.1.12 #events #pulldown-cmark #html
fwuffgrep

Basic implementation of a grep command written in rust

v1.0.0 bin+lib #fwuffgrep #source #fwuff-grep #grep-like #study-project
rustextile

Textile markup language parser for Rust

v1.0.2 #html #markup #textile #text #block #table #image
dvd-term

A bouncing ASCII art DVD logo (or custom text) for the terminal

v0.1.43 app #term #dvd-term #figlet #ascii-art
rls-vfs

Virtual File System for the RLS

v0.8.0 420 #rls #vfs #rls-vfs
mdbook-docslab

mdBook preprocessor for interactive code with docslab

v0.1.0 app #docslab #mdbook-docslab #mdbook #pre-processor #documentation
bigstr

A command-line tool to make string BIG

v0.1.1 app #big #bigstr #font #command-line-tool
pest_ascii_tree

Helper crates converting the parsing result of any pest grammar into an ascii tree

v0.1.0 1.6K #pest #ascii #tree #expr
ftd-rt

ftd

v0.1.5 #ftd #ftd-rt #markdown #static-site-generator #html-css-javascript #json #com-ftd #ft-code-repo
regex-intersect

Find out if two regexes have a non-empty intersection

v1.2.0 420 #regex #intersect #intersection #non-empty #match
mdbook-numthm

An mdbook preprocessor for automatically numbering theorems, lemmas, etc

v0.2.0 bin+lib #mdbook-preprocessor #mdbook #mdbook-pre-processor #katex #label #title #key
unicode_escape

decoding escape sequences in strings

v0.1.0 #escaping #unicode-escape #unicode #decode #char #python
mdbook-bib

mdbook plugin allowing to load and present a bibliography in BibLaTex format in your books and cite its references

v0.0.6 bin+lib #pre-processor #bibliography #bib #mdbook #plugin
node-emoji

Convert :emoji: to Unicode using GitHub’s and EmojiDB’s emoji names

v1.0.7 #emoji #unicode #markdown #github
ruby-string

A string type that tracks Ruby glosses attached to parts of it

v0.1.0 #text #cjk #furigana #bopomofo
vec-string-to-static-str

providing utilities for converting vectors of Strings into vectors of &'static str

v1.0.0 #static #string #vec-string #utilities #vec #unsafe
deeprl

DeepL client library with all the things (blocking)

v0.4.0 #blocking #text-translation #language #document #glossaries
rust_baht_text

Convert number to Thai Baht text

v0.1.0 #text #thai #numbers #baht
re_view_text_document

view that shows a single text box

v0.23.2 21K #text-document #view #document
chinese2digits

The Best Tool of Chinese Number to Digits. A useful tool in NLP and robot project.

v1.0.0 #nlp #digits #chinese #extract
rten-text

Text tokenization and other ML pre/post-processing functions

v0.18.0 200 #tokenize #rten #rten-text #tokenizer #onnx
imagecli

A command line image processing tool

v0.2.1 bin+lib #guide #image #imagecli
jpreprocess-njd

Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)

v0.12.0 160 #open-j-talk #text-to-speech #library
stringutil

A collection of useful string utilities

v0.1.0 #string-utilities #string #tool #utilities
faker_rand

Fake data generators for lorem ipsum, names, emails, and more

v0.1.1 1.8K #fake-data #faker #seedable-rng #first-name #word #ascii-digit #data
ftrace

trace files and paths

v0.2.1 app #strace #file #trace #fs #syscalls #path
unidecode

pure ASCII transliterations of Unicode strings

v0.3.0 248K #transliteration #ascii #unidecode #unicode #unidecoder
aoutils

A tiny utilities package to test publishing to crates.io

v0.1.1 #aoutils #io #ensure-newline #learning
lingua-portuguese-language-model

The Portuguese language model for Lingua, an accurate natural language detection library

v1.2.0 13K #language-recognition #lingua #language-detection #nlp
kolorz

A silly little library for printing kolored text to the terminal

v0.10.1 bin+lib #kolorz #terminal #text #information
TextToEmoji

converting words to emoji representations

v0.1.0 bin+lib #emoji #text #converter #representation
humnum

Human numeric sorting program — does what sort -h is supposed to do!

v0.2.0 #stdin #stdout #coreutils #stdio #numeric-sorting #human-numeric-sort
split_ext

Extension traits for splitting

v0.1.1 #split #ext #split-ext #splitting
libgrep-rs

searching through text

v0.1.4 #regex #libgrep-rs #text #filename #txt #grep-rs
bsky-sdk

ATrium-based SDK for Bluesky

v0.1.19 1.5K #bluesky #atrium #bsky #at-proto #sdk #post
rex-regextract

extracts key value pairs out of text

v0.1.1 app #extract #regex #kv #rex #text
mdbook-tectonic

An mdbook backend for generating LaTeX and PDF documents

v0.3.0-beta.4 app #mdbook #mdbook-tectonic #latex #bookshelf #md2tex
grader

Stream-based CLI for binary sorting text files via a given shell command

v0.2.0 app #cli #stream #sorting
smoltoken

A fast library for Byte Pair Encoding (BPE) tokenization

v0.2.0 #tokenize #bpe #artificial-intelligence #tokenizer
encoding_rs_transcode

Transcode text within writers using encoding_rs

v0.8.3 #charset #unicode #transcode #write
sourcepawn_lsp

Language Server implemention for the SourcePawn programming language

v0.9.6 bin+lib #arguments #progress #sourcepawn #server
readability-rs

Port of arc90's readability project to rust

v0.5.0 100 #html #converter #html-converter #text-html #text
beautify

your terminal

v0.2.0 #color #beautify #terminal #gradients
logseq

Handle Logseq Markdown files in Rust

v0.3.0 #logseq #markdown #space #python #markdown-formatter #knowledge-graph #knowledge-base #come
stylish-plain

stylish helpers for discarding styles

v0.1.0 4.5K no-std #stylish #style #plain #string
toktrie

LLM Token Trie library

v0.7.23 22K #finished #toktrie #state #interface #u8 #bool #usize #self #output
baselinker

BaseLinker.com API client

v0.2.2 #baselinker #client #field #e-commerce
engish

A language utility for sampling letters and building words

v0.2.0 #word #english #language #words
zuk

Yozuk command-line interface

v0.22.11 140 app #yozuk #zuk #interface #programmers #telegram-bot #chat-bot #uuid #a601 #d6d2950d #eb7f
shoebill

A Wadler/Leijen style pretty-printer

v0.1.5 #pretty-print #pretty #wadler #leijen #printing
xenon-lexer

The Xenon compiler's lexer

v0.3.0-alpha-0 #programming-language #lexer #language #xenon #programming
mdbook-image-size

A mdbook preprocessor which support image size syntax

v0.2.1 bin+lib #syntax #image-size #height #size #center #right #left
marko

Programmtically format text with Markdown syntax

v0.3.0 #marko #syntax #markdown #task #hash-map #false
periodic_table

that provides a list of elements in the periodic table

v0.5.0 200 #periodic-table #table #ion-radius #com #andrejewski
syllable

counter for use with reading level calculations

v0.1.0 #syllable #word-count #english #readability #flesch-kincaid
mail-internals-ng

[mail-api] _internal_ parts for the mail-api crates

v0.2.4 #mail-api #email #newlines #mail-internal
gears

core implementation

v0.1.7 #gears #transformation #document #specification #check #html
ranpha

Generate QR code of your Wi-FI network

v0.1.1 app #ranpha #schema #key #size
yeslogic-fontconfig-sys

Raw bindings to Fontconfig without a vendored C library

v6.0.0 190K sys #fontconfig #bindings #font
yamlate

A cross-language interpreter library that dynamically evaluates YAML attributes given variable bindings

v0.1.1 nightly #yaml #interpreter #library #bindings
common-words-all

Most common words sorted by ngram frequency

v0.0.2 #word #english #chinese #french #german #hebrew #russian #spanish #ngrams #italian
chars_counter

The trait that implements character counting for the &str type

v0.1.1 #counter #chars-counter #char #start
l

my personal library

v1.2.7 #regex #algorithm #true
strings

String utilities, including an unbalanced Rope

v0.1.1 3.9K #rope #string #iterator #substring #postitions
unicode_reader

Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters

v1.0.2 24K #code-point #unicode #grapheme #reader #unicode-text #text
trpl

A support crate for The Rust Programming Language book

v0.2.0 1.3K #book #trpl #programming-language #mdbook
lindera-ko-dic

A Japanese morphological dictionary for ko-dic

v0.42.4 25K #morphological #ko-dic #dictionary #korean
wkhtmlapp

Convert html to pdf or image

v1.0.2 #image #pdf #html #wkhtmltoimage #wkhtmltopdf
gestalt_ratio

Calculate the gestalt pattern matching ratio between two strings

v0.2.1 230 #string-matching #string-similarity #ratio #string #similarity #gestalt #matching
nightscape

night sky in terminal

v0.1.0 app #nightscape #terminal #recomendado #instalación
prettythanks

frontend to dtolnay/prettyplease library

v0.1.0 app #command-line #ast #rust-fmt #pretty #formatting
chinese-ner

A CRF based Chinese Named-entity Recognition Library written in Rust

v0.2.4 #ner #chinese #nlp
fuzzy_mime

A Mime-Type parsing library for rust

v0.1.0 #fuzzy #mime #fuzzy-mime #borrowed-media-type #fail #subtypes
ut1_blocklist

UT1 blocklist URL/domain filters

v0.3.2 #blocklist #filter #ut1 #adult-content
book_lib

that provides an API for managing PDFs on your mac device in one place

v0.1.3 #book #lib #place #pdf
triangular-earth-calendar

An alternative timekeeping system cli tool

v0.2.0 app #calendar #earth #triangular #tool #time #alpha
mdbook-bash-tutorial

A mdbook preprocessor that allows embedding Bash scripts as tutorials

v0.1.6 bin+lib #mdbook-preprocessor #mdbook #tutorial #markdown #bash #mdbook-pre-processor
swot

community-driven or crowdsourced library for verifying that domain names and email addresses are tied to a legitimate university of college

v0.1.0 #education #validation #email #name #college
phonet

A CLI tool and library to validate phonotactic patterns for constructed languages

v1.0.2 bin+lib #language #regex #phone #conlang #phoner #lang #statement
markdown_to_html_parser

parses Markdown syntax into HTML

v0.1.0 bin+lib #html-parser #render-markdown #markdown-to-html-parser #lib #parse-markdown
mini-openai

An OpenAI API client with minimal dependencies

v0.1.2 #chatgpt #llm #ollama #api-bindings #artificial-intelligence #openai
indoc

Indented document literals

v2.0.6 6.0M macro no-std #literals #string #multi-line #heredoc #string-literal #nowdoc #no-alloc
markdown-it-latex

Allows for the insertion of math in Markdown documents using LaTeX

v0.1.0 #latex #markdown-it-latex #markdown #syntax
caseformat

Power flow case data format

v0.2.0 170 bin+lib #format #caseformat #directory
uniwhat

Display the unicode characters text

v0.2.0 app #unicode #unicode-text #uniwhat #text #name #space #signature #mark
epubparse

Parse epub and convert to text-only Book structure

v0.2.2 #ebook #epub #structure #chapter #wasm #ncx
unicode-vo

Unicode vertical orientation detection

v0.1.0 190K #unicode #detect #unicode-vo #detection
text-diff

text diffing and assertion library

v0.4.0 27K bin+lib #diff #difference #assert #change
byte-num

converting numbers to bytes, and bytes to numbers in base 10!

v0.1.3 #byte #byte-num #num #10
meme_generator_utils

Meme generator utils

v0.0.7 #meme #meme-generator-utils #generator #meme-generator-rs #表情列表 #查看 #表情包生成器 #雕表情包 #用于制作各种
ucd

Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…

v0.1.1 4.9K #ucd #unicode #character #unicode-text #text #code-point
stam-python

STAM is a library for dealing with standoff annotations on text, this is the python binding

v0.10.2 650 #annotations #nlp #linguistics #standoff #text-processing #annotation
writedown-html

Writedown HTML backend

v0.1.0 #back-end #writedown #html
whitespace_text_steganography

A steganography strategy that uses whitespace to hide text in other text

v0.2.1 #steganography #text #white-space #steg
transition-table

transition table utilities for keyword parser

v0.0.3 no-std #hobby #utilities #transition
p4d-mdproof

Markdown to PDF converter

v0.1.2 bin+lib #mdproof #converter #p4d-mdproof #executable #leroycep-mdproof #md #folder
hunspell-sys

Bindings to the hunspell C API

v0.3.1 7.5K sys #hunspell #hunspell-sys #target #api
skribo

low-level text layout

v0.1.0 #text-formatting #text-layout #graphics #layout
anon-csv-cli

anonymise CSV files, providing various options to substitute real data with plausable fake data

v1.0.4 app #csv #anonymization #anon
icu-data

International Components for Unicode (ICU) data in Rust structures

v0.1.0 #unicode #mapping #ucm
markov-text

creating a small markov model for text generation

v0.1.1 #markov-chain #markov-text #text #model #random #markov #command #basic
ucfirst

Uppercase the first letter of a string

v0.3.0 850 #upper-case #string #casing #capital
mdbook-nix-eval

mdbook preprocessor for evaluating nix expressions

v1.0.1 bin+lib #nix #mdbook #nixos #expression
flipperzero-sys

Flipper Zero

v0.15.0 240 #instance #duration #cryptography #applications #version
pinot

Fast, high-fidelity OpenType parser

v0.1.5 1.5K #opentype #parser #font #opentype-font #graphics
render_as_tree

visualizing tree data structures via text

v0.2.1 #text #render #tree #parent
kanjidic_types

A collection of types encompassing the variety of data about kanji available from Kanjidic

v0.1.4 #kanji #kanjidic-types #kanjidic #japanese
utils_rust

这是一个用于各种实用功能的 Rust 库

v0.1.1-alpha.1 #decode #encode #utils-rust #自用代码
hex_d_hex

HexDHex is a Rust Crate that encodes and decodes byte data to and from its hexidecimal representation. For instance, one may wish, on ocasion that is, to translate a utf8 or ASCII string…

v1.0.1 130 #hex-d-hex #hex #byte
tree-sitter-stack-graphs-java

Stack graphs for the Java programming language

v0.5.0 230 bin+lib #java #stack-graphs #tree-sitter
write-html

writing HTML in Rust

v0.1.3 #generator #doctype #write #footer
pretty-xmlish

Pretty print XML-ish data with unicode art

v0.1.13 3.2K #pretty-xmlish #art #pretty #sql
emoji-printer

Replace emoji shortcodes in string with emoji unicode (":sushi:" -> 🍣)

v0.4.3 370 #emoji #printing #unicode #shortcodes
phonics

Phonetic spelling algorithms in Rust

v0.1.0 #phonics #lein #phonics-encoder #included #linguistics #language #record-linkage #phonetic-spelling-algorithms
chanoma

Characters Normalization library. 文字列正規化処理用のライブラリです。

v0.1.2 bin+lib #nlp #japanese #chanoma #文字列正規化 #理用の #language #を指定する #文字から #ファイルの
translation-api-cn

Some useful structs for calling Chinese translation api cloud services. A helper tool for bilingual cmdline tool.

v0.1.3 #bilingual #api-bindings #translation #10 #文件的 #tags
typos-vars

Source Code Spelling Correction

v0.9.1 11K #spelling #typos-cli #variables #typo #pr #checker #correction #development #spell-check
badascii-mdbook

Embed badascii diagrams in your mdbook. See badascii.me for the editor.

v0.3.1 220 bin+lib #mdbook #ascii #block #diagram #plugin #mdbook-plugins
mdtrans

Markdown parser and transformer using pest.rs, focused on flexibility to a project’s needs

v0.1.8 #mdtrans #pest #default #derive
mdbook-force-relative-links

An mdbook pre-processor to transform all local links to relative ones

v0.1.2 app #rust-book #themes #markdown #book
framework

detector for different frameworks in one projects

v0.2.4 #framework #project #detector #detect #framework-detector #projects #path
aki-mcolor

mark up text with color

v0.1.32 bin+lib #text #color #aki-mcolor #filter
b64

Base64 encoding/decoding support. Originally from rustc-serialize.

v0.4.0 1.0K #b64 #encoding #character-set
matcher_c

A high-performance matcher designed to solve LOGICAL and TEXT VARIATIONS problems in word matching, implemented in Rust

v0.5.7 #search-pattern #multi #string-search #text-search #text #string #pattern #search
emojicons-2021

Parse :emoji: notation to unicode representation

v2.0.1 #emoji #emojicons-2021 #emojicons #cat
sauron-markdown

parsing markdown into sauron node

v0.45.0 #node #sauron #md
mdbook-preprocessor-boilerplate

Boilerplate code for mdbook preprocessors

v0.1.2 3.7K #mdbook #mdbook-preprocessor #boilerplate #proprocessor
ngram-search

Ngram-based indexing of strings into a binary file

v0.1.1 bin+lib #ngrams #indexing #text-search #full-text
lunir

A universal intermediate representation oriented towards Lua

v0.2.0 #lunir #indentation #optimisations
csv-sanity

Sanitize and transform large CSVs with millions of records quickly and efficiently

v0.1.0 bin+lib #csv #csv-sanity #email #capitalize #transformer #regex #trim #choice #date #zipcode
sttx

belt for transforming speech-to-text data

v0.1.0 app #text-to-speech #time-series #utility #whisper-cpp #speech-recognition #stt
mrn-generator

generating valid MRNs based on ISO 6346

v0.3.2 bin+lib #category #mrn-generator #generator #office #6346 #country-code
yagenerator

Application that uses tinytemplate engine to generate text files. If you have a set of structured data, and need to generated a bunch of arbitrary types of files from it, this tool can help you to save some time.

v0.1.3 bin+lib #template-engine #template-generator #code #text #generator #engine #template
irssi-sys

Automatically generated bindings to irssi

v0.1.0 #irssi #irssi-sys #translation
ascii-rs

Process image into colored-ascii image

v0.1.2 #image #ascii #ascii-rs #image-engine #stdout
literumilo

A spell checker and morphological analyzer for Esperanto

v0.1.0 bin+lib #morpheme #spell-check #esperanto #analyzer
hebrew_unicode_utils

Some functions for processing Hebrew unicode characters

v0.4.3 460 #unicode-text #hebrew #unicode-characters #utf-8
gen3-charset

Pokemon Generation 3 Character Set Support (GBA)

v0.1.0 #gba #charset #gen3-charset #intl #jpn #set #fr
fsrenamer

refactoring invalid file/dir names

v0.2.1 bin+lib #name #fsrenamer #directory #names #backup
transcript

A transcriber for European scripts

v0.1.12 bin+lib #transcript #futhark #unimplemented #rules
saneput

Sane input reading library

v0.2.0 #saneput #input #ff #15 #space-tab
kudubot-bindings

Rust Bindings for the kudubot framework

v0.18.2 #chat #python #kudubot
edgesearch

Serverless full-text search with Cloudflare Workers, WebAssembly, and Roaring Bitmaps

v0.4.1 bin+lib #full-text-search #search-index #bitmap #search #full #text
sayit

String replacements using regex

v0.3.0 bin+lib #regex #text #tags #format
mykebab

convert snake_case strings to kebab-case

v0.1.0 #snake-case #mykebab #snake-to-kebab
futf

Handling fragments of UTF-8

v0.1.5 1.0M #utf-8 #futf #offset
words-count

Count the words and characters, with or without whitespaces

v0.1.6 2.1K no-std #word-count #character #utf-8 #letter #word #count
regex-automata

Automata construction and matching using regular expressions

v0.4.9 25.2M no-std #nfa-automata #dfa-automata #regex-automata #regex #dfa
markitdown

designed to facilitate the conversion of various document formats into markdown text

v0.1.10 140 bin+lib #atom #pdf #docx #markdown #image #excel #openai #deepseek #csv #html
goodname

assist you with cool naming of your methods and software

v0.2.2 #acronym #goodname #trie #match
deck

A command line tool to generate HTML presentations from Markdown documents

v0.3.0 app #markdown #slide #presentation #document
fifthtry-mdbook

fork of mdbook, only for ft-cli

v0.4.8 bin+lib #rust-book #mdbook #gitbook #book #markdown
cfasttext-sys

fastText ffi binding

v0.7.8 23K sys #fasttext #classify #bindings #api-bindings #text
mdast2minimad

converting markdown AST to minimad texts

v0.1.0 #markdown #minimad #termimad #mdast #convert
string-simple

containing some simple string utilities that I use in my other projects

v0.1.0 #utility #string #text
cyrla

two-way conversion between latin and cyrillic script

v0.1.0 #latin #cyrillic #serbian #script #converter-builder #prefix
good-mitm-rule

Use MITM technology to provide features like rewrite, redirect

v0.2.0 #rules #mitm #good-mitm-rule #logging #proxy #text-modify #action #filter #matching #url
code-splitter

Split code into semantic chunks using tree-sitter

v0.1.5 #tokenize #artificial-intelligence #nlp #split #code #tokenizer
lindera-dictionary

A morphological analysis library

v0.42.4 31K #morphological-analysis #library #dictionary #morphological #analysis #cc-cedict
gdnative-doc

Documentation tool for gdnative

v0.0.6 #documentation #gd-native #markdown #version
single_source

Generate code files from snippets in md tutorial files

v0.1.5 app #md #source #single #truth #tutorial #skip #generator
autoruby-cli

CLI to easily generate furigana for various document formats

v0.5.1 app #format #autoruby #localization #katakana #formats #txt #md
parattice

Recursive paraphrase lattice generator

v0.2.2 #nlp #paraphrase #generator #lattice #lattice-kmp
charmap

one-to-(none/one/many) character mapping

v0.2.2 no-std #nlp #iterator #no-std #text
doccy

brace based markup language

v0.3.2 bin+lib #html #markup-language #text #language #element #break
tectonic_bridge_harfbuzz

Expose the Harfbuzz C/C++ APIs to Rust/Cargo

v0.2.9 550 sys #tectonic #harfbuzz #tectonic-bridge-harfbuzz #typesetting #path #single #component #unused-imports
cli-colors

A CLI tool for outputting text in ANSI format with features like colors, underlining, boldening, and italicizing

v1.0.0 #ansi-colors #text #formatting #formatting-text #color
pocky

A framework for building your own static site generator

v0.5.2 #web #site #markdown #static-site #static
re_types_core

The core traits and types that power Rerun's data model

v0.24.0-alpha.1 37K #archetypes #re-types-core #default #multimodal
morse-nostd

A nostd version of the morse crate

v0.1.2 #morse #morse-nostd #encode #io
arbitrator

Format text based on a set of rules and regexes

v0.1.3 app #troff #typesetting #text
sm-search

way of searching through text - for people who are too lazy to use Regex

v0.1.3 260 bin+lib #sm-search #regex #search
typeline_ext_csv

csv parsing and serialization for typeline

v0.1.0 #stream #pipeline #shell #tl
allsorts-subset-browser

Temp fork of allsorts 0.15 - includes patch for subsetting fonts for browsers

v0.16.0 1.0K #opentype #true-type #font-shaping #parser #font #shaping #opentype-font
regex-filtered

Efficiently check an input against a large number of patterns

v0.2.0 9.5K #regex #multiple #prefilter #filtered-re2 #filter #pattern
html-query-extractor

HTML extractor for hq: jq, but for HTML

v0.2.2 650 #html #html-query-extractor #extractor #hq #extract
deepphonemizer

G2P model (inference only)

v1.0.0 #linguistics #deepphonemizer #phonemizer #g2p
prettify-markdown

Format Markdown at the speed of Rust

v0.2.0 #markdown #prettify-markdown #prettify #format-markdown #print #file-content
corollary

Cross-compiles Haskell into Rust

v0.3.0 bin+lib #corollary #convert #system #lazy-evaluation #declaration #hkt #recursion #haskell #cross-compiler #parsing-library
uiuifree-text-data

csv and excel convert

v0.1.10 #uiuifree-text-data #convert #text-data
pdf-min

Very minimal crate for writing PDFs

v0.1.12 #pdf #pdf-min #html #head
kincaid

A word statistics library in Rust

v0.2.4 #syllable #word-count #english #readability #flesch-kincaid
merge-whitespace-utils

Procedural macros for merging whitespace in const contexts

v1.1.0 #white-space #graphql #proc-macro #context #merge-whitespace
mdbook-unlink

A mdBook backend that validates local links

v0.1.0 app #mdbook-plugins #link #mdbook #mdbook-backend #unlink #true #chapter
fcnt

cmd-line tool for counting the number of files in given directories

v0.2.8 app #directory #fcnt #directories #size #src #src-package #filenames #mode #ignored #entries
webreg

A CLI tool for testing regexes against web pages

v0.1.0 app #regex #webreg #page #url #insensitive
flw

Process text via configurable tasks

v0.0.3 bin+lib #task #flw #csv #text #tasks #schema #yaml-config #replace #task-manager #word
mdbook-typstpdf

An mdBook backend that generates PDF output using Typst

v0.1.1 bin+lib #mdbook #typst #markdown #pdf #documentation
macro_colors

colorful printing macros

v0.2.0 #color #printing #macro
crossandra

A straightforward tokenization library for seamless text processing

v0.0.2 #tokenize #crossandra #literals #lexer #regex #lexing
trans-case

Transform case

v0.1.0 #text #transform #case
asimov-construct-cli

ASIMOV Construct Command-Line Interface (CLI)

v25.0.0-dev.0 app #asimov #cli #artificial-intelligence
ragtime

Easy Retrieval Augmented Generation

v0.2.0 #artificial-intelligence #rag #phi3 #arc #7b-instruct #document #generation #llama-backend #rag-qa-phi3-gte-qwen #model
ontodev_valve

A lightweight validation engine written in rust

v0.2.1 bin+lib #valve #validation #ontodev-valve
timeharsh

implements the timehash algorithm, an algorithm for creating user configurable, variable-precision sliding windows of time. Useful for binning time values in large collections of data.

v1.0.0 #timeharsh #timehash #abcdef #pdf #com-abeusher-timehash
djot

Djot parser written in pure Rust

v0.0.2 app #djot #markup
grace-cli

CLI tool for processing files and strings

v0.1.1 bin+lib #file #strings-processing #files-manipulation #cli #string
mdbook-checklist

An mdBook preprocessor for generating checklists and indexes

v0.1.1 app #mdbook #mdbook-preprocessor #markdown #mdbook-pre-processor #checklist
uniart

A CLI tool to convert images and gifs to terminal characters

v1.0.0 app #terminal #art #cli #unicode #ascii-art
termbook-cli

termbook is a command-line tool to build mdbook’s while executing bash codeblocks and collecting their output to become part of the mdbook

v1.4.6 app #markdown #terminal #common-mark #mdbook
text_distance

A collection of approximate string matching algorithms

v0.5.0 #string-matching #levenshtein #edit-distance #text #algorithm #string-matching-algorithm #string-distance #string
vidyut-chandas

A Sanskrit metrical classifier

v0.1.0 #sanskrit #classification #vidyut-chandas #vrtta
mdtranslation

prepare multi-lingual Markdown documents

v0.1.2 #translation #markdown #common-mark #localization #document
validations

arbitrary types

v0.1.1 #validation #io
txt_otp

A text based one time pad library

v2.0.0 #otp #txt #txt-otp #otp-rs
markdown-includes

Include other documents, table of content, or rust-doc in Markdown using a simple template system

v0.1.1 240 #include #markdown #readme #content #system #section
unic-ucd-segment

UNIC — Unicode Character Database — Segmentation Properties

v0.9.0 601K #segmentation #character-property #unic #unicode #grapheme #unicode-text #text
iwes

IWE LSP server

v0.0.31 bin+lib #server #iwes #markdown #lsp #md
find_unicode

Find Unicode characters, the easy way!

v0.4.0 app #unicode-characters #find #easy #character #unicode #cli
table_to_html

interface to convert a tabled::Table into a HTML table (<table>)

v0.8.0 750 #pretty-table #html #format #print
forming

lightweight architecture as code language. 架构描述语言

v0.1.0 app #forming #架构描述语言 #design #style #page #pattern #architecture-description-language #轻量级架构即 #码语言
tgo

Heterogeneous data type transtion, it's safe, lightweight and fast

v0.1.0 #schema #transform #low-code #type #tool
jput

puts and putc on unicode-width align for Rust

v0.1.2 #console #unicode #put #alignment #unicode-width #width
genex

Text-expansion library

v0.6.4 #text #text-templates #genex #modifier #grammar #weight #rules #hash-set
character-set

High performance set.contains(char)

v0.4.0 #character-set #range #character #testing
yitizi

異體字查詢 Get variant Chinese characters

v0.1.0 bin+lib #yitizi #nlp #sinograph #chinese #chinese-character
parser-web

Web API for extracting text from various file formats

v0.1.3 bin+lib #web-api #pdf #text-extraction #parser #document #rest
texc-latex

Contains LaTeX templates for TeXCreate

v0.1.6 #tex-create #latex #texc-latex #khan #te-x-create
jp_utils

Utils for working with Japanese text

v0.1.7 #parser #japanese #language #charset #traits
ligotab

Format delimited data with lightweight markup

v0.2.0 bin+lib #csv #markdown #restructuredtext #org #confluence
equt-md-ext

Extend event iterator

v0.2.7 #iterator #equt-md-ext #front-matter
paxcii

Transform images and videos to ascii

v0.5.1 bin+lib #paxcii #ascii #image #video #ascii-art #com-watch #v-jt-xl-ln-aas #command-line
darts

A double array trie, A Forward Maximum Matching Searcher

v0.1.0 #trie #string #text #string-search #text-search #search
rustex

auto-generated LaTeX files in Rust

v0.1.0 #report #latex #generate #reports #component
cronus_parser

The DSL parser for cronus API spec

v0.4.4 240 #typescript #cronus #parser #async-trait #string #eq #documentation
docfmt

A document formatter using Handlebars templates

v0.1.1 app #handlebars #documentation #formatting #template #handlebars-template
rustfits

A light-weight FITS file reader in Rust

v0.1.1 #fits #header #rustfits #data #table
harfbuzz-traits

Rust Traits for the HarfBuzz text shaping engine

v0.6.0 28K #font-shaping #opentype #unicode #font #shaping #unicode-text #opentype-font
vroom

Vim macros from the shell

v0.1.0 app #vroom #shell #juice #filename #stdin #lemon #mango #apple #orange #tomato
aki-stats

output the statistics of text, like a wc of linux command

v0.1.18 bin+lib #text #filter #statistics #en
readput

Fast and easy stdin input parsing for competitive programming in rust

v0.1.3 #parser #input #io #stdin #utility #parsing
asimov-sdk

ASIMOV Software Development Kit (SDK) for Rust

v24.0.0-dev.22 no-std #sdk #asimov #artificial-intelligence
rust-cheatsheet

a quick cheatsheet for rust

v0.1.0 bin+lib #rust-cheatsheet #cheat-sheet #art #rust-book #concepts #org-book
rusk

a Specification Language

v0.1.11 app #language #rusk #greeting #md #greet #kml #greeted
dequote

Remove nested quotes around text

v0.9.0 no-std #quote #trim #no-std #text
falcom-sjis

Falcom-compatibile Shift JIS implementation

v0.1.2 #unicode #charset #falcom-sjis
pdfrust

PDF parser

v0.5.3 bin+lib #pdf #parser #pdfrust #stream #font
uecho

The unicode of the echo command

v0.1.0 app #unicode #uecho #command #codes
combos

Print all permutations of a word list

v0.2.1 app #combos #shell #permutation #command-line-tool
spellabet

Convert characters into spelling alphabet code words

v0.2.0 #formatting #humanize #text #word #spelling-alphabet
kth-lines

Command line tool for filtering stdin lines that just work

v0.1.0 app #kth #kth-lines #line #nth #bash
marker

finding issues in CommonMark documents

v0.9.0 app #markdown #link #validation #common-mark #document
intname

Full English name for any integer of any primitive integer type

v0.2.0 #text-formatting #integer #name
markovish

Markov chain implementation for text generation

v0.2.2 #language #text #parser #generation
mul

Bengali stemmer

v0.1.0 #information-retrieval #stemming #nlp #bengali
pomsky-macro

Macro for converting pomsky expressions to regexes

v0.11.0 180 macro #regex #pomsky #macro #diagnostics
pdf-create

low-level, strongly-typed PDF creation library

v0.1.1 #font #page #stream #label #file-format #rationale
ascii-hangman-webapp

customizable Hangman game with ASCII-art rewarding for children (webapp version)

v5.7.2 #ascii-hangman #version #ascii-hangman-webapp #true #licence #getreu
character_frequency

counting character frequencies in a string concurrently

v0.2.0 #character #frequency #thread #concurrently #character-frequencies #characters
synterm

making beautiful REPLs and Shells with fish like as you type syntax highlighting

v0.3.1 #highlighting #synterm #command-line-tool #lexer #string #start #exit
encoding

Character encoding support for Rust

v0.2.33 194K #unicode #charset #ascii #encoder-trap
jellybean

Syntax highlighting with tree-sitter. Sweet colors.

v0.0.2 #syntax-highlighting #highlight #tree-sitter
markdown-it-autolink

A markdown-it plugin for parsing GFM autolinks

v0.2.0 #markdown-it #markdown #markdown-it-autolink #autolinks #add #md
diffy-fork-filenames

Fork of https://docs.rs/diffy that allows specifiying filenames

v0.4.0 #patch #merge #diff #filenames #diffy
p101_enc

convert Olivetti P101 program to and from different encodings

v0.9.0 #pipeline #enc #p101-enc #encoding #filter #c101
wit-bindgen-gen-markdown

Markdown generator for WIT and the component model, typically used through the wit-bindgen-cli crate

v0.3.0 #wit-bindgen #wasi #gen
wn-parser

parser for WordNet database files

v0.1.0 #parser #wn-parser #key
rulet

figlet implementation

v2.0.0 #ascii #figlet #rulet #text #character #smushing #previous
libxdiff

Rust bindings for the libxdiff C library

v0.2.0 no-std #api-bindings #libxdiff #mm-file
sesdiff

Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings in column A to the strings in column B. Also provides the edit distance (levenshtein).

v0.3.1 bin+lib #nlp #lemmatization #linguistics #text-processing
csvsc

Build processing chains for CSV files

v2.2.1 #csv #csvsc #column #documentation
subscript-compiler

A modern LaTeX rendition

v0.21.0 bin+lib #subscript #compiler #html #latex #math #typesetting #publish
clippy_lints

A bunch of helpful lints to avoid common pitfalls in Rust

v0.0.212 1.7K nightly #clippy #lint #plugin
yozuk-core-skillset

Set of default Yozuk skills

v0.22.11 #yozuk #skill #yozuk-core-skillset #telegram-bot
cozo-ce

A general-purpose, transactional, relational database that uses Datalog and focuses on graph data and algorithms

v0.7.13-alpha.3 #cozo #token-stream #cozo-ce #algorithm #documentation #artificial-intelligence
ucd-util

A small utility library for working with the Unicode character database

v0.2.2 209K #character-properties #character-property #unicode #database #character
yuto51942-servant

cli

v1.1.3 app #servant #language #tracking #timer #package #search #nyancat #emoji #version #subcommand
indentation

Formatter

v0.1.6 #indentation #formatter
tantivy-czech-stemmer

Czech stemmer as Tantivy tokenizer

v0.2.1 #tokenize #stemmer #tantivy #czech
tectonic_xetex_format

Tectonic/XeTeX engine data structures and their expression in TeX "format" files

v0.3.2 #tectonic #xetex #format #typesetting
sparklet

small flashcards library

v0.1.1 #text #sparklet
leven-distance

Compute operational differences between two sequences using the Levenshtein algorithm

v1.0.0 #levenshtein #levenshtein-distance #algorithm
xee-xpath

XPath 3.1 library API

v0.1.4 1.4K #xpath #xml #xee #api
naming_clt

Extract and convert the naming format(case|notation) of identifiers from files or stdin. Use this tool to prepare identifier name strings for further operations (matching,replacing...) on relative files

v1.1.0 app #code #naming #search-pattern #clt #pattern
write16

A UTF-16 analog of the Write trait

v1.0.0 8.2M no-std #utf-16 #unicode #traits
lindera-unidic

A Japanese morphological dictionary for UniDic

v0.42.4 22K #morphological #japanese #japanese-morphological #dictionary #unidic
llmvm-codeassist

A LLM-powered code assistant that automatically retrieves context (i.e. type definitions) from a Language Server Protocol server.

v0.2.0 app #artificial-intelligence #lsp #assistant #llm #code
ftd

ftd: FifthTry Document Format

v0.2.0 bin+lib #ftd #local-variables #format #markdown #json #dev #prose
is_utf8

functions to determine if a sequence of bytes is valid utf-8

v0.1.4 #utf-8 #avx #is-utf8 #simd
maud-pulldown-cmark

An adapter between maud and pulldown-cmark

v0.5.0 #pulldown-cmark #markdown #maud #adapter
webvtt-parser

WebVTT parser for Rust

v1.0.0-beta.4-rc.5 1.1K #web-vtt #subtitle #parser #rust
mdbook-morsels

Morsels plugin for Mdbook

v0.7.3 app #morsels #mdbook-morsels #mdbook #static-site #search #processing
search-in-terminal

A terminal-based search tool

v0.1.3 bin+lib #terminal #cs #search #tool
tiny_pretty

Tiny implementation of Wadler-style pretty printer

v0.2.0 8.4K #text #pretty #tiny #nest #documentation #print-options
blitztext

fast keyword extraction and replacement in strings

v0.1.1 bin+lib #fuzzy-search #aho-corasick #search #trie #keyword #fuzzy
unic-ucd-age

UNIC — Unicode Character Database — Age

v0.9.0 8.9K #age #character-property #unicode #unicode-text #text
tradukisto

Kinda useful natural language translation library and utility

v0.1.1 app #translation #computer-vision #copilot #utility #audio #localization #image
passgenr

generating cryptographically-secure passwords in Rust

v0.2.0 bin+lib #passgenr #ascii #word #hex #digits #utility
bidi

Unicode Bidirectional Algorithm (UBA)

v0.1.1 #bidi #bidirectional #unicode #text-processing
itext

Safe rust bindings to the iText 7 PDF generation library written in Java

v0.2.3 sys #itext #pdf #java #api-bindings #encoding #image #color-constant
gen-epub-book

Generate an ePub book from a simple plaintext descriptor

v2.3.2 bin+lib #ebook #epub #book #generate
md-localizer

Localize markdown with remote links

v0.1.1 app #md-localizer #localizer #md #link
aprilasr-sys

Low-level FFI bindings for the april-asr C api (libaprilasr)

v0.1.3 sys #audio #nlp #neural-network #wrapper
draconis

Small terminal welcome program written in rust

v2.4.8 app #draconis #system-information #terminal #neofetch #page #startup #world #liking #screenfetch
mdbook-fishextract

A mdbook preprocessor which handles mermaid graphs, offline, requires mmdc

v0.1.0 bin+lib #mermaid #mdbook #graph #fishextract #mmdc
findtext_doc

Search text in Document

v0.1.2 bin+lib #word-search #text-search #search #text #documentation #docx #word #cli
protobuf

Protocol Buffers - Google's data interchange format

v4.31.0-release 2.5M sys #upb #protobuf #format
moscato

Outline scaler for OpenType glyphs

v0.1.2 #true-type #glyph #opentype #loader #scaler #graphics
mdbook-svgdx

mdbook preprocessor to convert svgdx fenced code blocks into inline SVG images

v0.7.0 120 bin+lib #svg #mdbook #diagram #svgdx #image
stone-mason

simplify using the Amazon Bedrock Rust SDK aws-sdk-bedrockruntime

v0.1.0 #model #anthropic #stone-mason #bedrock #sdk #blob #client
stylish-stringlike

API for string-like objects that have styles applied

v0.3.0 #string #style #stylish-stringlike #tags #terminal #truncation-style
jpreprocess-jpcommon

Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)

v0.12.0 160 #open-j-talk #text-to-speech #library
aaa

CLI tool for work with 3a files

v1.1.1 app #aaa #file #color #parameters #body #comments #header #preview #value #frame
pcre2

High level wrapper library for PCRE2

v0.2.9 33K #jit #perl #pcre2 #regex #pcre
grammar-runner

A runner for grammar code

v0.1.0 nightly #grammar-runner #runner #grammar #mdbook-grammar
gregex-logic

Logic for the gregex crate

v0.1.1 #regex-automata #nfa-automata #regex #logic #nfa
highlight-pulldown

Process pulldown-cmark events to apply syntax highlighting to code blocks

v0.2.2 #syntax-highlighting #markdown #highlight #block #highlighter
unic-ucd-normal

UNIC — Unicode Character Database — Normalization Properties

v0.9.0 9.5K #unicode-normalization #unic #unicode-text #text #compose #normalization #unicode #internationalization
swc_plugin_import

babel-plugin-import rewritten in Rust

v0.1.8 #import #plugin #swc-plugin
utf8_slice

Lightweight UTF8 Slice Utilities

v1.0.0 500 #string #utf-8 #slice #unicode
string-cases

String case conversion utilities

v0.2.0 6.1K #utilities #cases #string-cases
terraphim-markdown-parser

Terraphim Markdown Parser

v0.1.0 bin+lib #artificial-intelligence #ai-agent #terraphim #personal-assistant #privacy
text-to-json

Convert text to json in rust

v0.1.3 500 bin+lib #json-text #json #rust #text
ezemoji

Catigoryized Emoji's

v0.2.1 #emoji #ezemoji #crab #clone #rain #website #github #io
minigrep_iaziz786

grep

v0.1.0 bin+lib #mini-grep #case-insensitive #filename
ewin-com

editor for Window(GUI) users.No need to remember commands

v0.0.2 #ewin #com #operation #settings #macro #file #term #edit #command #bind
ascii-read

BufRead-like methods for reading into an AsciiString

v0.1.0 #ascii-text #ascii-string #ascii #reader #string #ascii-buf-read #line
beemovie-cli

Bee Movie CLI Application

v0.1.3 app #cli #binary #text #generator
retest

Command-line regular expression tester

v0.2.3 700 app #regex #tester #retest #pattern
rmbs

Remove any fluff, corporate speak, or other bullshit from input text and print the TL;DR essence of what's being said, using the www.bullshitremover.com public LLM API

v1.2.0 app #artificial-intelligence #llm #summarize #condense
arg_input

ARGF-style input handling for Rust

v2.0.1 #text #cli #input
bionic-ebooks

Takes an EPUB file and generate a copy with bionic like font applied

v0.1.1 bin+lib #ebook #bionic #epub #applied
squ

command-line utility for converting quotation marks in plaintext files to "smart quotes"

v0.1.0 app #quote #command-line #convert #smart #line
count-md

configurable command-line tool and Rust library for Unicode-aware, Markdown-aware, HTML-aware word counting in Markdown documents

v0.1.0 bin+lib #count-md #document #text #documents #title #markdown
lingua-french-language-model

The French language model for Lingua, an accurate natural language detection library

v1.2.0 17K #language-recognition #lingua #language-detection #nlp
md-designer

A CLI tool for creating design docs in Markdown

v0.1.1 bin+lib #md-designer #markdown #list #file #rules #locally #md-design-doc #yaml #cd #git
mistletoe

Polyglot Kubernetes Package Manager

v0.1.2 bin+lib #manager #mistletoe #namespaces #deserialize #book #kubernetes #polyglot #name #ok #output
mdbook-open-gh-issue

mdbook preprocessor to add a open-on-github link on every page

v0.1.1 bin+lib #mdbook #mdbook-open-gh-issue #page
text-to-png

way to render text to a png image with basic options

v0.2.0 490 #font-rendering #png #svg #rendering #font
kradical_static

Ready-to-use EDRDG radical decompositions

v0.2.0 #kanji-radical #kanji #japanese #radical #decomposition
chardet

rust version of chardet

v0.2.4 10K #chardet #language #utf-8 #confidence
wxf-converter

Transform yaml, json, pkl files to wolfram

v0.3.2 app #wolfram #converter #exchange
gret

command line tool to search for patterns and show matches in a tree structure

v0.1.2 app #ripgrep #grep #search-pattern #pattern #regex
pdf_composer_definitions

PDF Composer definitions crate

v0.3.0 #markdown #pdf #composer #yaml #generate #margin
zbuf

“Zero-copy” string and bytes buffers

v0.1.2 #buffer #zbuf #byte #language #html5ever
ende

encoding/decoding unicode/utf-8/utf-16(ucs-2) code points

v0.1.0 bin+lib #decode #encode #encode-decode #utf-8 #utf-16 #unicode
struckdown

A structured markdown / commonmark library for Rust

v0.1.0 #cmark #common-mark #markdown #restructuredtext
mdxbook

Fork of mdBook, with more customizations and flexibility for programmers

v0.4.25 bin+lib #rust-book #gitbook #markdown #book
beemovie

Bee Movie crate

v1.0.1 #text #generator #beemovie #barry #benson
mdbook-asciidoc

mdBook backend for AsciiDoc generation

v0.1.0 app #mdbook #asciidoc #mdbook-asciidoc
lodestone

A website wrapper for FFXIV's lodestone

v0.5.0 #lodestone #ffxiv #profile #api-bindings #search #datacenter #id
lindera-cli

A morphological analysis command line interface

v0.42.3 800 app #morphological-analysis #cli #lindera #format #tokenize #morphological #analysis
deface

Lightweight markup to HTML converter

v0.1.2 app #markdown #markup #deface #converter #markup-language #rules #markdown-rendering #syntax #list #numbers
utf8_reader

A UTF-8 reader that read UTF-8 characters from object that implement Read trait

v0.7.0 #utf-8 #reader #traits #cursor #write #set-position
askama-filters

Extra template filters for Askama

v0.1.3 120 #askama #html #text-html #filter #text
scripter

A screenplay compiler

v0.4.1 app #latex #script #screenplay #compiler
text-parsing

Hierarchical text processing preserving char position info

v0.6.6 #info #parser #text
igo-rs

Pure Rust port of the Igo, a POS(Part-Of-Speech) tagger for Japanese (日本語形態素解析)

v0.3.0 bin+lib #nlp #japanese #tagger #dictionary
moenarchbook

Creates a book from markdown files

v0.1.1 bin+lib #mdbook #book #markdown #moenarch
text-tables

A terminal/text table prettifier with no dependencies

v0.3.1 800 #table #terminal #pretty #cli #ascii
dedent

Procedural macro for stripping indentation from multi-line string literals

v0.1.1 7.5K macro #indentation #proc-macro #formatting #string-formatting
umlauts

text transformation of german umlauts

v0.2.0-alpha.3 #umlauts #utf-8 #upper-case #äöü-äö-üß-ß #umlauts-owned
mdbook-all-the-markdowns

Render all markdown files in a given folder structure

v0.3.0 bin+lib #structure #mdbook #markdown #markdowns #md #config
wkhtmltopdf

High-level bindings to wkhtmltopdf

v0.4.0 950 #pdf #html #wkhtmltopdf #wkhtmltoimage #wkhtmltox
linetime

command line utility to add timestamps at the start of lines. The tool can either process lines from stdin or execute a command and process lines from the command's stdout and stderr.

v1.0.2 app #timestamp #optimization #bottleneck #line
twitch2csv

stream the chats of Twitch channels as a CSV

v0.1.1 app #twitch2csv #csv #mistermv #message-text #a67d6dac364a #abe3 #b153d255 #f0dd09c589e4 #b6d07625 #ae08
findtext_textfile

Search text in text file

v0.1.1 bin+lib #markdown #text-search #search #text #text-encoding #encoding
tdk_sozluk

TDK Sözlük API verilerini çeken bir Rust kütüphanesi

v0.1.0 140 #dictionary #tdk #sozluk #turkce #api-bindings
adobe-cmap-parser

parse Adobe CMap files

v0.4.1 53K #pdf #postscript #font #cmap
soundchange

implementing sound change algorithms in Rust

v0.0.8 nightly #linguistics #soundchange #logging #condition #str-to
difference

text diffing and assertion library

v2.0.0 465K bin+lib #diff #text #change #assert
mdbook-iced

An mdBook preprocessor to turn iced code blocks into interactive examples

v0.2.0 bin+lib #iced #mdbook #book #interactive #gui
mmseg

Chinese word segmenation algorithm MMSEG in Rust

v0.3.0 #chinese #nlp #segmenation
linkcheck

extracting and validating links

v0.4.1 5.3K #link-checker #linkcheck #link #check #documentation #links
presciidoc

Preprocessing AsciiDoc for other tools

v0.4.1 app #asciidoc #documentation #redhat
rosie-sys

build or link to librosie to access the Rosie Pattern Language

v1.3.1 sys #regex #rosie #fsa #matching #pattern-matching
mdbookshelf

Create epubs from a list of mdbook repositories

v0.1.2 bin+lib #ebook #epub #rust-book #mdbook #config #repository
hashtag-regex

regex matching hashtags accoding to the unicode spec: http://unicode.org/reports/tr31/#hashtag_identifiers

v0.1.1 #hashtag #regex #emoji
markdown-table-formatter

Markdown table formatter fully compliant with Unicode 15.1.0

v0.3.0 600 #markdown-tables #table-formatter #formatter #table #markdown #east-asian-width
ryaspeller

lib for searching typos in text, files and websites

v0.1.4 bin+lib #spell-check #yandex #spelling #api-bindings #spell-checking #spellcheck #website
wcount

CLI word counting tool

v0.1.0 app #csv #word-counter #cli #word #counter
carlotk

The main library for Carlo, a simple interpreted programming language

v1.1.0 #carlotk #subcommand #flags #language
static_table

creates pretty tables at compiler time

v0.7.0 160 macro #pretty-table #macro #time #time-table #print
psa

PSA(Project structure analysis) is a analyzer for analysis project struct

v0.1.1 #psa
dekor

styling and character repository in Rust

v0.2.2 150 #character #utf-8 #terminal #text-styling #console #utilities #development-tools-console
ogrep

searching in indentation-structured texts

v0.4.0 app #grep #indentation #outline #regex #search
wz

Count words, fast

v1.0.3 app #word-count #line-count #wc #word #line #byte
rew

A text processing CLI tool that rewrites FS paths according to a pattern

v0.3.0 bin+lib #path #regex #rename #pattern #tool
ragzilla

providing tools for RAG (Retrieval-Augmented Generation) pipelines

v0.3.2 #rag #artificial-intelligence #parser #transcribing #embedding #pipeline
kanpyo

Japanese Morphological Analyzer

v0.1.1 bin+lib #japanese #morphological #analyzer #nlp #natural-language-processing
dprint-plugin-sql

SQL formatter for dprint via sqlformat-rs

v0.2.0 #formatter #sql #dprint-plugin-sql #formatting
mdlynx

Small, fast utility to find broken file links in Markdown documents

v0.1.0 app #markdown #broken-links #document #documents #parallel
rdg

Random data generator for the command line

v0.1.1 app #regex #random #line #string #word-list
rsrpp

project for research paper pdf

v1.0.12 #rsrpp #parser #field
latin1str

Windows-1252 string types

v0.1.3 #latin1str #encoded #nul-terminated #utf-8 #ascii #slice #nul-bytes #encoding
hvm-core

massively parallel Interaction Combinator evaluator

v0.3.0-hvm32.compat.4 nightly no-std bin+lib #hvm-core #hvm #port #file #algorithm #locking #evaluator
ed_join

Implemtation of Ed-Join Algorithm for string similarity join

v1.1.1 bin+lib #string-similarity #string #similarity #text-processing
rusttyper

Basic text layout, using rusttype

v0.6.1 #color #rusttype #rusttyper #font #emoji #word-wrapping
unicode_names

Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…

v0.1.7 no-std #unicode #name #unicode-text #text
regex-cli

A command line tool for debugging, ad hoc benchmarking and generating regular expressions

v0.2.1 app #debugging #nfa #dfa #debug #cli
cw

Count Words, a fast wc clone

v0.7.0 bin+lib #word-count #wc #clone #word #count
ron_to_table

pretty print RON as a table

v0.7.0 160 #pretty-table #ron #format #print
chinese_segmenter

Tokenize Chinese sentences using a dictionary-driven largest first matching approach

v1.0.1 #chinese #tokenize #hanzi #segment #localization
character-stream

Helper data structures for reading UTF-8 characters from a stream

v0.13.0 #iterator #unicode #reader #wrapper #stream
mdbook-shiftinclude

mdbook preprocessor for file inclusion with shift

v0.1.0 app #mdbook-preprocessor #mdbook #mdbook-pre-processor #shift #indent
encoding_c

C API for encoding_rs

v0.9.8 14K sys #c-api #charset #unicode #ffi
unicode_converter

CLI tool to convert data between various Unicode encodings

v0.1.2 bin+lib #converter #unicode #utf-32 #utf-16 #utf-8 #cesu8 #encoding
lindera-filter

Character and token filters for Lindera

v0.32.3 9.7K #morphological-analysis #library #filter #morphological #analysis #tokenize
syllarust

quickly counting syllables

v0.2.0 110 #nlp #syllable #text #language
utf8-command

UTF-8 encoded std::process::Command output

v1.0.1 110 #command-output #utf-8 #command #exit-status
uniaxe

replace Unicode letters with Ascii equivalents

v0.1.1 #ascii #unicode #cleaning #text-processing #equivalent
simplecc

Chinese Convert library (partially) compatible with OpenCC's dictionaries

v0.2.2 #opencc #simplecc #dictionary #open-cc
terminal-supports-emoji

Check whether the current terminal supports emoji

v0.1.3 23K #emoji #stream #terminal-supports-emoji
korrektor

work with Uzbek language text processing

v0.3.1 #uzbek #korrektor #language #text-processing
xlsxwriter

Write xlsx file with number, formula, string, formatting, autofilter, merged cells, data validation and more

v0.6.1 27K #xlsx #excel #xlsxwriter #api-bindings #libxlsxwriter
simple-xml-builder

XML builder/writer

v1.1.0 7.2K #xml-element #simple-xml-builder #xml #builder-writer #file
backslash

parsing escape characters

v0.2.0 2.2K #character #backslash #characters #io #awk
unicount-lib

Alphabetic counter supporting unicode

v0.1.4 170 #unicode #unicount #unicount-lib #vec #ad
user_doc-tests

Tests for user_doc

v1.0.3 #documentation #user #user-doc
grammateus

facilitate working with Ancient Greek words

v0.2.2 #ancient-greek #diacritics #word #greek #ancient
assert-text

the testing macro tools

v0.2.10 130 #text #assert #assert-text
spellcheck_toy

a basic spellchecking library based on edit distance

v0.3.2 #spell-check #distance #toy
pact_matching

Pact-Rust support library that implements request and response matching logic

v2.0.0-beta.1 4.3K #pact #testing #cdc #matching #logic #content #matcher #rules #node #bodies
shapdf

Create Shapes into PDF

v0.1.0 bin+lib #pdf #shape #shapdf #pdf-generation
wordshk_tools

A combination of parsers and other tools for words.hk (粵典)

v3.16.0-beta.9 #dictionary #nlp #cantonese #parser #wordshk #hk #粵典
nlprule-build

Build tools for a fast, low-resource Natural Language Processing and Error Correction library

v0.6.4 5.0K #nlp #grammar #spelling #text
genere

randomization of text respecting grammatical gender of sentences

v0.1.2 bin+lib #text #sentence #generator
tectonic_xetex_layout

XeTeX's font loading and layout interface encapsulation, as a crate

v0.2.4 550 sys #tectonic #tectonic-xetex-layout #xetex #typesetting #component #path #single
eudex

A blazingly fast phonetic reduction/hashing algorithm

v0.1.1 1.0K #nlp #dictionary #soundex #search
unicode-box-drawing

Unicode box-drawing characters

v0.2.1 #character #hi-doc #unicode-box-drawing #characters
latex-to-html

Latex to html converter

v0.1.2 bin+lib #latex #html #latex-to-html #converter #label #equation #begin #end #forms #enumerate
gecliht

A disparate collection of text manipulation and formatting algorithms

v0.2.0 #soundex #stemmer #nlp #format #text
dvi2html

converter

v0.2.0 #html #converter #dvi2html #com-kisonecat-dvi2html
utf16-ext

Extensions for reading and writing utf-16

v0.1.0 150 #utf-16 #io #utf16-ext
typos-dict

Source Code Spelling Correction

v0.12.11 12K #spelling #typos-cli #typo #checker #correction #development #spell-check #monorepo #pr
strip-ansi-escapes

Strip ANSI escape sequences from byte streams

v0.2.1 654K #ansi-term #ansi-escapes #ansi-escaping #ansi-terminal #terminal
economic_indicator_finder

A finder for extracting economic indicators from paragraphs

v0.1.1 #economics #finder #economic-indicator #paragraph #text-processing
endf_parser

parsing ENDF-6 format nuclear data

v0.2.0 #nuclear-data #parser #endf #basics
bpmf_py

A Bopomofo and Pinyin library

v0.1.0 #pinyin #mandarin #bopomofo #parser #convert
token-dict

basic dictionary based tokenization

v0.1.0 #tokenize #token #dictionary
stylish-html

stylish helpers for writing styles as HTML elements

v0.1.2 4.5K no-std #stylish #html #stylish-html #element #string
mdbook-translation

prepare multi-lingual mdBook books

v0.1.1 app #translation #mdbook #localization #markdown #pre-processor #book
gesha-core

Core functionality for Gesha project

v0.0.12 #gesha #gesha-core #generator
dictcc

Rust API for reading and querying the dict.cc offline translation database

v0.1.1 bin+lib #dictionary #database #dictcc #translation
anagrambot

find anagrams of words

v1.0.1 #anagrams #word #anagrambot #words
rcut-lib

rcut is a Rust replacement for GNU cut that supports UTF-8

v0.0.52 #rcut #lib #rcut-lib #character #cut #box
typeline

Efficient, Type-Safe Pipeline Processor

v0.1.0 bin+lib #shell #stream #pipeline #tl
mdbook-typst-math

An mdbook preprocessor to use typst to render math

v0.1.1 bin+lib #mdbook #typst #mdbook-preprocessor
strizer

minimal and fast library for text tokenization

v0.1.0 #tokenize #strizer #string-tokenizer #stream-tokenizer
mdbook-webinclude

Preprocessor for mdBook that includes content from URLs

v0.1.0 app #mdbook-preprocessor #mdbook #url #mdbook-pre-processor #webinclude #escaping #text
e_book_sync_library

Synchonize e-book with your local e-library

v0.3.6 bin+lib #ebook #sync #utility #config #folder
unicode-canvas

creating text base drawing

v0.1.1 #canvas #widgets #tui #text
shallow

long text

v0.2.0 #shallow #character-shallow #mode #text #testing
demoji

Remove all emojis from a string

v0.0.3 #emoji #string #demoji
wordbreaker

A Unicode-aware no_std crate (requires alloc) that rapidly finds all sequences of dictionary words that concatenate to a given string

v0.3.0 no-std #dictionary #text #concatenation #concatenations-for
const_format_proc_macros

detail of the const_format crate

v0.2.34 3.1M macro no-std #concat #proc-macro #macro #formatting #no-std #arguments #format
once-cell-regex

just gives you the regex macro from the once_cell docs!

v0.2.1 19K no-std #regex #lazy-evaluation #static #documentation
chisel-lexers

Chisel backend lexers/scanners

v1.1.0 #lexer #parser #chisel-lexers #input
text_manipulation_rs

generating random placeholder text in different languages

v0.1.3 #language #text-manipulation #random-text-generate #text #dictionary
stringsort

Pathological sorting of string characters

v2.0.1 #stringsort #character #string #afterwards #characters
jg

Jeff Goldblum (jg) is a command-line JSON processor. jg searches for structural patterns in json input and prints each json object that matches the pattern.

v0.1.6 bin+lib #search-pattern #pattern #json #grep #selector
github-slugger

A slugger for GitHub headings

v0.1.0 320 #markdown-it #markdown #slug #heading
minigreper

Small grep style cli from the book

v0.1.0 bin+lib #minigreper #book
pikchr-cli

PIC-like diagramming language to SVG converter

v0.1.2 app #svg #markdown #pic #md #html
text-sanitizer

convert text to plain ASCII text

v1.6.0 #utf-8 #unicode #ascii #sanitizing #text-processing
wfst4str

Python library based on rustfst for manipulatig strings with wFSTs

v1.0.4 #python #fst #nlp #linguistics #wfst #string
oxcomm

using Google Translate on the fly

v0.1.2 #text-translation #text #language #google #translation
hashlogs

Command-line utility that hashes the part before a space on each line from stdin with blake2b keyed with an ephemeral randomly-generated key and writes to stdout

v1.0.2 app #cryptography #hash #hashlogs #stdout #cryptographic-hashes
vtext

NLP with Rust

v0.2.0 180 #nlp #tf-idf #tokenize #levenshtein #text-processing
aki-mline

match line, regex text filter like a grep of linux command

v0.1.32 bin+lib #filter #text #aki-mline
cmark2tex

A small utility to convert markdown files to pdf exploiting tectonic

v0.4.0-beta.1 bin+lib #tex #cmark #common-mark
spandex-hyphenation

Knuth-Liang hyphenation for a variety of languages

v0.7.4 #text #typesetting #hyphenation #language
mdbook-playscript

Preprocessor for mdBook, which styles stage play scripts

v0.5.0 bin+lib #markdown #play #pulldown-cmark #stage #script
charjpoet

Charj Poet is a API for write to .cj language

v0.1.0 #charjpoet #poet #properties #md
asimov-dataset-cli

ASIMOV Dataset Command-Line Interface (CLI)

v25.0.0-dev.6 290 bin+lib #asimov #cli #artificial-intelligence
dhoni

converting Bengali text into their phonetic counterpart

v0.1.0 #phonetic #avro #bengali #bangla
dd

a clone of the unix coreutil dd

v0.4.0 app #dd #exit #file #synopsis #block #ascii #directory #ebcdic
mdtable-cli

that makes creating tables in markdown much easier!

v1.1.1 app #md #table #markdown-tables #markdown
base100

Encode your data into emoji

v0.4.1 app #emoji #base100 #simd #base64 #input #memescale
pdfutil

PDF document manipulation

v0.4.0 app #pdfutil #object #lopdf #page #document #operation #subcommand #pdf
rustinsight

The launcher app for the interacive book

v0.10.0 bin+lib #book #rustinsight #launcher #lab #com
txt_to_md

Command converting from a txt file to a markdown file

v0.1.1 app #markdown #text #txt #file #md
shelldon

your new Rust-powered buddy with GPT features!

v0.1.0 app #artificial-intelligence #gpt #shell #prompt
word_filter

A Word Filter for filtering text

v0.8.1 no-std #filter #string #word #censor
tantivy-object-store

A tantivy Directory implementation against object stores (S3, GCS, etc.)

v0.1.0 #search-engine #full-text-search #object-store #search
seven_seg

Seven-segment digital display for terminal

v0.1.2 #text #format #combine-text #sevseg-four
ngrams

Generate n-grams from sequences

v1.0.1 2.3K #ngrams #sequence #documentation #org-wiki-n-gram
scrambler

command line tool to scramble letters

v0.1.1 app #letter #word #scrambler #scramble
markdown-heading-id

Filter for pulldown-cmark which converts headings with custom ID

v0.1.0 2.6K #markdown #pulldown-cmark #heading-id
corpus-preproc

A preprocessor for text and HTML corpora

v0.1.0 app #pre-processor #corpus #text #cli #word #character #mark
asimov-prompt

ASIMOV Software Development Kit (SDK) for Rust

v25.0.0-dev.7 230 no-std #asimov #sdk #artificial-intelligence
bocu1

BOCU-1 compressed unicode encoding

v0.1.0 #unicode #unicode-text #compression #text
code-to-pdf

Generates a syntax-highlighted PDF of your source code

v0.2.0 bin+lib #pdf #font #path #margin #define #ignore #image #overflowing #error-tolerant
ced

Dead easy csv editor

v0.2.2 bin+lib #csv #cli #ced #editor #text-processing #front-end
ucd-generate

A program for generating packed representations of the Unicode character database that can be efficiently searched

v0.3.1 app #generate #unicode #fst #character #table
rust-cedar

efficiently-updatable double-array trie in Rust (ported from cedar)

v0.1.0 #trie #string #text #string-search #text-search #search #darts
lines_lossy

extension to BufRead with a function lines_lossy that works like BufRead::lines but with lossy UTF-8 decoding

v0.1.0 #lossy #utf-8 #bufread #string
br-pdf

PDF Invoice Processing

v0.0.2 #br #inc #pdf #processing
tiny-gradient

Make your string colored in gradient

v0.1.0 5.8K no-std #ansi-term #gradient #tiny-gradient #color #terminal #cli #ansi-terminal
encoji

Emoji based encoding and decoding. 🔥🔥🔥🚀

v0.1.1 #fire-fire #encoji #emoji
terminal_cli

A standalone library with no-std support for command line terminal interfaces. With autocomplete support, helpers for commands and properties and a prompt implementation.

v0.2.0 no-std #terminal #properties #cli #newlines #validation #crlf
tb_normalization

normalization utf8 string, loc dau vietnamese and some language

v1.0.0 bin+lib #normalization #utf-8 #locdau #vietnamese
lingua-spanish-language-model

The Spanish language model for Lingua, an accurate natural language detection library

v1.2.0 16K #language-recognition #lingua #language-detection #nlp
bookrafter

This repository contains code related to bookrafter rendering

v0.1.0 app #markdown-renderer #book #bookrafter #books #rendering #markdown #renderer
mdoc

Modern PDF creation through Markdown and LaTeX

v0.3.0 bin+lib #latex #mdoc #bibliography #document #markdown #documentation #djot #compiler
csvre

replacing data in CSV columns with regular expressions

v0.1.0 app #regex #csv #command
jpreprocess-window

Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)

v0.12.0 170 #open-j-talk #text-to-speech #library
clparse

A command line tool for parsing CHANGELOG.md files that use the Keep A Changelog format

v0.9.1 bin+lib #keep-a-changelog #changelog #parser
zw

encoding and decoding text using zero-width characters

v0.2.0 bin+lib #character #zw #encode #characters
const-utf16

Utf8 to utf16 conversion functions for use in const contexts

v0.2.1 #utf-16 #utf-8 #const #context
num2en

For converting integer and decimal numbers into English cardinal or ordinal number words

v1.0.0 #word #numbers #english-words #cardinal #english #ordinal #words
untex

Understand and manipulate TeX files with ease

v0.4.0-beta bin+lib #latex #formatter #parser #lexer #document #tex
yozuk-helper-english

English NLP utilities for Yozuk

v0.22.11 #yozuk #yozuk-helper-english #english #telegram-bot
convert_encoding

Convert encoding of text files in batch

v0.1.0 app #encoding #convert #convert-encoding
sudachiclone

sudachiclone-rs is a Rust version of Sudachi, a Japanese morphological analyzer

v0.2.1 bin+lib #japanese #morphological #japanese-morphological #analyzer #sudachi
cute_strings

colorize strings in the terminal

v0.1.1 bin+lib #string #coloring #cute #terminal
rpdf

PDF command-line utils written in Rust

v0.1.3 app #annotations #command-line-utilities #pdf #cli #annotation
bos_books_codes

that handles 3-character Bible Books Codes

v0.1.2 #book #codes #bible #usfm #osis #books
lindera-wasm

A morphological analysis library for WebAssembly

v0.42.5 370 #morphological-analysis #library #wasm #morphological #analysis
wordnet

Read a wordnet dictionary in Rust

v0.1.2 #wordnet #nlp #dictionary
md-include

include any file in markdown files

v0.1.0 app #markdown #include #file
borderrs

Add stylish borders around your text and datastructures

v0.1.1 #ansi-term #unicode #ansi-terminal #ascii #terminal #ansi #data-structures #cli
naromat

Convert text to narou novel format

v0.3.1 bin+lib #format #naromat #text-file #converter
whitespace

Encode arbitrary data whitespaces and vice versa

v2.0.0 #white-space #decode #encode #documentation #useless-things
lindera-ipadic

A Japanese morphological dictionary for IPADIC

v0.42.4 25K #morphological #japanese #japanese-morphological #ipadic #dictionary
latexify

Shared definition for turn a rust object into latex code

v0.1.0 #latexify #latex #la-te-xify #writer #bibliography #bibtex #parser
encoding-next-types

Traits and types for the encoding package

v0.2.0 1.2K #unicode #encoding-next #charset #package
czv

performing CSV-related operations for data engineering and analysis

v0.0.2 #csv #library #data #price
mdbook-extended-markdown-table

Preprocessor for mdBook that generates tables with merged cells from ASCII text

v0.1.0 bin+lib #mdbook-preprocessor #mdbook #markdown-tables #markdown #mdbook-pre-processor #diagram #table #build-utils
codegenrs

Moving code-gen our of build.rs

v3.0.2 2.5K #codegen #codegenrs #development
git-blamediff

A program to automatically annotate changes to a file in git(1)

v0.1.2 bin+lib #git #diff #text #utility
git-busy

A wrapper around "git commit" that generates the commit messages for you

v1.0.0 bin+lib #git #commit #gpt-3 #artificial-intelligence #cli #gpt3
skyspell_core

skyspell core library

v5.0.0 #spell-check #skyspell #skyspell-core #line #struct #folder
mdbook-obsidian

mdBook preprocessor to render Obsidian specific syntax

v0.1.0 bin+lib #mdbook-preprocessor #mdbook #obsidian #markdown #mdbook-pre-processor
braille_pics

producing text-art pictures using Braille characters

v0.1.1 #character #braille #bit #braille-pic #false #bounded #characters #16 #d12345678d12345678d12345678d12345678d12345678d12345678
test-catalog

Collect and export test cases as a catalog

v0.1.0 bin+lib #catalog #testing #test-catalog #version #case #test
mdbook-reference-table

mdBook preprocessor to create reference tables

v0.1.0 app #table #mdbook #mdbook-reference-table #pre-processor
mdbook-numeq

An mdbook preprocessor for automatically numbering centered equations

v0.4.0 bin+lib #mdbook-preprocessor #mdbook #katex #mdbook-pre-processor #numeq
ankiding

Creating Anki-Flashcards within Markdown!

v0.1.0 app #markdown #ankiding #latex #anki #decks
yhy-email-encoding

Low level email encoding RFCs implementations

v0.0.2 #email #encoding #utf-8
latex-thebib

Clean and sort legacy TeX bibliographies written using ‘thebibliography’ via the refactor sub-command. Compile BibTeX files to legacy thebibliography TeX code using the compile sub-command…

v0.3.4 app #latex-thebib #latex #thebibliography #2022
asimov-cli

ASIMOV Command-Line Interface (CLI)

v25.0.0-dev.4 bin+lib #asimov #artificial-intelligence #cli #ai
top-english-words

retrieve top words from the English language

v1.1.1 #english-words #word #english #popular #frequent
aklat

create books from markdown files (like Gitbook)

v0.0.20 bin+lib #gitbook #rust-book #book #markdown
crypto-invert

Unicode Upside-Down Mapping

v1.0.1 #crypto-invert #encode #mapping #text #testing
ascii_code_finder

find ascii code of a character or get a character by its ascii code

v0.1.0 #ascii #convert #finder
lines

Utililities for iterating readers efficiently line-by-line

v0.0.6 #text #streaming #line
spongedown

Converts markdown to html with svgbob support

v0.5.0-alpha.1 #svg #markdown #bob
basic_lexer

Basic lexical analyzer for parsing and compiling

v0.2.1 #tokenize #line-comment #tokenizer #compilation #set-line-comment
szovegertesimutato-score

Calculate szovegertesimutato score for a given text and language

v0.1.0 #nlp #readability #szovegertesimutato #text-analysis #language
math-text-transform

Transform greek letters, latin letters, or decimal digits into certain variants from the mathematical alphanumeric symbols Unicode block (U+1D400–U+1D7FF). For example to bold, italic, script or double-struck.

v0.1.1 bin+lib #math #typesetting #unicode #unicode-text #text
admerge

Merge multiply sources into one, with advanced options

v0.1.3 #concatenation #merge #utilities #concatenate #file
toml_to_table

pretty print TOML as a table

v0.7.0 170 #pretty-table #toml #format #print
bytescolor

A versatile Rust library for colorizing strings and byte data in terminal applications using ANSI escape codes

v0.1.0 #ansi-term #ansi-terminal #byte-color #byte #terminal
gqlog

👾 filter your json logs with graphql 👾

v1.0.3 bin+lib #graphql #filter #logging
textocx

Tex code to Office MathML

v0.1.0 app #ms-office #textocx #mathml #download #latex #windows #clipboard
pomsky-bin

Compile pomsky expressions, a new regular expression language

v0.11.0 bin+lib #regex #pomsky #language #cli
blockcounter

Counts the blocks in a stream

v0.3.2 #string #gnuplot #file #text
color-convert

Support RGB,RGBA,HEX,HSL,HSLA,HSV,CMYK to convert each other, write by rust

v0.1.0 #color-convert #convert #color
wz-utf16

UTF-16 counters for wz

v1.0.2 no-std #wz #wz-utf16 #line
df_cp437

Decoder for CP437 to UTF-8

v1.1.0 #cp437 #utf-8 #df-cp437
veryfi

Module for communicating with the Veryfi OCR API

v1.0.0 #api #veryfi #api-key #document
iasthk

Harvard-Kyoto to IAST conversion

v1.0.1 #iasthk #convert
txttyp

Formatted string typewriter

v0.1.2 #txttyp #text #command-line #typewriter #format #string #style #cargo
unicode-utf8

that converts utf-8 bytes to a unicode scalar value, and vice versa

v0.1.3 #versa #utf-8 #unicode
morc

Dead simple, minimal markdown generator library written in Rust

v0.0.2 #markdown #library #md #generator #readme
mime-rs

A text processing framework, inspired by Emacs lisp and keyboard macros

v0.3.0 #scripting #text-processing #mime-rs #cpp
jellybean-pack-0

Sweet syntax highlighting with tree-sitter

v0.0.2 #syntax-highlighting #highlight #tree-sitter
fast_aug

Fast data augmentation for text

v0.1.0 bin+lib #text #word #token #augmentation #base-augmenter #text-augment-parameters #nlp #python
opencc

binding for Rust

v0.3.0 #chinese #opencc #opencc-rs #bindings
buf-trait

abstract over [u8], str, and friends

v0.4.1 170 #binary #text #string #friends
cattocol

Combine two text into one text as columns

v0.3.1 #text #concat #format #column #combine-text
hline

a grep-like tool that highlights lines in files

v0.2.1 bin+lib #expression #hline #recording #file #filename #niche #stdin
character_text_splitter

splitting text into chunks with overlap, designed for handling large amounts of text efficiently. Implementation is identical to langchain's CharacterTextSplitter

v0.1.3 #character-text-splitter #character #chunks #split #size
lithe-cli

A cli of lithe

v0.0.3 app #cli #text #lithe
hina

:]

v0.1.3 nightly #hina #append #character
alpino-tokenize

Wrapper around the Alpino tokenizer for Dutch

v0.4.0 app #tokenize #alpino-tokenizer #dutch
emojicons

Parse :emoji: notation to unicode representation

v1.0.1 #emoji #emojicons #cat
contractions

expand contractions in English

v0.5.4 1.5K #nlp #pre-processor #contractions #language
quoted-string-parser

Quoted string parser for grammar defined in RFC3261

v0.1.0 10K #parser #rfc-3261 #quoted-string #white-space
pta-generator

Test data generator for PTA applications

v25.5.1 180 app #plain-text-accounting #testing #generator #beancount #tackler #ledger #journal #applications #accounting #audit
sprinkles

Randomly colors input text and outputs it to the terminal

v1.0.0 bin+lib #text #pretty-print #format #cli #command-line-utilities
xsv

A high performance CSV command line toolkit

v0.13.0 1.1K app #csv #csv-tsv #tsv #slice #command
parser-cli

Command-line interface for extracting text from various file formats

v0.1.3 bin+lib #pdf #docx #parser #text-extraction #cli-parser #format
bytepiece_rs

The Bytepiece Tokenizer Implemented in Rust

v0.2.2 #tokenize #nlp #bytepiece #deep-learning #tokenizer
yozuk-sdk

Types used in the Yozuk ecosystem

v0.22.11 #yozuk #ecosystem #sdk #telegram-bot #chat-bot
vaporetto_tantivy

Vaporetto Tokenizer for Tantivy

v0.22.3 650 #tokenize #tantivy #japanese
ast-grep-language

Search and Rewrite code at large scale using precise AST pattern

v0.38.2 3.2K #search-pattern #pattern #codemod #rewrite #ast #search
maybe_utf8

Byte container optionally encoded as UTF-8

v0.2.3 nightly #utf-8 #container #string
pulldown-cmark-fork

A pull parser for CommonMark

v0.5.2 bin+lib #common-mark #markdown #pulldown-cmark #block #parser
markdown2unicode

Converter from markdown notation to unicode characters

v0.2.1 bin+lib #character #string #markdown2unicode #characters #unicode #strong
tiniestsegmenter

Compact Japanese segmenter

v0.3.0 #tokenize #japanese #nlp #ngrams
vl-convert-pdf

convert SVG to PDF with embedded text

v1.4.0 #svg-pdf #pdf #svg #text
rustrawi

Rust port of the original PHP Sastrawi

v0.1.2 #tokenize #nlp #stem #sastrawi #stopword
chisel-parsers

Chisel parser front ends

v1.1.0 #parser #chisel-parsers #end #testing #workspace
varcon-core

Varcon-relevant data structures

v5.0.2 11K #typos-cli #varcon-core #checker #structures #spell-check #monorepo #pr #typo
fmtm_ytmimi_markdown_fmt

Fork of @ytmimi's Markdown formatter; powers FMTM

v0.0.3 200 #common-mark #formatter #markdown #markdown-formatter #list
worcher

full-text search for static websites

v0.1.2 #full-text-search #worcher #regex #search #text-search
modit

Modal editor parser

v0.1.5 800 #parser #modit
fnew

A Unicode-aware line-oriented drop-in replacement for coreutils' fold

v1.0.1 app #coreutils #fold #text-processing #command-line-tool
quick_io

facilitate input and output within programs, with a set of macros

v2.0.0 #quick #quick-io #character #down #right #write #mv #addstr #20 #10
cjieba-sys

unsafe ffi to cppjieba

v0.1.1 sys #nlp #chinese #segmentation #cppjieba #rust-jieba
kytea-tokenizer

Wrapper of tokenization by KyTea

v0.10.0 #japanese #morphological #japanese-morphological #analyzer #kytea
uwu_cli

uwuifying the terminal

v1.0.0 app #owo #uwu #cli #terminal #file
aki-txpr-macro

the more easy to use libaki-*

v0.1.5 #thread #fifo #pipe #filter
vaporetto_rules

Rule-base filters for Vaporetto

v0.6.5 1.1K no-std #japanese #tokenize #analyzer #morphological
tfidf-summarizer

Basic tf-idf compute for documents

v2.0.0 #nlp #document #summarizer #text-processing #documents
norm-email

strip email provider defined behaviour from email addresses

v0.1.0 #emoji #unicode #addresses
fbihtax

CLI tool to help manage tax payments in FBiH (Bosnia and Herzegovina Federation)

v0.3.2 bin+lib #federation #pdf #fbihtax #forms #testing #breakdown
mdbook-chapter-number

A mdBook preprocessor that adds chapter numbers to the each page header

v0.1.2 app #mdbook-preprocessor #mdbook #markdown #mdbook-pre-processor #header
yeslogic-unicode-blocks

Functions to access and search Unicode blocks

v0.2.0 190 no-std #block #cjk #character #unicode
typeline_ext_sqlite

sqlite integration for typeline

v0.1.0 #stream #pipeline #shell #tl
charwise

This lightweight, dependency-free rust library provides a convenient way to read characters from different resources

v1.0.1 #buffering #lexer #stream #character #peek
quill_delta_pdf

Convert Quill Delta to PDF

v0.1.4 #pdf #delta #quill #convert #quilljs
quartz_commands

Generates a parser at compile-time for handling commands similar in structure to those of Minecraft

v0.1.0 #cli-parser #command #parser #minecraft #command-line-tool #cli-command
tadm

A collection of algorithms and data structures wrote out while reading The Algorithm Design Manual book

v0.1.1 #book #tadm #sorting #snippets #manual
asciify

converting images to a readable format on the command line

v0.1.6 #ascii #image #line
anagram

A collection of anagram utility functions

v0.4.0 #anagrams #word #function #occurences #cool
kilo

small, fast utility crate/library for manipulating strings and generating sourcemaps with all in Magic 🪄

v0.1.0 #source-map #kilo #parser #magic-string #string-manipulating
latex

An ergonomic library for programatically generating LaTeX documents and reports

v0.3.1 110 #pdf-report #latex #report #pdf #generation #tex #paragraph #section
unicode-character-database

Unicode character database tables (Unicode Standard Annex #44) generated using ucd-generate

v0.1.0 #unicode #ucd #tr44 #unicode-text #text
lindera-sqlite

Lindera tokenizer for SQLite FTS5 extention

v0.42.2 150 #morphological-analysis #sqlite #library
rustyword

An anagram finder

v0.1.0 app #word #letter #cli #command-line-utilites
mupdf-sys

Rust FFI binding to MuPDF

v0.5.0 2.1K sys #pdf #mupdf #mupdf-sys #mupdf-wrapper #progress
yozuk-model

NLP model generator for Yozuk

v0.22.11 #yozuk #yozuk-model #model
suffix

arrays

v1.3.0 7.0K bin+lib #suffix #search-index #search #text-search #text #index #suffix-table
cmdcjones_minigrep

A minimal grep clone from the Rust Book

v0.1.0 bin+lib #book #mini-grep #cmdcjones-minigrep
case_convert

Converts the first letter of a Rust String to uppercase

v0.1.0 #string #case-convert #case #convert
glyphana

Quickly find, inspect & collect unicode glyps

v0.1.4 nightly app #glyphana #character #glyph #glyps #search #unicode #egui #typography #viewer
docstring

manipulating and parsing documentation strings

v0.2.4 #documentation #doc-string #move-idl
simple-word-count

word count function, try to get same result with Microsoft Office Word application

v0.1.1 #word-count #word-counter #simple-word-count #word #count #counter
lindera-compress

A morphological analysis library

v0.32.3 11K #morphological-analysis #library #compression #morphological #analysis
textos

Texts, strings, formatting, unicode…

v0.0.3 no-std #string #unicode #unicode-text #no-alloc #text
lindera-cc-cedict

A Japanese morphological dictionary for CC-CEDICT

v0.42.3 24K #cc-cedict #morphological #dictionary #chinese
mathml-latex

Convert between MathML and LaTeX

v0.0.3 #latex #mathml #mathml-latex #convert #commit #monorepo
recode_rs

Command-line tool for converting between the character encodings defined in the Encoding Standard

v1.0.6 app #unicode #charset #recode-rs
llmvm-core-lib

llmvm core application

v1.1.4 #artificial-intelligence #llm #api-bindings #thread #back-end #preset #workspace #template #ai
word_iter

Iterator over all words in a string

v0.2.1 #iterator #string #word
fontspector

Quality control for OpenType fonts

v1.0.2 450 app #font #profile #fontspector #below #component
mdbook-infisearch

InfiSearch plugin for Mdbook

v0.10.1 app #infisearch #mdbook-infisearch #mdbook #search #static-site
ascii_utils

handle ASCII characters

v0.9.3 688K #ascii #character #ascii-utils #characters
xsystem

Conversion between the Esperanto x-system and Unicode circumflexes

v0.1.0 #esperanto #xsystem #character #unicode-chars #x-to-unicode
morsels_lang_ascii

Basic ascii tokenizer for morsels

v0.7.3 #ascii #language #morsels-lang-ascii #package
fst-subseq-ascii-caseless

An automaton that matches if the input contains a specific subsequence ignoring ASCII case to be used with fst

v0.1.1 #fst #search #ascii #subseq #caseless
lingua-dutch-language-model

The Dutch language model for Lingua, an accurate natural language detection library

v1.2.0 13K #language-recognition #lingua #language-detection #nlp
pdf_encoding

Font related encodings

v0.4.0 340 #encoding #pdf #pdf-encoding #system
file-search

File indexing and search

v0.1.11 app #file-search #search #search-index #pdf #endpoint
aki-json-pick

The json pick out command

v0.1.10 bin+lib #json #text #filter
mdbook-compress

Compress an mdBook project into a single PDF file

v0.2.1 app #mdbook #pdf #rust-book #book #compression
perlin

A lazy, zero-allocation and data-agnostic Information Retrieval library

v0.1.0 #information-retrieval #search-engine #text #search
ttf_word_wrap

Wraps text based on character width

v0.5.0 #word-wrap #font #wrap #word #string
unicode_skeleton

detects unicode strings that look nearly identical once rendered, but do not compare as equal. It defines "confusable" and "skeleton" based on Unicode Standard Annex #39

v0.1.1 #confusable #skeleton #unicode #unicode-text #text
cautious-octo-funicular

Test: shipping an mdbook with API docs

v0.1.5 #documentation #cautious-octo-funicular #cautious #docs #book
findtext_sheet

Search text in SpreadSheet

v0.1.2 bin+lib #xlsx #search #text-search #excel #text #cli
lindera-ipadic-neologd

A Japanese morphological dictionary for IPADIC NEologd

v0.42.4 22K #morphological #japanese #japanese-morphological #dictionary #neologd #ipadic
textframe

query plain text documents by unicode offset without loading them all into memory

v0.3.0 220 #linguistics #text-processing #standoff #text
argot

Parse documentation from codebases into Markdown for easy doc creation

v0.2.2 app #file #argot #class #markdown #language #action #name #variables
with-str-bytes

Safely manipulate the bytes of a UTF-8 string

v1.0.0 no-std #ascii-text #string #utf-8 #ascii-string #byte #ascii #safe
hsk

Return HSK Level for Simplified Chinese Characters

v0.1.1 #chinese #hsk #hanzi #character
rtlicious

A nom-based parser for Yosys RTLIL files

v0.1.1 #rtlicious #eol #parser #design #testing #end
jpreprocess-dictionary

Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)

v0.12.0 170 bin+lib #open-j-talk #text-to-speech #library
rough

A very simple and opinionated static site generator

v0.2.0 app #rough #format #static-site-generator #markdown
stamd

Webservice for working with stand-off annotations on text (STAM)

v0.1.0 app #annotations #nlp #linguistics #standoff #text-processing #annotation
rusty_code_code_for_book

my book_rusty code

v1.1.2 app #book #rusty-code-code-for-book #for
catmark

Console printer for CommonMark

v0.2.2 app #common-mark #terminal #catmark #syntax-highlighting #ansi
minigrep_flict

Simplest text-in-file search engine from rust book

v0.1.1 bin+lib #mini-grep #book #minigrep-flict #engine
lorgn_lang

a general purpose scripting language optimized for graphical programming

v0.1.0 #lorgn #language #lorgn-lang #notation
textr

TeX-inspired plug-n-play interface for converting JSON documents into PDFs

v0.3.0 #pdf #textr #identifier
fontconfig-rs

Safe, higher-level wrapper around the fontconfig library

v0.1.1 370 #fontconfig #wrapper #font #search
mdbook-files

Preprocessor for mdbook which renders files from a directory as an interactive widget

v0.2.0 bin+lib #mdbook #mdbook-files #widgets #serve #book
stfu

Shut The Ferris Up - profanity filtering for Rust

v0.1.0 #word #bad #filter #censor #profanity #act #words
ab-radix-trie

A compressed radix trie implementation supporting matching rules

v0.2.1 #trie #ab-radix-trie #character #rules #radix-trie
text_to_emoji

Convert text to emoji

v0.1.0 #emoji #text #convert #rust
password-characters

help with the "enter the 12th, 35th, and 63rd characters from your password" situations

v1.0.1 app #character #situations #password #characters
md-dir-builder

Webserver for serving all markdown files in a directory

v0.3.1 app #builder #directory #md-dir-builder #run #parser
mdbook-trace

A traceable document preprocessor for mdbook

v0.1.1 app #mdbook-preprocessor #mdbook #mdbook-pre-processor
perspicuity_formula

Calculate Flesh Reading Ease for a given text and language

v0.1.0 #nlp #readability #flesh #text-analysis #language
rigrep

grep from Rust Book

v1.0.1 bin+lib #rigrep #grep #rust #search #command-line-tool #unix-command
twemoji-rs

A word-cloud image generation crate

v0.1.2 #emoji #unicode #icons #image
txtframe

Creates a frame for text

v0.4.0 #frame #text #format #width #fill #top-line #left-top #right-top #left-btm #right-btm
overlap

shows overlap text in files

v0.0.2 bin+lib #text #cli #overlap
noodler

A port of the python-ngram project that provides fuzzy search using N-gram

v0.1.0 190 #ngrams #fuzzy #shingles #search
mdbook-mathpunc

An mdbook preprocessor that prevents line breaks between inline math blocks and punctuation marks when using katex

v0.2.0 bin+lib #mdbook-preprocessor #mdbook #mdbook-pre-processor #katex #punctuation
japanese-ruby-filter

Japanese ruby notation parser

v0.1.0 #japanese-ruby #ruby #japanese-ruby-filter #text #parser #pulldown-cmark
spyglass

Search engine for documents, inspired by bioinformatics

v1.1.0 #spyglass #wildcard #character #bioinformatics #regex #distance
token-counter

wc for tokens: count tokens in files with HF Tokenizers

v0.1.0 app #tokenize #nlp #token-counter #tokenizer #stdin #pattern #count
poetry-book

Create a poetry book in latex, starting from plain text

v0.1.3 #book #poetry #latex #poem #verse
decline-word

Choose word form based on given number

v0.1.2 #word #numbers #decline
masker

Mask patterns in data

v0.0.4 #text-search #text #utility #search #data
merge_pdf

Merge PDF files in a directory

v0.1.0 app #directory #pdf #merge
lexmatch

lexicon matching tool that, given a lexicon of words or phrases, identifies all matches in a given target text. Uses suffix arrays.

v0.3.0 app #nlp #lexmatch #text-processing
jellybean-pack-1

Sweet syntax highlighting with tree-sitter

v0.0.2 #syntax-highlighting #highlight #tree-sitter
bgrep

grep tailored to handle binary patterns and files

v1.0.0 app #grep #regex #binary #search-pattern #pattern
rckive-genpdf

User-friendly PDF generator written in pure Rust

v0.4.0 #pdf #text-layout #element #family #page #text #table #file #system #document
encoding_c_mem

C API for encoding_rs::mem

v0.2.6 14K sys #unicode #c-api #charset #ffi
unidok

A powerful, readable, easy-to-learn markup language

v0.2.0 app #common-mark #unidok #markdown #asciidoc #language
emojito

Find all the Emoji in a string. Supports composed emoji.

v0.3.5 130 #emoji #string-search #search #unicode #string
compiler-tools

A proc-macro for deriving powerful and fast tokenizers with compile-time regex

v0.2.0 4.2K #parser-generator #compiler #regex #parser #generator
trexter

Text progression tracking library

v0.1.1 #text-processing #trexter #unit
termbook

behind the termbook-cli

v1.4.2 #markdown #terminal #common-mark #mdbook
html_to_pdf_lib

converting HTML to PDF

v0.1.2 bin+lib #pdf #html #lib #repository #sh
mdtranslation-cli

Command-line tools for using mdTranslation, which can be used to prepare multi-lingual Markdown documents

v0.1.0 app #translation #markdown #localization #common-mark #document
utf8reader

wrapper around Reader that returns a stream of UTF-8 characters

v0.1.0 #character #utf8reader #reader #access #code-point #characters
meaningsearch

package that helps you find meaningful lines of any given input. Especially useful in CTFs.

v0.1.4 app #tool #ctf #search #file
asimov-repository-cli

ASIMOV Repository Command-Line Interface (CLI)

v25.0.0-dev.0 app #asimov #cli #artificial-intelligence
chapter-8-exercises

Exercises from the 8th chapter of the book

v0.1.0 app #book #chapter #chapter-8-exercises
tashkil

A lightweight library for removing Arabic diacritics

v0.1.0 #diacritics #arabic #language #dari #pashto
asciir

Print ASCII table/values

v0.1.0 bin+lib #table-values #asciir #file #character
json-peek

Amature JSON parser library designed for my specific need

v0.0.2 nightly #json #peek #json-peek #parser
swappy

An anagram generator

v0.3.0 app #anagrams #language #swappy #generator #eyes #mugs #murals #wintergreen
pdf_form

programatically filling out pdf forms

v0.4.0 #forms #pdf #field
pdf_composer_base

PDF Composer base functionality crate

v0.3.0 #markdown #pdf #composer #yaml #generate #margin
books_description_parser

A Rust-based parser to extract book details from structured markdown-like text and output them in formats like JSON or Rust structs for further processing

v0.1.0 bin+lib #book #parser #description #grammar
ddvm

Document to Document Virtual Machine

v0.1.0 app #ddvm #ast #html #pdf #pdf-to-html #pdf-converter
slicer

that slices string slices into smaller string slices

v0.1.1 #slice #string #parser #as-slicer #skip-over
transliterate1234

UTF-8 to ASCII transliteration

v0.1.1 #transliteration #ascii #unicode-characters #unicode #character
llmvm-outsource-lib

outsource backend for llmvm

v1.3.1 #artificial-intelligence #hugging-face #openai #llm #api-bindings
static_format

Format strings with no runtime overhead

v0.0.3 macro no-std #const-format #const #format #no-std
pix-brcode

A ready to use compliant PIX specification, featuring fast de/serialization

v0.1.0 #brcode #pix #emv-qrcps #pdf #pix-toolbelt
iterlower

Final-sigma-correct lowercasing iterator adapter with option for Turkish/Azeri I behavior

v1.0.1 #greek #unicode #azeri #turkish
ewts-c

Converter from EWTS (Extended Wylie Transliteration Scheme) to Tibetan Unicode symbols (c lib)

v0.1.0 #converter #ewts #tibetan #localization
STKLR

STKLR: pronounced 'stickler'. Is a cli tool to automatically link functions, enums, structs, traits etc in rust-doc docstrings. I couldn't find a tool like this when I needed it so... here we are.

v0.0.42 bin+lib #documentation #search-pattern #pattern #sed #rustdoc #docs #search
minigrep_desonglll

grep implementation from The Rust Programing Book

v0.1.2 bin+lib #book #query #mini-grep #txt
lingua-russian-language-model

The Russian language model for Lingua, an accurate natural language detection library

v1.2.0 12K #language-recognition #lingua #language-detection #nlp
asimov-core

ASIMOV Software Development Kit (SDK) for Rust

v24.0.0-dev.22 no-std #asimov #sdk #artificial-intelligence
find-simdoc

Time- and memory-efficient all pairs similarity searches in documents

v0.1.1 500 #similarity-search #all-pairs #lsh #similarity #search
ascii_set

Fast membership of ASCII character classes

v0.1.0 #ascii #membership #character #class #set #testing #representing
openlibrary-rs

A wrapper around openlibrary's Web API

v0.3.1 #book #ebook #openlibrary #api-bindings #author #books #search
nutrimatic

Tools for reading Nutrimatic (https://nutrimatic.org) index files

v0.1.1 #language #trie #node
h_hangul

Korean Characters

v0.1.0 bin+lib #character #hangul #h-hangul #characters
bqrs

apply boolean query to text

v0.1.3 #text-search #text #query #search #boolean #match
indexrs

inefficient multi-language search index

v0.5.0 #search-index #full-text-search #search #index #text-search
simplearrayhash

v0.1.1 #string-search #hash-table #search #string #key
tex

The νTeX typesetting engine

v0.1.1 bin+lib #typesetting #latex #engine #format
mnumonic

A tiny library to convert opaque binary data to and from a human-memorable phrase

v0.2.0 #human-readable #word #convert #encode #words
grep-table-converter

A cli utility to convert grep result to table (csv, markdown, textile)

v0.0.3 bin+lib #grep #grep-table-converter #csv #markdown #line #filename #textile #file #testing
saku

efficient rule-based Japanese Sentence Tokenizer

v0.1.6 #tokenize #saku #tokenizer #python-bindings #japanese #nlp
pattern-3

Needle API (née Pattern API 3.0), generalization of std::str::pattern

v0.5.0 nightly no-std #pattern-3 #pattern #search #experimental
lindera-assets

A helper crate to fetch assets and build dictionary for lindera

v0.32.3 8.5K #morphological #japanese #japanese-morphological #dictionary #assets
llmvm-chat

An llmvm frontend that acts as a CLI chat interface

v0.1.1 app #artificial-intelligence #llm #llmvm #chat #demo #ai
tpng

A small tool that prints truecolor png renderings to the terminal using unicode block characters

v0.1.6 bin+lib #tpng #character #true-color #characters
align_text

Aligns lines in a block of text within a number of columns

v1.0.0 #pretty-print #text #format
nipah_tokenizer

A powerful yet simple text tokenizer for your everyday needs!

v0.1.0 #tokenize #nlp #text #tokenizer #word #words
names-changer

Convert a names of sql schemes from camelcase to snake case

v0.2.1 #name #text #changer #parser
allsorts_no_std

Font parser, shaping engine, and subsetter for OpenType, WOFF, and WOFF2

v0.5.2 no-std #true-type-font #opentype #font-shaping #parser #font #shaping #opentype-font #true-type
minigrep_lswarss

A very small part of Unix/Linux tool grep made with Rust for learning purpose while reading and studying the Rust Book

v0.1.0 bin+lib #mini-grep #book #minigrep-lswarss
code-span

Add additional infomation to code character

v0.2.0 #spans #character #code-span #code-view
codebook

A code-aware spell checker library (dependency for codebook-lsp)

v0.1.0 160 bin+lib #language #autocomplete #lsp #code
minigrep_bakedspacetime

Minimal Rust implementation of grep based on The Book

v0.1.0 bin+lib #mini-grep #book #minigrep-bakedspacetime #string
unic-ucd-common

UNIC — Unicode Character Database — Common Properties

v0.9.0 8.5K #alphabetic #numeric #character-property #unicode #unicode-text #text
emoji_converter

Converts text to emojis

v0.1.0 #emoji #converter #unicode #rust #unicode-text #text
jpreprocess-dictionary-builder

Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)

v0.10.0 #open-j-talk #text-to-speech #library
gbx

GBX (Grundbuch-Exchange) Dateiformat

v1.0.1 #dateiformat #gbx #pdf
alphabet-encoder

A quick and dirty way to deal with escape characters

v0.1.1 #character #alphabet #alphabet-encoder #characters
jellybean-pack-2

Sweet syntax highlighting with tree-sitter

v0.0.2 #syntax-highlighting #highlight #tree-sitter
escaped-delimiter

Iterator of delimited slices with escape characters

v0.1.0 #escaping #iterator #text #character #character-escaping
typeline_ext_http

http(s) tooling for typeline

v0.1.0 #stream #shell #pipeline #tl
unic-ucd-name

UNIC — Unicode Character Database — Name

v0.9.0 4.1K #name #character-property #unicode #unicode-text #text
conveyance

A stop-gap CLI for conveyancing

v0.1.3 app #docx #xml #word
raekna-parser

code needed to parse string slices into Expressions that can later be evaluated

v0.2.1 #raekna #parser #raekna-parser
flesh-reading-ease

Calculate Flesh Reading Ease for a given text and language

v0.1.0 #nlp #readability #flesh #language #text-analysis
font-map-core

Core font-parsing capabilities for font-map

v0.2.9 #font #true-type-font #svg #macro #api-bindings #true-type #preview
panduck-latex

Use panduck to generate XeLaTeX

v0.1.0 #panduck #latex #panduck-latex #text #xe-la-te-x #format #tool
rust-jieba

Rust binding to cppjieba

v0.1.0 #nlp #segmentation #chinese #cppjieba
lingua-persian-language-model

The Persian language model for Lingua, an accurate natural language detection library

v1.2.0 12K #language-recognition #lingua #language-detection #nlp
pdf_forms

programatically filling out pdf forms

v0.3.4 #forms #pdf #field #pdf-form
lindera-dictionary-builder

Shared code for building Lindera dictionary files

v0.32.3 9.6K #morphological #japanese #builder #dictionary #unidic
fum

fum finds fuzzy matches to a literal search pattern, searching recursively through all the files in the current directory and respecting gitignore rules

v0.1.0 app #pattern #fuzzy-search #literals #search #trigram
txt_processor

A little library for text processing

v0.1.4 #text-processing #processing #txt #file #text #filter #index #regex
infisearch_lang_ascii

Basic ascii tokenizer for InfiSearch

v0.10.1 #ascii #language #infisearch-lang-ascii #package
boxy

Declarative builder for Unicode box-drawing characters

v0.1.0 no-std #unicode #tui #character #no-std
writedown

format parser

v0.1.0 nightly #writedown #parser
genpdfi

User-friendly PDF generator written in pure Rust

v0.2.1 #pdf #text-layout #text #layout
xmldecl

Extracts an encoding from an ASCII-based bogo-XML declaration in text/html in a Web-compatible way

v0.2.0 1.0K #charset #unicode #web
asimov-patterns

ASIMOV Software Development Kit (SDK) for Rust

v25.0.0-dev.7 390 no-std #asimov #sdk #artificial-intelligence
unicode_clusters

variable width unicode characters as single items, allowing for array like indexing etc

v0.1.2 #unicode #character #grapheme #unicode-text #cluster #text
bookbinder

Produce books in various formats from markdown, with some understanding of structural semantics and rendering options

v0.1.0 bin+lib #author #epub #markdown #pdf #deserialize #latex-options
pdftotext

High-level library that binds to Poppler to extract text from a PDF

v0.1.5 #pdf #text #poppler #api-bindings
unic-common

UNIC — Common Utilities

v0.9.0 991K #unicode-version #unicode #utilities #unic #version
lipsum-cn

Pseudo-Chinese lorem ipsum generator

v1.0.1 #generator #lipsum-cn #lipsum #format #text #pandoc
guarding_parser

Guarding is a guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.

v0.2.6 #guarding #parser #guarding-parser
wkhtmltox-sys

FFI bindings to wkhtmltox

v0.1.2 950 #pdf #html #wkhtmltox #wkhtmltoimage #wkhtmltopdf
shift_or_euc

Detects among the Japanese legacy encodings

v0.1.0 #charset #web #shift-jis
folia

High-performance library for handling the FoLiA XML format (Format for Linguistic Annotation)

v0.0.6 bin+lib #annotations #nlp #xml #linguistics #text-processing #annotation
bisect

search stdin based on a bitstring pattern

v0.2.0 app #text-search #text #unix #search #command-line-tool
mini-grep

A test crate with mini grep as in The Book

v0.1.3 #mini-grep #grep #book
indentation_flattener

From indented input, generate plain output with indentation PUSH and POP codes

v0.1.0 #tokenize #indentation #flattener #tokenizer #parser
tectonic_pdf_io

Xdvipdfmx’s PDF, XDV, and image I/O APIs in C, as a Rust crate

v0.4.1 600 sys #tectonic #tectonic-pdf-io #pdf #typesetting #path #single #component #unused-imports #tex #import
lix-score

Calculate LIX score for a given text and language

v0.1.0 #nlp #readability #lix #text-analysis #language
rsonpath-test-codegen

Blazing fast JSONPath query engine powered by SIMD. TOML-based test codegen for rsonpath-lib.

v0.5.1 #json-path #query #simd #parser #json
lingua-vietnamese-language-model

The Vietnamese language model for Lingua, an accurate natural language detection library

v1.2.0 12K #language-recognition #lingua #language-detection #nlp
lindera-analyzer

A morphological analysis library

v0.32.3 9.7K #morphological-analysis #library #tokenize #morphological #analysis
pdf-annotations-converter

Converts annotations found in PDF files to different formats

v0.2.0 bin+lib #annotations #md #pdf
textract

extract text from various types of files

v0.1.0 bin+lib #pdf #textract #archive
xgrepx

xgrep is a rust implementation of grep. This is a follow up from the rust book

v0.1.0 bin+lib #xgrepx #book #search #xgrep #txt
string-sections

Build tool for Leptos

v0.1.0 230 #section #string #line #iterator #parser #parsing-tools
yeslogic-fontconfig

RENAMED: use the fontconfig crate instead

v0.1.1 #fontconfig #wrapper #font #search
grep-searcher

Fast line oriented regex searching as a library

v0.1.14 128K #grep #regex #search-pattern #pattern
typeline_ext_python

python integration for typeline

v0.1.0 #stream #pipeline #shell #tl
jpreprocess-naist-jdic

Japanese text preprocessor for Text-to-Speech application (OpenJTalk rewrite in rust language)

v0.12.0 170 #open-j-talk #text-to-speech #library
ssml-parser

parsing speech sythnesis markup language

v0.1.4 #parser #ssml #language
minigrep_crate

grep like console application

v0.1.0 bin+lib #applications #mini-grep #case-insensitive #book #terminal
unic-ucd-case

UNIC — Unicode Character Database — Case Properties

v0.9.0 3.5K #case-folding #character-property #unic #unicode #unicode-text #text
md2pdf

A small utility to convert markdown files to pdf exploiting tectonic

v0.0.3 bin+lib #md2pdf #pdf
lindera-tokenizer

A morphological analysis library

v0.32.3 11K #morphological-analysis #tokenize #library #tokenizer #morphological
lingua-indonesian-language-model

The Indonesian language model for Lingua, an accurate natural language detection library

v1.2.0 12K #language-recognition #lingua #language-detection #nlp
hello_rust_lang_book_chpater_20

rust lang book chapter 20

v1.0.0 bin+lib #language #html #book #20
grep-pcre2

Use PCRE2 with the 'grep' crate

v0.1.8 16K #grep #look #regex #pcre #backreference
ruby-parser

A parser for the Ruby language

v0.0.0-dev1 #parser #array #escaping #language #input #background #mri
lindera-core

A morphological analysis library

v0.33.0 12K #morphological-analysis #library #lindera #morphological #cc-cedict #analysis #reference #ko-dic #ipadic #tokenize
gulpeaseindex

Calculate Gulpease index for a given text and language

v0.1.0 #nlp #readability #gulpease #text-analysis #language
findtext_pdf

Search text in PDF

v0.1.2 bin+lib #pdf #text-search #search #text #cli
rusty_word_builder

Syllable and Word generation library written fully in Rust

v0.6.3 #word #syllable #language #linguistics #conlang
grep-regex

Use Rust's regex library with the 'grep' crate

v0.1.13 120K #grep #regex #search-pattern #line #pattern

Next page?

regex

unicode-width

comfy-table

textwrap

encoding_rs

similar

const_format

heck

fancy-regex

tabled

convert_case

pulldown-cmark

unicode-normalization

deunicode

lazy-regex

rustybuzz

unicode-segmentation

onig

emojis

lopdf

termimad

widestring

unicase

mdbook

prettydiff

regress

html2text

unicode-bidi

unicode-general-category

pulldown-cmark-to-cmark

const-str

mdxjs

linkify

fuzzy-matcher

printpdf

lindera

finl_unicode

charabia

garde

diff

roff

text-splitter

titlecase

synoptic

lngcnv

unicode-script

diffy

text-size

Inflector

str_indices

smartcat

usearch

ascii

os_display

nucleo

arrow-cast

unicode_names2

chardetng

xan

entities

pact_consumer

route-recognizer

cruet

line-index

wana_kana

autocorrect

mdbook-katex

jieba-rs

zeitgrep

stringsext

unicode-case-mapping

ferris-says

spellbook

textsurf

epub-builder

unindent

regex-cursor

htmd

repoyank

decancer