-
comfy-table
An easy to use library for building beautiful tables with automatic content wrapping
-
textwrap
Library for word wrapping, indenting, and dedenting strings. Has optional support for Unicode and emojis as well as machine hyphenation.
-
unicode-segmentation
This crate provides Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules
-
encoding_rs
A Gecko-oriented implementation of the Encoding Standard
-
const_format
Compile-time string formatting
-
unicode-xid
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
heck
heck is a case conversion library
-
fancy-regex
An implementation of regexes, supporting a relatively rich set of features, including backreferences and look-around
-
lazy-regex
lazy static regular expressions checked at compile time
-
widestring
A wide string Rust library for converting to and from wide strings, such as those often used in Windows API or other FFI libaries. Both
u16
andu32
string types are provided, including support for UTF-16 and UTF-32… -
similar
A diff library for Rust
-
ropey
A fast and robust text rope for Rust
-
pulldown-cmark
A pull parser for CommonMark
-
convert_case
Convert strings into any case
-
ascii
ASCII-only equivalents to
char
,str
andString
-
mdbook
Creates a book from markdown files
-
comrak
A 100% CommonMark-compatible GitHub Flavored Markdown parser and formatter
-
tabled
An easy to use library for pretty print tables of Rust
struct
s andenum
s -
onig
Rust-Onig is a set of Rust bindings for the Oniguruma regular expression library. Oniguruma is a modern regex library with support for multiple character encodings and regex syntaxes.
-
diff
An LCS based slice and string diffing implementation
-
deunicode
Convert Unicode strings to pure ASCII by intelligently transliterating them. Suppors Emoji and Chinese.
-
unicode-normalization
This crate provides functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
dissimilar
Diff library with semantic cleanup, based on Google's diff-match-patch
-
encoding
Character encoding support for Rust
-
rustybuzz
A complete harfbuzz shaping algorithm port to Rust
-
Inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
prettydiff
Side-by-side diff for two files
-
pretty
Wadler-style pretty-printing combinators in Rust
-
lindera-cli
A morphological analysis command line interface
-
regex
An implementation of regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
ab_glyph
API for loading, scaling, positioning and rasterizing OpenType font glyphs
-
termimad
Markdown Renderer for the Terminal
-
bstr
A string type that is not required to be valid UTF-8
-
html2text
Render HTML as plain text
-
substring
A substring method for string types
-
lindera
A morphological analysis library
-
lngcnv
linguistics: display pronunciation, translate between dialects, convert between orthographies; support for multiple languages: English, Latin, Polish, Quechua, Spanish, Tikuna
-
tokenizers
Provides an implementation of today's most used tokenizers, with a focus on performances and versatility
-
annotate-snippets
Library for building code annotations
-
typos-cli
Source Code Spelling Correction
-
daachorse
Daachorse: Double-Array Aho-Corasick
-
linkify
Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.
-
lopdf
A Rust library for PDF document manipulation
-
fontdb
A simple, in-memory font database with CSS-like queries
-
pulldown-cmark-to-cmark
Convert pulldown-cmark Events back to the string they were parsed from
-
ra_ap_syntax
Comment and whitespace preserving parser for the Rust language
-
lipsum
Lipsum is a lorem ipsum text generation library. It generates pseudo-random Latin text. Use this if you need filler or dummy text for your application. The text is generated using a simple Markov chain…
-
unicode-script
This crate exposes the Unicode
Script
andScript_Extension
properties from UAX #24 -
unicase
A case-insensitive wrapper around strings
-
unicode_names2
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
stfu8
Sorta Text Format in UTF-8
-
mdxjs
Compile MDX to JavaScript in Rust
-
flexstr
A flexible, simple to use, immutable, clone-efficient
String
replacement for Rust -
mdbook-pdf
A backend for mdBook written in Rust for generating PDF based on headless chrome and Chrome DevTools Protocol
-
yeslogic-ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched with support for additional tables
-
hck
A sharp cut(1) clone
-
unicode-general-category
Fast lookup of the Unicode General Category property for char
-
text_io
really simple to use panicking input functions
-
wana_kana
Utility library for checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji
-
const-str
compile-time string operations
-
unicode-bidi
Implementation of the Unicode Bidirectional Algorithm
-
csml_engine
The CSML Engine is a conversational engine designed to make it extremely easy to create rich and powerful chatbots
-
emojic
Emoji constants
-
globset
Cross platform single glob and glob set matching. Glob set matching is the process of matching one or more glob patterns against a single candidate path simultaneously, and returning all of the globs that matched.
-
hgrep
hgrep is a grep tool with human-friendly search output. This is similar to
-C
option ofgrep
command, but its output is enhanced with syntax highlighting focusing on human readable outputs. -
mdbook-quiz
Interactive quizzes for your mdBook
-
noto-sans-mono-bitmap
Provides pre-rasterized characters from the “Noto Sans Mono” font in different sizes and font weights for multiple unicode ranges. This crate is
no_std
and needs no allocations or floating… -
unicode-width
Determine displayed width of
char
andstr
types according to Unicode Standard Annex #11 rules -
array_tool
Helper methods for processing collections
-
hyphenation
Knuth-Liang hyphenation for a variety of languages
-
titlecase
A tool and library that capitalizes text according to a style defined by John Gruber for post titles on his website Daring Fireball
-
unic-ucd-version
UNIC — Unicode Character Database — Version
-
shell-escape
Escape characters that may have a special meaning in a shell
-
unindent
Remove a column of leading whitespace from a string
-
jieba-rs
The Jieba Chinese Word Segmentation Implemented in Rust
-
lingua
An accurate natural language detection library, suitable for long and short text alike
-
mdbook-graphviz
mdbook preprocessor to add graphviz support
-
uwc
Counts things in unicode text files
-
swash
Font introspection, complex text shaping and glyph rendering
-
igrepper
The interactive grepper
-
mdbook-compress
Compress an mdBook project into a single PDF file
-
mdbook-admonish
A preprocessor for mdbook to add Material Design admonishments
-
charabia
A simple library to detect the language, tokenize the text and normalize the tokens
-
difflib
Port of Python's difflib library to Rust
-
tengwar
Transliterate text into J.R.R. Tolkien's Tengwar.
-
dr
Command-line data file processing in Rust
-
svgbob_cli
Transform your ascii diagrams into happy little SVG
-
xkpwgen
Generate XKCD 936 passwords
-
emojis
✨ Lookup and iterate over emoji names, shortcodes, and groups
-
cedarwood
efficiently-updatable double-array trie in Rust (ported from cedar)
-
aho-corasick
Fast multiple substring searching
-
languagetool-rust
LanguageTool API bindings in Rust
-
mlc
The markup link checker (mlc) checks for broken links in markup files
-
pad
Library for padding strings at runtime
-
newline-converter
Newline byte converter library
-
unicode-id
Determine whether characters have the ID_Start or ID_Continue properties according to Unicode Standard Annex #31
-
jetscii
A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes
-
os_display
Display strings in a safe platform-appropriate way
-
bytelines
Read input lines as byte slices for high efficiency
-
ab_glyph_rasterizer
Coverage rasterization for lines, quadratic & cubic beziers
-
obsidian-export
Rust library and associated CLI program to export an Obsidian vault to regular Markdown
-
aarty
A simple CLI tool to convert the images to ASCII art
-
dprint-plugin-markdown
Markdown formatter for dprint
-
printpdf
Rust library for writing PDF files
-
fsays
A Rust flavored replacement for the classic cowsay
-
pager
Helps pipe your output through an external pager
-
pdf
PDF reader
-
ansi-to-tui
A library to convert ansi color coded text into tui::text::Text type from tui-rs library
-
uncased
Case-preserving, ASCII case-insensitive, no_std string types
-
vaporetto
Vaporetto: a pointwise prediction based tokenizer
-
aki-resort
sort lines of text. You can use regex to specify the KEY.
-
stl_io
STL input and output
-
synoptic
A simple, low-level, syntax highlighting library with unicode support
-
fuzzy-matcher
Fuzzy Matching Library
-
notnow
A terminal based task and TODO management software
-
chamkho
Khmer, Lao, Myanmar, and Thai word segmentation/breaking library and command line
-
any_ascii
Unicode to ASCII transliteration
-
slug
Convert a unicode string to a slug
-
wordshk_tools
A combination of parsers and other tools for words.hk (粵典)
-
mdbook-preprocessor-graphql-playground
A preprocessor for mdbook to add GraphQL playgrounds
-
shell2batch
Coverts simple basic shell scripts to windows batch scripts
-
qtrac-clc
clc (code line count) quickly counts newlines in code files
-
probly-search
A lightweight full-text search engine with a fully customizable scoring function
-
finl_unicode
Library for handling Unicode functionality for finl (categories and grapheme segmentation)
-
diffy
Tools for finding and manipulating differences between files
-
cruet
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
mdbook-tectonic
An mdbook backend for generating LaTeX and PDF documents
-
ob
A Blog and RSS system written in Rust
-
mdbook-linkcheck
A backend for
mdbook
which will check your links for you -
wz
Count words, fast
-
harfbuzz_rs
A high-level interface to HarfBuzz, exposing its most important functionality in a safe manner using Rust
-
molybdenum
Recursive search and replace CLI application
-
mdbook-svgbob
SvgBob mdbook preprocessor which swaps code-blocks with neat SVG
-
character_converter
Turn Traditional Chinese script ot Simplified Chinese script and vice-versa and tokenize
-
analiticcl
Analiticcl is an approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation
-
grok
A rust implementation of the popular java & ruby grok library which allows easy text and log file processing with composable patterns
-
chumsky
A parser library for humans with powerful error recovery
-
chord3
Create pdf songbooks from chopro source
-
mdbook-katex
mdBook preprocessor rendering LaTeX equations to HTML
-
frawk
an efficient Awk-like language
-
binstalk
The binstall toolkit (library interface)
-
vndb_tags_get
convert VNDB tag list (JSON to markdown)
-
thesaurus
An offline thesaurus library for Rust
-
r4d
Text oriented macro processor
-
streampager
streampager is a pager for command output or large files
-
somedoc
A very simple document model and markup generator
-
simsearch
A simple and lightweight fuzzy search engine that works in memory, searching for similar strings (a pun here)
-
random_ascii
A totally useless but 'must have' bloatware (tribute to systemd)
-
typos-dict
Source Code Spelling Correction
-
ferris-says
A Rust flavored replacement for the classic cowsay
-
easy_reader
A Rust library for easily navigating forward, backward or randomly through the lines of huge files
-
charfind
CharFind is an application for finding Unicode characters
-
gh-emoji
Convert
:emoji:
to Unicode using GitHub’s emoji names -
pomsky
A new regular expression language
-
mdbook-bibfile-referencing
An mdBook preprocessor to add bibfile referencing to each page
-
zine
A simple and opinionated tool to build your own magazine
-
rock-paper-scissor
Rock Paper Scissor game
-
tectonic_cfg_support
A build.rs support crate that helps deal with CARGO_CFG_TARGET_* variables. When cross-compiling, these variables must be used instead of constructs such as `cfg!(target_arch = …)…
-
binstalk-types
The binstall toolkit that contains basic types for binstalk crates
-
unicode-blocks
This crate contains a list of all unicode blocks and provides some functions to search across them
-
mdbook-dtmo
Creates a book from markdown files with added plugins
-
lindera-ko-dic-builder
A Korean morphological dictionary builder for ko-dic
-
yeslogic-fontconfig-sys
Raw bindings to Fontconfig without a vendored C library
-
termdiff
Write a diff with color codes to a string
-
colonnade
format tabular data for display
-
stop-words
Common stop words in many languages
-
jira-clean
A tool to clean up Jira task description that is an output of jira-cli tool
-
unicode_categories
Query Unicode category membership for chars
-
fasttext
fastText Rust binding
-
runiq
An efficient way to filter duplicate lines from input, à la uniq
-
rust-beam
A LaTeX slide generator you can write in faster than beamer
-
stringmetrics
Rust library for approximate string matching
-
hyperscan
Hyperscan bindings for Rust with Multiple Pattern and Streaming Scan
-
bcdown
Bilibili漫画下载器,written in Rust,支持epub pdf zip格式
-
binstalk-downloader
The binstall toolkit for downloading and extracting file
-
in_definite
Get the indefinite article ('a' or 'an') to match the given word. For example: an umbrella, a user.
-
translation-api-cn
Some useful structs for calling Chinese translation api cloud services. A helper tool for
bilingual
cmdline tool. -
mecab
Safe Rust wrapper for mecab a japanese language part-of-speech and morphological analyzer library
-
kathoey
Rust library for text feminization using open corpus linguistics data
-
mdbook-kroki-preprocessor
render kroki diagrams from files or code blocks in mdbook
-
instant-segment
Fast English word segmentation
-
sd
An intuitive find & replace CLI
-
rphonetic
Rust port of phonetic Apache commons-codec algorithms
-
str_indices
Count and convert between indexing schemes on string slices
-
mdbook-toc
mdbook preprocessor to add Table of Contents
-
epub
Library to support the reading of epub files
-
indenter
A formatter wrapper that indents the text, designed for error display impls
-
angry-purple-tiger
animal-based hash digests for humans
-
textwrap-macros
Simple procedural macros to use textwrap utilities at compile time
-
vi
An input method library for vietnamese IME
-
mdbook-template
A mdbook preprocessor that allows the re-usability of template files with dynamic arguments
-
lowcharts
Tool to draw low-resolution graphs in terminal
-
coap-numbers
Constants for the CoAP protocol
-
ra_ap_text_edit
TBD
-
fea-rs
Tools for working with Adobe OpenType Feature files
-
safe-regex-compiler
Regex compiler for the safe-regex crate
-
deno_ast
Source text parsing, lexing, and AST related functionality for Deno
-
rectangle-pack
A general purpose, deterministic bin packer designed to conform to any two or three dimensional use case
-
huggingface/tokenizers-python
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
-
serbzip
A quasi-lossless Balkanoidal meta-lingual compressor
-
ufofmt
A fast, flexible UFO source file formatter based on the Norad library
-
bard
Creates PDF and HTML songbooks out of easy-to-write Markdown sources
-
lemmeknow
Identify any mysterious text or analyze strings from a file
-
case
A set of letter case string helpers
-
texcreate
Create LaTeX Projects using prebuilt templates
-
oxford_join
Join string slices with Oxford Commas!
-
lithe-cli
A cli of lithe
-
mdcat
cat for markdown: Show markdown documents in terminals
-
mangatrans
Manga transcription data format and ways to render them into readable formats, statistics and more
-
yozuk
Chatbot for Programmers
-
human_regex
A regex library for humans
-
galm
GalM is pattern matching library
-
bbscope
BBCode to HTML with scoping rules, auto-close tags, highly extensible
-
mdbook-plantuml
A preprocessor for mdbook which will convert plantuml code blocks into inline SVG diagrams
-
nlprule
A fast, low-resource Natural Language Processing and Error Correction library
-
zhconv
Traditional/Simplified and regional Chinese variants converter based on MediaWiki conversion rules and powered by AC automata 轉換中文簡體、繁體及兩岸、新馬地區詞,基於MediaWiki之字詞轉換表
-
ansi-str
A library which provides a set of methods to work with ANSI strings
-
chardetng
A character encoding detector for legacy Web content
-
text-sanitizer
convert text to plain ASCII text
-
rhuffle
Random shuffler for large file with many lines
-
hmd
Custom Markdown Engine for my personal blog
-
sublime_fuzzy
Fuzzy matching algorithm based on Sublime Text's string search
-
newdoc
Generate pre-populated module files formatted with AsciiDoc that are used in Red Hat and Fedora documentation
-
inflections
High performance inflection transformation library for changing properties of words like the case
-
matchers
Regex matching on character and byte streams
-
text_lines
Information about lines of text in a string
-
repgrep
An interactive command line replacer for
ripgrep
-
kv-log-macro
Log macro for log's kv-unstable backend
-
ascii-hangman-backend
customizable Hangman game with ASCII-art rewarding for children (backend)
-
stringmatch
Allow the use of regular expressions or strings wherever you need string comparison
-
roff
ROFF (man page format) generation library
-
easy_process
Easy to use process runner
-
precis-profiles
Implementation of the PRECIS Framework: Preparation, Enforcement, and Comparison of Internationalized Strings Representing Usernames and Passwords as defined in rfc8265; and Nicknames as defined in rfc8266
-
mdbook-cat-prep
a preprocessor for mdbook which provides teacher, subject, material and tag functionality
-
sedregex
Sed-like regex library
-
kas-text
Text layout and font management
-
rand_regex
Generates random strings and byte strings matching a regex
-
epub-builder
A Rust library for generating EPUB files
-
mdtransform
a simple program that renders Markdown as HTML for websites
-
ncount
A word count tool intended to derive useful stats from markdown
-
inflector-plus
Adds String based inflections for Rust. Snake, kebab, camel, word, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
lindera-unidic-builder
A Japanese morphological dictionary builder for UniDic
-
egui_commonmark
Commonmark viewer for egui
-
linurgy
Manipulate the output of multiple newlines. Replace/Insert/Append newlines with text. Input and output from stdio/files/buffers
-
anslatortray
A simple Rust library to translate from English to Pig Latin!
-
mdbook-chapter-number
A mdBook preprocessor that adds chapter numbers to the each page header
-
artem
Convert images from multiple formats (jpg, png, webp, etc…) to ASCII art
-
seaplane-oid
Seaplane Object IDs
-
unindenter
A simple tool to unindent text
-
natural
Pure rust library for natural language processing
-
kase
Convert from one case to another. Input case is determined on a best-guess basis, but can be overridden with the --from flag.
-
fast_symspell
Spelling correction & Fuzzy search
-
loc
Count lines of code (cloc) fast
-
omekasy
Decorate alphanumeric characters in your input with various font; special characters in Unicode
-
easy-regex
Make long regular expressions like pseudocodes
-
hns
Human numeric sorting program — does what
sort -h
is supposed to do! -
pdf-extract
A library to extract content from pdfs
-
easy-xml
An XML library
-
html-auto-p
This library provides a function like
wpautop
in Wordpress. It uses a group of regex replaces used to identify text formatted with newlines and replace double line-breaks with HTML paragraph tags. -
mdbook-fs-summary
Summary generator for mdbook
-
adblock
Native Rust module for Adblock Plus syntax (e.g. EasyList, EasyPrivacy) filter parsing and matching.
-
imperative
Check for imperative mood in text
-
egg-mode-text
Text parsing for Twitter: character counting, hashtag/mention extraction
-
onig_sys
The
onig_sys
crate contains raw rust bindings to the oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma… -
esl01-renderdag
Render a graph into ASCII or Unicode text
-
text-size
Newtypes for text offsets
-
nlpo3
Thai natural language processing library, with Python and Node bindings
-
fuse-rust
Fuse is a super lightweight library which provides a simple way to do fuzzy searching. Fuse-Rust is a port of Fuse-Swift, written purely in rust
-
boreal
A library to evaluate YARA rules, used to scan bytes for textual and binary pattern
-
fast2s
A fast Traditional Chinese to Simplified Chinese conversion library. Built with FST, faster than most of other libraries.
-
pithy
Ultra-fast, spookily accurate text summarizer that works on any language
-
mdbook-svgbob2
Alternative mdbook preprocessor for svgbob
-
aki-xcat
concatenate files that are plain, gzip, xz and zstd
-
scanlex
a simple lexical scanner for parsing text into tokens
-
sixbit
Small packed strings
-
text2art
Simple library for translate text to ascii art
-
mdbook-chess
An mdbook preprocessing plugin to generate chess boards
-
wchar
Procedural macros for compile time UTF-16 and UTF-32 wide strings
-
clarifai_grpc
The official Clarifai gRPC Rust client
-
untanglr
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies
-
roe
Unicode case conversion
-
ngrammatic
Character-oriented ngram generator and fuzzy matching library
-
crowbook-text-processing
Provides some utilities functions for escaping text (HTML/LaTeX) and formatting it according to typographic rules (smart quotes, ellipsis, french typograhic rules)
-
enum-ts
TypeScript Enum pattern matcher codegen
-
block-list
A minimalist hosts-based tool for managing block lists and ad-blocking
-
glyph_brush
Fast cached text render library using ab_glyph
-
eyes
A simpler way to parse using human-readable templates
-
unicode-normalization-alignments
This crate provides functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
lsp-positions
LSP-compatible character positions
-
resticular
Blazingly Fast Static Site Generator
-
lexical-sort
Sort Unicode strings lexically
-
unicode-truncate
Unicode-aware algorithm to pad or truncate
str
in terms of displayed width -
annoying_journal
A journal that pops up regularly to force you to write entries
-
unicode_reader
Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters
-
mdbook-epub
An EPUB renderer for mdbook
-
utf16_lit
macro_rules to make utf-16 literals
-
wa
WebAssembly and Rust utility library
-
text_analysis
Analyze text stored as *.txt in provided file or directory. Doesn't read files in subdirectories. Counting all words and then searching for every unique word in the vicinity (+-5 words)…
-
cglue-bindgen
cleanup cbindgen headers for CGlue
-
levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
modeling
Modeling is a tools to analysis different languages by Ctags
-
zalgo-codec
A crate for converting an ASCII text string to a single unicode grapheme cluster and back. Provides a macro for executing Rust source code that has been encoded in this way. Additionally…
-
zspell
Native Rust library for spellchecking, with a command line interface
-
glue
Glue is a parser combinator framework for parsing text based formats, it is easy to use and relatively fast too
-
fast-str
A flexible, easy-to-use, immutable, efficient
String
replacement for Rust -
indefinite
Prefix a noun with an indefinite article - a or an - based on whether it begins with a vowel
-
lcat
Another lolcat in rust! Full unicode support, escapes for ANSI escape sequences, hue shift in Cubehelix
-
resast
Rusty-ECMAScript Abstract Syntax Tree
-
lyra2
Pure rust library in Lyra2, Lyra2RE, Lyra2REv2, Lyra2REv3
-
hypher
hypher separates words into syllables
-
entities
Provides the raw data needed to convert to and from HTML entities
-
sesters
💱 Fast, offline currency converter 💴 💷 💶 💵
-
etch
Not just a text formatter, don't mark it down, etch it
-
cmark-syntax
Syntax highlighting for pulldown-cmark
-
ultron
A web base text-editor with strong focus on displaying multi-width characters correctly
-
rmw-utf8
Short text compression algorithm for utf-8 (optimized for Chinese , developed based on rust programming language). 面向utf-8的短文本压缩算法(为中文压缩优化,基于rust编程语言开发)。
-
csvsc
Build processing chains for CSV files
-
mdzk
Plain text Zettelkasten based on mdBook
-
hunspell-sys
Bindings to the hunspell C API
-
typos-vars
Source Code Spelling Correction
-
gistit
Quick and easy code snippet sharing
-
proc-macro-regex
A proc macro regex library
-
svgbob
Transform your ascii diagrams into happy little SVG
-
symbolic_expressions
A symbolic-expression parser/writer
-
seshat-unicode
A Unicode Library for Rust. Unicode 14.0.0 ready.
-
soft-ascii-string
char/str/string wrappers which add a "is-ascii" soft constraint
-
cesu8
Convert to and from CESU-8 encoding (similar to UTF-8)
-
ranting
Linguistic formatting placeholder extensions for rust
-
crabfetcher
A fetcher written in rust
-
unidecode
Provides pure ASCII transliterations of Unicode strings
-
clippers
Cross-platform clipboard management library
-
unicode-case-mapping
Fast lowercase, uppercase, and titlecase mapping for characters
-
kelp
A convert tool for Japanese
-
pretty-xmlish
Pretty print XML-ish data with unicode art
-
rust_nickname_generater
A rust library that generates user/nick names based on the rust language
-
line-span
Find line ranges and jump between next and previous lines
-
futf
Handling fragments of UTF-8
-
indent_write
Simple Write adapters to add line indentation
-
awabi
A morphological analyzer using mecab dictionary
-
tracery
Text-expansion library
-
deepl-api
Bindings and a commandline tool for the DeepL REST API (https://www.deepl.com/docs-api/)
-
bilingual
A cmdline tool used for markdown translation via calling Chinese translation api cloud services
-
regex-automata
Automata construction and matching using regular expressions
-
unescape
Unescapes strings with escape sequences written out as literal characters
-
mdbook-keeper
An improved testing experience for mdbook
-
glob-match
An extremely fast glob matcher
-
secular
No Diacr!
-
encoding-next
Character encoding support for Rust
-
slicedisplay
Simplistic Display implementation for Vecs and slices
-
markdown-it
Rust port of popular markdown-it.js library
-
unescaper
Unescape strings with escape sequences written out as literal characters
-
prefix
A customizable pretty printer for FIX messages
-
wordcut-engine
Word segmentation/breaking library
-
wikidot-normalize
Simple library to provide Wikidot-compatible string normalization
-
voikko-rs
Rust bindings for the Voikko library
-
near-facsimile
Find similar or identical text files in a directory
-
uwuify
fastest text uwuifier in the west
-
insa
Insa renders imaxes to text - it's a terminal bitmap graphics library
-
nyar-hir
High level intermediate representation for Nyar
-
frender
react in rust. functional rendering web ui
-
esri_ascii_grid
A library for reading ESRI Ascii Grid .asc files
-
token-read
A library for reading whitespace delimited files intended for competitive programming
-
twitter_text_config
Configuration for twitter-text in Rust
-
cloc
Count, or compute differences of, lines of source code and comments
-
utf8_iter
Iterator by char over potentially-invalid UTF-8 in &[u8]
-
cindex
CSV indexing library
-
unicode-security
Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules
-
glyph_brush_layout
Text layout for ab_glyph
-
anabot
A CLI tool to find anagrams of words; front-end to anagrambot
-
rewrite
Safely rewrite file contents from stdin, even when file is open as an input
-
mdbook-pikchr
A mdbook preprocessor to render pikchr code blocks as images in your book
-
sanitizer
A collection of methods and macros to sanitize struct fields
-
censor
A simple text profanity filter
-
pandoc
a library API that wraps calls to the pandoc 2.x executable
-
mdbook-twiki
twiki backend for mdbook
-
texcore
Write latex in rust, and either compile it to pdf or output the tex code
-
jsonrpc-reactor
JSON-RPC 2.0 Rust implementation with tokio backend
-
markdown-extract
Extract sections of a markdown file
-
git-busy
A wrapper around "git commit" that generates the commit messages for you
-
ftrace
ftrace - trace files and paths
-
fontconfig
Safe, higher-level wrapper around the Fontconfig library
-
eddie
Fast and well-tested implementations of edit distance/string similarity metrics: Levenshtein, Damerau-Levenshtein, Hamming, Jaro, and Jaro-Winkler
-
qpdf
Rust bindings to QPDF C++ library
-
mdbook-scientific
Enables inline equations for mdbook to set by $..$ signs and $$..$$
-
mdbook-open-on-gh
mdbook preprocessor to add a open-on-github link on every page
-
notegraf
Core library for building a graph-oriented notebook
-
mdxbook
Fork of mdBook, with more customizations and flexibility for programmers
-
slicestring
slicestring is a crate for slicing Strings
-
mdbook-echarts
A preprocessor and cli tool for mdbook to show chart use echarts.js
-
kvarn-chute
A Markdown converter designed to use the Kvarn templating engine
-
docket
Simple markdown to HTML documentation rendering
-
float-pretty-print
Format f64 for showing to user, not for serialisation
-
kaolinite
A crate to assist in the creation of TUI text editors
-
json_to_table
A libary for pretty print JSON as a table
-
pathtrim
This crate implements the TrimmablePath trait on anything that implements AsRef<std::path::Path> so you can easily obtain the last *n* parts of a path
-
masker
Mask patterns in data
-
mdbook-open-git-repo
mdbook preprocessor to add a open-on-git-repo link on every page
-
hunspell-rs
Rust bindings to the Hunspell library
-
analyse-json
analyse-json is a CLI tool for inspecting (Newline Delimited) NDJSON or JSON to understand the contents
-
tgrep
Toy grep that honors .gitignore
-
simple_password_generator
Simple CLI password generator. Can also be used as lib in other porjects
-
hyphenation_commons
Proemial code for the
hyphenation
library -
litua
Read a text document, receive its tree in Lua and manipulate it before representing it as string
-
ut1_blocklist
UT1 blocklist URL/domain filters
-
ipa-translate
A crate for translating ASCII text to IPA
-
emojicon
Find Emoji by using Emoticons and GitHub's, Bengali emoji names
-
words-count
Count the words and characters, with or without whitespaces
-
rust-tfidf
Library to calculate TF-IDF (Term Frequency - Inverse Document Frequency) for generic documents
-
zummi
fun lib that produces spoonerisms
-
linereader
An efficient buffered line reader
-
xim-ctext
compound text en/decoder
-
pinot
Fast, high-fidelity OpenType parser
-
ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched
-
precis-core
PRECIS Framework: Preparation, Enforcement, and Comparison of Internationalized Strings in Application Protocols as defined in rfc8264
-
morsels_lang_ascii
Basic ascii tokenizer for morsels
-
text_trees
Simple textual output for tree-like structures
-
markov_strings
A simplistic Markov chain text generator
-
ccase
Command line interface to convert strings into any case
-
tpt
Pure Rust implementation of the Unix concatenate (cat), word-count (wc) and echo command
-
textcode
Text encoding/decoding library. Supports: UTF-8, ISO6937, ISO8859, GB2312
-
dis-spawner
Session-lived containers for advanced browser-based applications
-
mdbook-catppuccin
A mdbook preprocessor that exposes the catppuccin flavours as available themes!
-
path2regex
Express style path to RegExp utility
-
wordmarkov
A simple but flexible Markov chain library, specifically for text sentences, which handles punctuation and whitespace
-
tcalc-rustyline
A fork of Rustyline for use specifically with tcalc
-
tinyvec_string
tinyvec based string types
-
stam
STAM is a library for dealing with standoff annotations on text
-
markov
A generic markov chain implementation in Rust
-
utf8-bufread
Provides alternatives to BufRead's read_line & lines that stop not on newlines
-
soundex-rs
A library that calculates soundex value
-
binyl
A bitwise UTF-8 string inspection tool
-
utf-8
Incremental, zero-copy UTF-8 decoding with error handling
-
hime_compiler
CLI tool for the generation of LR and GLR parsers for the Hime runtime
-
str_inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
ascii_tree
generates ascii trees
-
lindera-tantivy
Lindera Tokenizer for Tantivy
-
STKLR
STKLR: pronounced 'stickler'. Is a cli tool to automatically link functions, enums, structs, traits etc in rust-doc docstrings. I couldn't find a tool like this when I needed it so... here we are.
-
ryaspeller
A tool and lib for searching typos in text, files and websites
-
mdbook-cmdrun
mdbook preprocessor to run arbitrary commands
-
rust_readability
A package to assess the complexity of texts using a variety of readability formulas
-
whitespace-sifter
Sift duplicate whitespaces away!
-
textpos
A small library to track human-readable text positions
-
image-to-ascii
Converts images and gifs to ascii art
-
varcon-core
Varcon-relevant data structures
-
grammateus
A Rust crate to facilitate working with Ancient Greek words
-
tagsearch
Filter plaintext files based on @keyword tags
-
strmatch
Conditionally match strings in Rust using regex without much boilerplate
-
mktoc
Generate Table of Contents from Markdown files
-
pomsky-syntax
Parser for pomsky, a new regular expression language
-
indented_text_writer
A simple IndentedTextWriter
-
textspan
Text span utility
-
cyrla
Library for two-way conversion between latin and cyrillic script
-
string-builder
A simple string builder type
-
matchable
Check if text is matching against string or regex in one API
-
rsnltk
Rust-based Natural Language Toolkit
-
codepage-strings
encode / decode strings for Windows code pages
-
mdbook-mermaid
mdbook preprocessor to add mermaid support
-
adbook
Creates a book from AsciiDoc files
-
fancy-regex-fork-pb
A custom fork of the fancy-regex crate. You probably don't want to use this.
-
parse_wiki_text
Parse wiki text from Mediawiki into a tree of elements
-
madato
A library and command line tool for working tabular data (XLS, ODS, CSV, YAML), and Markdown
-
cutters
Rule based sentence segmentation library
-
abjad
Calculate the numerical abjad value of Arabic-script text
-
mdbook-to-github-wiki
Turns an mdbook book into a github wiki
-
symbolize
This crate allows you to convert raster images into their symbolic versions
-
eliza
A rust implementation of ELIZA - a natural language processing program developed by Joseph Weizenbaum in 1966
-
ucd
Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust's core crate)…
-
character_frequency
Simple library for counting character frequencies in a string concurrently
-
top-english-words
Library to retrieve top words from the English language
-
crowbook
Render a Markdown book in HTML, PDF or Epub
-
emojito
Find all the Emoji in a string. Supports composed emoji.
-
unic-emoji-char
UNIC — Unicode Emoji — Emoji Character Properties
-
pandoc_types
Rust port of pandoc-types
-
easy_io
Fast and dead-simple IO for competitive programming in Rust
-
fmty
Composable
core::fmt
utilities -
xsystem
Conversion between the Esperanto x-system and Unicode circumflexes
-
zw
Utility for encoding and decoding text using zero-width characters
-
cp437_constants
Defined constants covering the entirety of ASCII code page 437 (https://en.wikipedia.org/wiki/Code_page_437)
-
allsorts
Font parser, shaping engine, and subsetter for OpenType, WOFF, and WOFF2
-
text2num
Parse and convert numbers written in English, Spanish or French into their digit representation
-
diamond-types
The world's fastest text CRDT
-
onepage
A simple static site generator
-
file-expert
Expert system for recognizing source code files, similar to GitHub/lingust
-
svgbobdoc
Renders ASCII diagrams in doc comments as SVG images
-
binstalk-manifests
The binstall toolkit for manipulating with manifest
-
basic-text-internals
Basic Text string literal implementation details
-
fm
Non-backtracking fuzzy text matcher
-
ttaw
talking to a wall, a piecemeal natural language processing library
-
markdown-gen
Crate for generating Markdown files
-
translitrs
Transliteration utility for Serbian language
-
read-fonts
Reading OpenType font files
-
reductive
Optimized vector quantization for dense vectors
-
node-emoji
Convert
:emoji:
to Unicode using GitHub’s and EmojiDB’s emoji names -
repvar
A tiny CLI tool that replaces variables of the style
${KEY}
in text with their respective value. It can also be used as a rust library -
porter-stemmer
Flexible and unicode friendly, Porter stemmer implementation
-
slugify-rs
A rust library to generate slugs from strings
-
grammalecte_client
Grammalecte HTTP client
-
mdbook-yml-header
mdBook preprocessor for removing yml header
-
dynfractal
A program to render any equation (in terms of z and c) into an ASCII fractal
-
branchy
Provides tools for generating strings and sequences using context-free grammars
-
csml_interpreter
The CSML Interpreter is the official interpreter for the CSML programming language, a DSL designed to make it extremely easy to create rich and powerful chatbots
-
mdbook-trace
A traceable document preprocessor for mdbook
-
aki-mcolor
mark up text with color
-
emoji
Every emoji, their metadata, and localized annotations
-
ascii_converter
A library for converting between different ascii representations
-
yozuk-sdk
Types used in the Yozuk ecosystem
-
chinese_detection
Classify a string as either English, Chinese, or Pinyin
-
tectonic_bridge_harfbuzz
Expose the Harfbuzz C/C++ APIs to Rust/Cargo
-
zipcodes
Query US zipcodes without SQLite
-
texting
string helpers
-
paxcii
Transform images and videos to ascii
-
glifnames
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
kl-hyphenate
Knuth-Liang hyphenation for a variety of languages
-
ascii-canvas
simple canvas for drawing lines and styled text and emitting to the terminal
-
rnltk
Natural Language Toolkit for Rust
-
lazy-char-iter
Lazily iterate over Unicode characters from a u8 slice in Rust
-
unified-diff
An implementation of the GNU unified diff format
-
console_static_text
Logging for text that should stay in the same place in a console
-
gimme
Pull useful data out of your clipboard
-
debug_canvas
Easily print complicated debug strings
-
commonregex
Rust port for CommonRegex. Find all times, dates, links, phone numbers, emails, ip addresses, prices, hex colors, and credit card numbers in a string. We did the hard work so you don't have to.
-
docstring
Utilities for manipulating and parsing documentation strings
-
charname
Incredibly simple library that just gives you the Unicode name for a character
-
readable-regex
Regex made for humans. Wrapper to build regexes in a verbose style.
-
sgit
stupid git in a tool
-
ascii-alphabetic-char
Traits for ASCII alphabetic characters
-
rake
Rust implementation of Rapid Automatic Keyword Extraction (RAKE) algorithm
-
encoding8
various 8-bit encodings
-
site
A simple, fast, opinioned static site generator
-
lindera-cc-cedict-builder
A Chinese morphological dictionary builder for CC-CEDICT
-
publish_demo_node_1
This program picks a random christmas emoji for you
-
owned_chars
Owned iterators with the same output as Chars and CharIndices
-
sentencepiece-sys
Binding for the sentencepiece tokenizer
-
tzgrep
grep tar.gz
-
rural
User-friendly command-line HTTP tool
-
regex-split
An implementation of split_inclusive for the regex crate
-
xpanda
Unix shell-like parameter expansion/variable substitution
-
md-inc
Include files in Markdown docs
-
str-utils
This crate provides some traits to extend types which implement
AsRef<[u8]>
orAsRef<str>
-
slugify
Macro for flexible slug generation
-
guarding
Guarding is a guardians for code, architecture, layered. Guarding crate a architecture aguard DSL which based on ArchUnit.
-
nanohtml2text
A zero-dependency library to convert HTML to plain text
-
loe
Very fast and yet another line ending (CRLF <-> LF) converter written in Rust
-
csv-groupby
execute a sql-like group-by on arbitrary text or csv files
-
collapse
Trim and collapse consecutive whitespace to a single space
-
readability
Port of arc90's readability project to rust
-
blockishfire
doomfire in the terminal
-
regex_generate
Use regular expressions to generate text
-
unicode-vo
Unicode vertical orientation detection
-
simple_hider
A crate for hiding and unhiding text with a salt value
-
genpdf
User-friendly PDF generator written in pure Rust
-
mdbook-d2
D2 diagram generator plugin for MdBook
-
narwhal_me
Need an ascii Narwhal in a hurry?
-
tabwriter
Elastic tabstops
-
korrektor
Library to work with Uzbek language text processing
-
ironstorm_lookup
Lightning fast lookup table for auto completion, type ahead, suggestion engines
-
avamain
Check domain availability based on generated character sequence
-
mapm
A set of tools, with command line and graphical interfaces, used to build exams typeset in LaTeX
-
udp-logger-rs
Log macro for log's kv-unstable backend and a UDP socket logger
-
uniwhat
Display the unicode characters text
-
crawdad
Crawdad: ChaRActer-Wise Double-Array Dictionary
-
rustascii
Display Rust in ASCII
-
homoglyphs
generate all homoglyphs for a given input sentence
-
typos
Source Code Spelling Correction
-
pandoc-ac
A simple pandoc filter for converting acronym codes to LaTeX
-
basecracker
Encode, Decode and Crack encoded data, useful to crack some random encoded strings in CTFs
-
nipah_tokenizer
A powerful yet simple text tokenizer for your everyday needs!
-
justify
Justify plaintext while handling Unicode gracefully
-
kvc
Very simple key-value-count tools to go from / to pandas data frames or streaming formats
-
case_insensitive_hashmap
A HashMap that uses case-insensitive strings as keys
-
regex_mutator
The Nautilus regex_mutator
-
rosie
Interface for the Rosie Pattern Language, for efficient and maintainable text pattern matching and search
-
mdbook-chart
A preprocessor and cli tool for mdbook to show chart use C3.js
-
interpolate
A simple form of string interpolation
-
ncase
Enforce a case style
-
histongram
A small crate for counting n-grams
-
mingrep_arpitjp
my first crate
-
stylish-stringlike
API for string-like objects that have styles applied
-
rcut-lib
rcut is a Rust replacement for GNU cut that supports UTF-8
-
moco_abm
MOCO - Anytime Behavior Model (moco_abm)
-
asciis
ASCII base on RFC20. Just ord() and chr()
-
changecase
A trait and implementation for changing the case of Strings and &str. It currently supports uppercase, lowercase, alternating case, and inverting case. Title case is in the works.
-
lsp-textdocument
A LSP text documents manager that map of text document
-
nfa_regex
Simple NFA regex engine for text processing
-
git-blamediff
A program to automatically annotate changes to a file in git(1)
-
ogrep
Tool for searching in indentation-structured texts
-
sejong
Sejong Buffer is a buffer that can receive ASCII bytes different from keyboard and send out UTF-32 Hangul string. This buffer allows deletion by Jamo.
-
hyperscan-sys
Hyperscan bindings for Rust with Multiple Pattern and Streaming Scan
-
pangu
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
bughouse
This is a bughouse (2v2 Chess) library for tracking game state and validating legal moves
-
const_format_proc_macros
Implementation detail of the
const_format
crate -
string-overlap
A helper crate for "layering" ASCII art
-
latex-to-html
Latex to html converter
-
byte_string
Wrapper types for outputting byte strings (b"Hello") using the Debug ({:?}) format
-
wtf8-rs
Implementation of the WTF-8 encoding
-
esperanto-text
Convert Esperanto text between UTF-8, x-system and h-system transliterations
-
mdbook-latex
An mdbook backend for generating LaTeX and PDF documents
-
mdplayscript
An extension of Markdown for play scripts
-
neo-mime
Strongly Typed Mimes
-
mandown
Markdown to groff (man page) converter
-
rust_lemmatizer
A lemmatizing package for use with a .csv dictionary of lemmas and their corresponding words
-
hello_lib
Demonstrate Generics Function
-
textedit-merge
A library for sorting and merging textedits
-
mathematica-notebook-filter
mathematica-notebook-filter
parses Mathematica notebook files and strips them of superfluous information so that they can be committed into version control systems more easily -
encoding-index-korean
Index tables for Korean character encodings
-
mdbook-nocomment
Remove all html comments
-
anagrambot
A library to find anagrams of words
-
truecase
Restore correct letter casings in arbitrary text using a statistical model
-
notedown_ast
Notedown Abstract Syntax Tree
-
mdbook-checklist
An mdBook preprocessor for generating checklists and indexes
-
rusty-x
A snippet manager in Rust
-
lithe
A Slim template engine by using Pest
-
kana-converter
A simple converter for half-width/full-width Japanese language characters (katakana, hiragana, and ASCII)
-
fluxcap
time expression parsing inspired on duckling
-
ripgrep
ripgrep is a line-oriented search tool that recursively searches the current directory for a regex pattern while respecting gitignore rules. ripgrep has first class support on Windows, macOS and Linux.
-
pragmatic-segmenter
Rust port of pySBD v3.1.0
-
random_access_unicode
Provide random access to the unicode characters in a file
-
relex
a library for building a regex-based lexer
-
unicode-reverse
Unicode-aware in-place string reversal
-
optic
Schoolwork as code
-
wildflower
Wildcard matching against strings
-
rough
A very simple and opinionated static site generator
-
const-str-proc-macro
compile-time string operations
-
mdbook-indexing
mdbook preprocessor for index generation
-
mdbook-backlinks
A preprocessor for mdbook that automatically adds backlinks to each chapter
-
indent_tokenizer
Generate tokens based on indentation
-
tablers
Tool to create tables and export them into CSV or PDF. Without external deps
-
tpng
A small tool that prints truecolor png renderings to the terminal using unicode block characters
-
fontconfig-rs
Safe, higher-level wrapper around the fontconfig library
-
cfasttext-sys
fastText ffi binding
-
character-set
High performance
set.contains(char)
-
verba
A library for working with Latin words
-
poriborton
Interconversion between Unicode and various Bengali ANSI encodings
-
clippy_lints
A bunch of helpful lints to avoid common pitfalls in Rust
-
pulldown-cmark-fork
A pull parser for CommonMark
-
utf16_iter
Iterator by char over potentially-invalid UTF-16 in &[u16]
-
res-regex
A js-regex validator
-
vaporetto_rules
Rule-base filters for Vaporetto
-
mdbook-pagetoc
A mdbook plugin that provides a table of contents for each page
-
cow-utils
Copy-on-write string utilities for Rust
-
stringslice
A collection of methods to slice strings based on character indices rather than bytes
-
djr
Djot rendering CLI based on the djot crate
-
ascii-hangman-webapp
customizable Hangman game with ASCII-art rewarding for children (webapp version)
-
scie
Scie is a research about how to build simple code identify engine for different languages
-
yozuk-helper-english
English NLP utilities for Yozuk
-
texc-latex
Contains LaTeX templates for TeXCreate
-
bitap
Bitap implementation in rust
-
wordbreaker
A Unicode-aware no_std crate (requires alloc) that rapidly finds all sequences of dictionary words that concatenate to a given string
-
dmp
A high-performance library in that manipulates plain text
-
spandex-hyphenation
Knuth-Liang hyphenation for a variety of languages
-
precis-tools
Tools and parsers to generate PRECIS tables from the Unicode Character Database (UCD)
-
search_file
yet another grep utility
-
mdbook-pdf-headless_chrome
A fork from headless_chrome for publishing mdbook-pdf
-
find-simdoc
Time- and memory-efficient all pairs similarity searches in documents
-
terminal-paint
library that simplifies printing colorful text to your console
-
norad
Read and write Unified Font Object files
-
notmecab
Library for tokenizing text with mecab dictionaries. Not a mecab wrapper.
-
lindera-ipadic-builder
A Japanese morphological dictionary builder for IPADIC
-
csmlinterpreter
The CSML (Conversational Standard Meta Language) is a Domain-Specific Language developed for creating conversational experiences easily
-
doc-chunks
Clusters of doc comments and dev comments as coherent view
-
powr-tokenizer
converts JS/TS code into tokens. part of the power project.
-
moenarchbook
Creates a book from markdown files
-
seam
Symbolic Expressions As Markup
-
rckive-genpdf
User-friendly PDF generator written in pure Rust
-
deface
Lightweight markup to HTML converter
-
goodname
Goodname: Tool to assist you with cool naming of your methods and software
-
linkcheck
A library for extracting and validating links
-
md_converter
My markdown to html converter
-
basen
Convert binary data to ASCII with a variety of supported bases
-
taro-cli
Wrapper for multiple quality of life command-line tools
-
fuzzywuzzy
A pure-Rust clone of the incredibly useful fuzzy string matching python package, FuzzyWuzzy
-
unicode-canonical-combining-class
Fast lookup of the Canonical Combining Class property
-
chinese_segmenter
Tokenize Chinese sentences using a dictionary-driven largest first matching approach
-
morsels_lang_latin
Ascii tokenizer with stemmers for morsels
-
try_publish_for_demo
This program picks a random christmas emoji for you
-
mdtranslation-cli
Command-line tools for using mdTranslation, which can be used to prepare multi-lingual Markdown documents
-
unicode-jp
A library to convert Japanese Half-width-kana[半角カナ] and Wide-alphanumeric[全角英数] into normal ones
-
dcsv
Dyanmic csv reader,writer,editor
-
igo-rs
Pure Rust port of the Igo, a POS(Part-Of-Speech) tagger for Japanese (日本語 形態素解析)
-
aki-gsub
substitude text command, replace via regex
-
gen-epub-book
Generate an ePub book from a simple plaintext descriptor
-
mdbook-preprocessor-boilerplate
Boilerplate code for mdbook preprocessors
-
goya
Yet another morphological analyzer for Rust and WebAssembly
-
ucd-trie
A trie for storing Unicode codepoint sets and maps
-
lindera-decompress
A morphological analysis library
-
mdbook-theme
A preprocessor and a backend to config theme for mdbook, especially creating a pagetoc on the right and setting full color themes from the offical ace editor
-
tectonic_engine_xetex
The XeTeX engine as a reusable crate
-
pdf-canvas
Generate PDF files in pure Rust. Currently, simple vector graphics and text set in the 14 built-in fonts are supported
-
emoji-printer
Replace emoji shortcodes in string with emoji unicode (":sushi:" -> 🍣)
-
unicode-ccc
Unicode Canonical Combining Class detection
-
rupantor
A Bengali Phonetic Parser which is very flexible and supports Avro Phonetic
-
mdbook-fishextract
A mdbook preprocessor which handles mermaid graphs, offline, requires
mmdc
-
doccy
Doccy is a simple brace based markup language
-
arbitrator
Format text based on a set of rules and regexes
-
whitespacers
A whitespace JIT-compiler, for when your whitespace just isn't executing fast enough
-
zhconv-cli
Convert Traditional/Simplified Chinese and regional words of Taiwan/Hong Kong/mainland China/Singapore based on Wikipedia conversion tables 轉換中文簡體、繁體及兩岸、新馬地區詞,基於中文維基轉換…
-
naromat
Convert text to narou novel format
-
word_filter
A Word Filter for filtering text
-
hashtag-regex
A simple regex matching hashtags accoding to the unicode spec: http://unicode.org/reports/tr31/#hashtag_identifiers
-
scripter
A screenplay compiler
-
goya-ipadic
IPA dictionary for Goya
-
nlprule-build
Build tools for a fast, low-resource Natural Language Processing and Error Correction library
-
mdbook-last-changed
mdbook preprocessor to add the last modification date per page
-
humnum
Human numeric sorting program — does what
sort -h
is supposed to do! -
unic-char-property
UNIC — Unicode Character Tools — Character Property taxonomy, contracts and build macros
-
fifthtry-mdbook
fork of mdbook, only for ft-cli
-
stfu
Shut The Ferris Up - profanity filtering for Rust
-
yeslogic-ucd-parse
A library for parsing data files in the Unicode character database
-
vtext
NLP with Rust
-
tantivy-meta-tokenizer
Easy and fast book searcher, create and search your private library
-
mojimoji-rs
Rust implementation of a fast converter between Japanese hankaku and zenkaku characters, mojimoji
-
cmark2tex
A small utility to convert markdown files to pdf exploiting tectonic
-
chanoma
chanoma is Characters Normalization library. 文字列正規化処理用のライブラリです。
-
mdbook-presentation-preprocessor
A preprocessor for utilizing an MDBook as slides for a presentation
-
soup
Inspired by the python library BeautifulSoup, this is a layer on top of html5ever that adds a different API for querying and manipulating HTML
-
cabocha
Safe Rust wrapper for cabocha a japanese language dependency structure analyzer library
-
lingua-latvian-language-model
The Latvian language model for Lingua, an accurate natural language detection library
-
varcon
Source Code Spelling Correction
-
twemoji-rs
A word-cloud image generation crate
-
mediawiki_parser
A strict parser for MediaWiki markdown
-
moscato
Outline scaler for OpenType glyphs
-
cyrconv
A funny faux cyrillic character mapper
-
math-text-transform
Transform greek letters, latin letters, or decimal digits into certain variants from the mathematical alphanumeric symbols Unicode block (U+1D400–U+1D7FF). For example to bold, italic, script or double-struck.
-
stylometry
A package to perform stylometry operations
-
parattice
Recursive paraphrase lattice generator
-
unic-segment
UNIC — Unicode Text Segmentation Algorithms
-
trashy-xml
Xml parser that does not stop parsing when encountering errors
-
furigana
Map furigana to a word given its reading
-
owlz
"Owlz" ascii emoojis, created randomly or by design
-
tiny-gradient
Make your string colored in gradient
-
inclip
A simple terminal utility for clipboard interaction and convenience
-
yozuk-helper-filetype
Filetype detection for Yozuk
-
founder
The package provides a font toolbox
-
unidok
A powerful, readable, easy-to-learn markup language
-
wkhtmltopdf
High-level bindings to wkhtmltopdf
-
august
A crate & program for converting HTML to plain text
-
terminal-text-styler
A crate for creating ANSI escape codes to produce styled text for terminal output
-
charset
Thunderbird-compatible character encoding decoding for email
-
json-event-parser-witespace
A json event parser with whitespace
-
truncrate
truncate strings economically without splitting your characters, symbols, emojis etc
-
whisperer
将文本编码为简短的中文字符, 防和谐
-
zalgo-codec-macro
This crate provides the zalgo_embed! macro for compiling zalgo-encoded source code
-
rosie-sys
A crate to build or link to librosie to access the Rosie Pattern Language
-
bocu1
BOCU-1 compressed unicode encoding
-
pomsky-bin
Compile pomsky expressions, a new regular expression language
-
dejavu-runtime
DejaVu shared types and runtime
-
render_readme
Render Markdown or reStructuredText with syntax highlighting and image filtering similar to GitHub's
-
str_overlap
Methods for finding the overlap between two string slices
-
charclass
Library crate to define and modify unicode character classes
-
gskkserv
skkserv using Google IME
-
lindera-dictionary
A Japanese morphological dictionary
-
text-tables
A terminal/text table prettifier with no dependencies
-
unicode_names
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
char-lex
Create easy enum based lexers
-
aki-mcycle
mark up text with cycling color
-
uwl
A management stream for bytes and characters
-
dialogi
A dialog parser
-
unicode-joining-type
Fast lookup of the Unicode Joining Type and Joining Group properties
-
suffix
Suffix arrays
-
autoruby
Easily generate furigana for various document formats
-
nib-cli
A cli for a yet another static site generator Nib
-
yeslogic-unicode-script
Fast lookup of the Unicode Script property
-
str_to_bytes
parse a string representations of numbers or ascii text and return a Vec<u8>
-
harfbuzz
Rust bindings to the HarfBuzz text shaping engine
-
write-fonts
Writing font files
-
lindera-ko-dic
A Japanese morphological dictionary for ko-dic
-
encoding-index-japanese
Index tables for Japanese character encodings
-
ced
Dead easy csv editor
-
chinese-ner
A CRF based Chinese Named-entity Recognition Library written in Rust
-
termbook
The library behind the
termbook-cli
-
textwrap-macros-impl
Simple procedural macros to use textwrap utilities at compile time
-
ucd-data
A crate that just simply stores all UCD data as a
static
array -
yozuk-helper-platform
Platform-dependent utilities for Yozuk
-
publish_demo_node
This program picks a random christmas emoji for you
-
infisearch_lang_ascii
Basic ascii tokenizer for InfiSearch
-
vaporetto_tantivy
Vaporetto Tokenizer for Tantivy
-
tectonic_engine_spx2html
The Tectonic engine that converts SPX output to HTML
-
panduck-core
Conversion tool by rust, inspired by pandoc
-
lindera-compress
A morphological analysis library
-
dejavu-parser
Parser of DejaVu Language
-
pomsky-macro
Macro for converting pomsky expressions to regexes
-
vibrato
Vibrato: viterbi-based accelerated tokenizer
-
sesdiff
Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings in column A to the strings in column B. Also provides the edit distance (levenshtein).
-
mdbook-boilerplate
Avoid the mdbook preprocessor boilerplate
-
lindera-ipadic
A Japanese morphological dictionary for IPADIC
-
ucd-parse
A library for parsing data files in the Unicode character database
-
font-types
Scalar types used in fonts
-
latex
An ergonomic library for programatically generating LaTeX documents and reports
-
xsv
A high performance CSV command line toolkit
-
lingua-english-language-model
The English language model for Lingua, an accurate natural language detection library
-
aki-mline
match line, regex text filter like a grep of linux command
-
saku
A simple yet efficient rule-based Japanese Sentence Tokenizer
-
unic-ucd
UNIC — Unicode Character Database
-
caseless
Unicode caseless matching
-
dejavu-engine
DejaVu shared types and runtime
-
fontfor
find fonts which can show a specified character and preview them in terminal or browser
-
subscript-compiler
A modern LaTeX rendition
-
node_rust_demo
This program picks a random christmas emoji for you
-
unic-char-range
UNIC — Unicode Character Tools — Character Range and Iteration
-
zalgo-codec-common
A crate for converting an ASCII text string to a single unicode grapheme cluster and back