-
bstr
A string type that is not required to be valid UTF-8
-
pulldown-cmark
A pull parser for CommonMark
-
regex
An implementation of regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
-
unicode-segmentation
This crate provides Grapheme Cluster, Word and Sentence boundaries according to Unicode Standard Annex #29 rules
-
textwrap
Powerful library for word wrapping, indenting, and dedenting strings
-
encoding_rs
A Gecko-oriented implementation of the Encoding Standard
-
mdbook
Creates a book from markdown files
-
heck
heck is a case conversion library
-
Inflector
Adds String based inflections for Rust. Snake, kebab, camel, sentence, class, title and table cases as well as ordinalize, deordinalize, demodulize, foreign key, and pluralize/singularize…
-
fancy-regex
An implementation of regexes, supporting a relatively rich set of features, including backreferences and look-around
-
comrak
A 100% CommonMark-compatible GitHub Flavored Markdown parser and formatter
-
comfy-table
An easy to use library for building beautiful tables with automatic content wrapping
-
termimad
Markdown Renderer for the Terminal
-
unicase
A case-insensitive wrapper around strings
-
difference
A Rust text diffing and assertion library
-
lopdf
A Rust library for PDF document manipulation
-
aho-corasick
Fast multiple substring searching
-
unicode-normalization
This crate provides functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
glyph_brush
Fast cached text render library using ab_glyph
-
ropey
A fast and robust text rope for Rust
-
pretty
Wadler-style pretty-printing combinators in Rust
-
encoding
Character encoding support for Rust
-
printpdf
Rust library for writing PDF files
-
deunicode
Convert Unicode strings to pure ASCII by intelligently transliterating them. Supports Emoji and Chinese.
-
indenter
A formatter wrapper that indents the text, designed for error display impls
-
linkify
Finds URLs and email addresses in plain text. Takes care to get the boundaries right with surrounding punctuation like parentheses.
-
tabwriter
Elastic tabstops
-
const_format
Compile-time string formatting
-
ucd-util
A small utility library for working with the Unicode character database
-
diff
An LCS based slice and string diffing implementation
-
pulldown-cmark-to-cmark
Convert pulldown-cmark Events back to the string they were parsed from
-
kv-log-macro
Log macro for log’s kv-unstable backend
-
scanlex
a simple lexical scanner for parsing text into tokens
-
yeslogic-ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched with support for additional tables
-
lindera-cli
A command-line interface for Lindera
-
unindent
Remove a column of leading whitespace from a string
-
text_io
really simple to use panicking input functions
-
fuzzy-matcher
Fuzzy Matching Library
-
jieba-rs
The Jieba Chinese Word Segmentation Implemented in Rust
-
regex-automata
Automata construction and matching using regular expressions
-
csml_engine
The CSML Engine is a conversational engine designed to make it extremely easy to create rich and powerful chatbots
-
allsorts
Font parser, shaping engine, and subsetter for OpenType, WOFF, and WOFF2
-
lingua
An accurate natural language detection library, suitable for long and short text alike
-
unicode-script
This crate exposes the Unicode Script and Script_Extension properties from UAX #24
-
onig_sys
The onig_sys crate contains raw Rust bindings to the Oniguruma library. This crate exposes a set of unsafe functions which can then be used by other crates to create safe wrappers around Oniguruma…
-
unic-ucd-ident
UNIC — Unicode Character Database — Identifier Properties
-
svgbob
Transform your ascii diagrams into happy little SVG
-
pad
Library for padding strings at runtime
-
prettydiff
Side-by-side diff for two files
-
hyphenation
Knuth-Liang hyphenation for a variety of languages
-
array_tool
Helper methods for processing collections
-
lindera
A morphological analysis library
-
markov
A generic markov chain implementation in Rust
-
cloc
Count, or compute differences of, lines of source code and comments
-
ncount
A word count tool intended to derive useful stats from markdown
-
rant
The Rant procedural templating language
-
unicode-xid
Determine whether characters have the XID_Start or XID_Continue properties according to Unicode Standard Annex #31
-
unidecode
Provides pure ASCII transliterations of Unicode strings
-
ab_glyph
API for loading, scaling, positioning and rasterizing OpenType font glyphs
-
shell2batch
Converts simple basic shell scripts to Windows batch scripts
-
any_ascii
Unicode to ASCII transliteration
-
uwc
Counts things in unicode text files
-
natural
Pure rust library for natural language processing
-
unicode_names2
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
rectangle-pack
A general purpose, deterministic bin packer designed to conform to any two or three dimensional use case
-
mdbook-cat-prep
a preprocessor for mdbook which provides teacher, subject, material and tag functionality
-
crowbook
Render a Markdown book in HTML, PDF or Epub
-
uncased
Case-preserving, ASCII case-insensitive, no_std string types
-
wchar
Procedural macros for compile time UTF-16 strings
-
grok
A Rust implementation of the popular Java & Ruby grok library which allows easy text and log file processing with composable patterns
-
pandoc_ast
deserializes and serializes the markdown ast for writing pandoc filters
-
enum-ts
TypeScript Enum pattern matcher codegen
-
crowbook-text-processing
Provides some utility functions for escaping text (HTML/LaTeX) and formatting it according to typographic rules (smart quotes, ellipsis, French typographic rules)
-
stop-words
Common stop words in several languages
-
rustybuzz
An incremental harfbuzz port to Rust
-
loc
Count lines of code (cloc) fast
-
etch
Not just a text formatter, don’t mark it down, etch it
-
svgbob_cli
Transform your ascii diagrams into happy little SVG
-
ascii
ASCII-only equivalents to char, str and String
-
ascii-hangman
customizable Hangman game with ASCII-art rewarding for children (desktop version)
-
gspell
Rust bindings for gspell
-
wana_kana
Utility library for checking and converting between Japanese characters - Kanji, Hiragana, Katakana - and Romaji
-
justify
Justify plaintext while handling Unicode gracefully
-
case
A set of letter case string helpers
-
ngrammatic
Character-oriented ngram generator and fuzzy matching library
-
unicode-truncate
Unicode-aware algorithm to pad or truncate str in terms of displayed width
-
diffy
Tools for finding and manipulating differences between files
-
kl-hyphenate
Knuth-Liang hyphenation for a variety of languages
-
lindera-ipadic-builder
A Japanese morphological dictionary builder for IPADIC
-
textcode
Text encoding/decoding library. Supports: UTF-8, ISO6937, ISO8859, GB2312
-
textwrap-macros
Simple procedural macros to use textwrap utilities at compile time
-
imperative
Check for imperative mood in text
-
synoptic
A simple, low-level, syntax highlighting library with unicode support
-
mdplayscript
An extension of Markdown for play scripts
-
substring
A substring method for string types
-
in_definite
Get the indefinite article (‘a’ or ‘an’) to match the given word. For example: an umbrella, a user.
-
lazy-regex
a macro to reduce regex creation boilerplate
-
unicode_reader
Adaptors which wrap byte-oriented readers and yield the UTF-8 data as Unicode code points or grapheme clusters
-
ucd-generate
A program for generating packed representations of the Unicode character database that can be efficiently searched
-
convert_case
Convert strings into any case
-
csv-groupby
execute a sql-like group-by on arbitrary text or csv files
-
clarifai_grpc
The official Clarifai gRPC Rust client
-
text_analysis
Analyze text stored as *.txt or *.pdf in provided file or directory. Doesn’t read files in subdirectories. Counting all words and then searching for every unique word in the vicinity (+-5 words)…
-
unicode-joining-type
Fast lookup of the Unicode Joining Type and Joining Group properties
-
sublime_fuzzy
Fuzzy matching algorithm based on Sublime Text’s string search
-
mlc
The markup link checker (mlc) checks for broken links in markup files
-
unic-ucd-age
UNIC — Unicode Character Database — Age
-
ftrace
ftrace - trace files and paths
-
fm
Non-backtracking fuzzy text matcher
-
owned_chars
Owned iterators with the same output as Chars and CharIndices
-
mktoc
Generate Table of Contents from Markdown files
-
lazy-transform-str
Lazy-copying lazy-allocated scanning str transformations. This is good e.g. for (un)escaping text, especially if individual strings are short.
-
ferris-says
A Rust flavored replacement for the classic cowsay
-
symbolic_expressions
A symbolic-expression parser/writer
-
safe-regex
Safe regular expression library
-
fancy-regex-fork-pb
A custom fork of the fancy-regex crate. You probably don’t want to use this.
-
notmecab
Library for tokenizing text with mecab dictionaries. Not a mecab wrapper.
-
unicode_categories
Query Unicode category membership for chars
-
lexical-sort
Sort Unicode strings lexically
-
matchers
Regex matching on character and byte streams
-
linkcheck
A library for extracting and validating links
-
unicode-security
Detect possible security problems with Unicode usage according to Unicode Technical Standard #39 rules
-
doccy
Doccy is a simple brace based markup language
-
utf-8
Incremental, zero-copy UTF-8 decoding with error handling
-
levenshtein_automata
Creates Levenshtein Automata in an efficient manner
-
mdbook-plantuml
A preprocessor for mdbook which will convert plantuml code blocks into inline SVG diagrams
-
pandoc_types
Rust port of pandoc-types
-
lipsum
Lipsum is a lorem ipsum text generation library. Use this if you need some filler text for your application. The text is generated using a simple Markov chain, which you can also instantiate…
-
single_source
Generate code files from snippets in md tutorial files
-
mathematica-notebook-filter
mathematica-notebook-filter parses Mathematica notebook files and strips them of superfluous information so that they can be committed into version control systems more easily
-
autocorrect
Automatically add whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
ucd
Extends the char type to provide access to most fields of the UCD, Unicode Character Database, as of version 9.0.0. It aims to be compact, fast, and use minimal dependencies (only rust’s core crate)…
-
lindera-tantivy
A Tokenizer for Tantivy, based on Lindera
-
character_converter
Turn Traditional Chinese script to Simplified Chinese script and vice-versa
-
aki-mcycle
mark up text with cycling color
-
scie
Scie is a research project on how to build a simple code identification engine for different languages
-
html2text
Render HTML as plain text
-
unicode-width
Determine displayed width of char and str types according to Unicode Standard Annex #11 rules
-
fontconfig
Safe, higher-level wrapper around the Fontconfig library
-
csmlinterpreter
The CSML (Conversational Standard Meta Language) is a Domain-Specific Language developed for creating conversational experiences easily
-
geml
A simple Generator-orientated ML parser
-
igo-rs
Pure Rust port of Igo, a POS (part-of-speech) tagger for Japanese (Japanese morphological analysis)
-
svgbobdoc
Renders ASCII diagrams in doc comments as SVG images
-
mdbook-latex
An mdbook backend for generating LaTeX and PDF documents
-
names-changer
Convert the names in SQL schemas from camel case to snake case
-
jetscii
A tiny library to efficiently search strings and byte slices for sets of ASCII characters or bytes
-
easy_process
Easy to use process runner
-
markdown-gen
Crate for generating Markdown files
-
harfbuzz
Rust bindings to the HarfBuzz text shaping engine
-
vndb_tags_get
A tool to convert the VNDB tags list from JSON into markdown. The list can be downloaded from https://dl.vndb.org/dump/vndb-tags-latest.json.gz in gzip. This tool reads from stdin to make it simple…
-
slug
Convert a unicode string to a slug
-
unicode-normalization-alignments
This crate provides functions for normalization of Unicode strings, including Canonical and Compatible Decomposition and Recomposition, as described in Unicode Standard Annex #15
-
pdf-extract
A library to extract content from pdfs
-
nparse
Parser in rust for specific use-cases
-
entities
Provides the raw data needed to convert to and from HTML entities
-
pragmatic-segmenter
Rust port of pySBD v3.1.0
-
sejong
Sejong Buffer is a buffer that can receive ASCII bytes different from keyboard and send out UTF-32 Hangul string. This buffer allows deletion by Jamo.
-
uniwhat
Display the unicode characters text
-
eliza
A rust implementation of ELIZA - a natural language processing program developed by Joseph Weizenbaum in 1966
-
emojic
Emoji constants
-
fuzzywuzzy
A pure-Rust clone of the incredibly useful fuzzy string matching python package, FuzzyWuzzy
-
gimme
Pull useful data out of your clipboard
-
porter-stemmer
Flexible and unicode friendly, Porter stemmer implementation
-
spandex-hyphenation
Knuth-Liang hyphenation for a variety of languages
-
res-regex
A js-regex validator
-
unicode_graph
Unicode glyph graphs with Rust
-
stfu8
Sorta Text Format in UTF-8
-
stringmatch
Allow the use of regular expressions or strings wherever you need string comparison
-
progress
Library for showing text based progress bar and job status
-
regex_generate
Use regular expressions to generate text
-
roff
ROFF (man page format) generation library
-
mdbook-bib
mdbook plugin for loading and presenting a bibliography in BibLaTeX format in your books and citing its references
-
rupantor
A Bengali Phonetic Parser which is very flexible and supports Avro Phonetic
-
ttf_word_wrap
Wraps text based on character width
-
cur
The tool that will hunt for your regular expression
-
rex-regextract
extracts key value pairs out of text
-
fontdb
A simple, in-memory font database with CSS-like queries
-
august
A crate & program for converting HTML to plain text
-
easy_io
Fast and dead-simple IO for competitive programming in Rust
-
tsur
Count ASCII characters in file(s)
-
shoebill
A Wadler/Leijen style pretty-printer
-
scripter
A screenplay compiler
-
csvsc
Build processing chains for CSV files
-
line-span
Find line ranges and jump between next and previous lines
-
stringutils
A collection of various and (hopefully) useful String utility functions
-
const_format_proc_macros
Implementation detail of the const_format crate
-
indent_tokenizer
Generate tokens based on indentation
-
genpdf
User-friendly PDF generator written in pure Rust
-
norad
Read and write Unified Font Object files
-
sana
Create lexers easily
-
str-utils
This crate provides some traits to extend types which implement AsRef<[u8]> or AsRef<str>
-
unicode-case-mapping
Fast lowercase, uppercase, and titlecase mapping for characters
-
adbook
Creates a book from AsciiDoc files
-
const-str
compile-time string operations
-
ab_glyph_rasterizer
Coverage rasterization for lines, quadratic & cubic beziers
-
cow-utils
Copy-on-write string utilities for Rust
-
pcre2
High level wrapper library for PCRE2
-
kas-text
Text layout and font management
-
epub
Library to support the reading of epub files
-
sharkdown
Sharkdown is a markdown parser
-
chardetng
A character encoding detector for legacy Web content
-
str_overlap
Methods for finding the overlap between two string slices
-
unicode-reverse
Unicode-aware in-place string reversal
-
tb_normalization
A library for normalizing UTF-8 strings and removing diacritics (loc dau) for Vietnamese and some other languages
-
pulldown-cmark-fork
A pull parser for CommonMark
-
parattice
Recursive paraphrase lattice generator
-
anon-csv-cli
A tool to anonymise CSV files, providing various options to substitute real data with plausible fake data
-
text-utils
Text utilities for unescaping and alignment
-
mdbook-dtmo
Creates a book from markdown files with added plugins
-
ascii-hangman-webapp
customizable Hangman game with ASCII-art rewarding for children (webapp version)
-
aki-mcolor
mark up text with color
-
verba
A library for working with Latin words
-
changecase
A trait and implementation for changing the case of Strings and &str. It currently supports uppercase, lowercase, alternating case, and inverting case. Title case is in the works.
-
pangu
Paranoid text spacing for good readability, to automatically insert whitespace between CJK (Chinese, Japanese, Korean) and half-width characters (alphabetical letters, numerical digits and symbols)
-
sixbit
Small packed strings
-
suffix
Suffix arrays
-
case_insensitive_hashmap
A HashMap that uses case-insensitive strings as keys
-
gh-emoji
Convert :emoji: to Unicode using GitHub’s emoji names
-
xsv
A high performance CSV command line toolkit
-
sanitizer
A collection of methods and macros to sanitize struct fields
-
pdf_form
A library for programmatically filling out pdf forms
-
interpol
String interpolation macros
-
chinese-ner
A CRF based Chinese Named-entity Recognition Library written in Rust
-
windy-macros
Macros to convert a UTF-8 string to WString or AString at compile time
-
deepfrog
A deep learning NLP suite (PoS,lemmatiser,NER) with FoLiA XML support
-
wordninja
A Rust port of the Word Ninja English word splitting library
-
kincaid
A word statistics library in Rust
-
mdbook-presentation-preprocessor
A preprocessor for utilizing an MDBook as slides for a presentation
-
bibutils-sys
Rust bindings for bibutils, a program for bibliography format interconversion
-
mandown
Markdown to groff (man page) converter
-
chinese_detection
Classify a string as either English, Chinese, or Pinyin
-
framework
Framework is a detector for different frameworks in one project
-
csml_interpreter
The CSML Interpreter is the official interpreter for the CSML programming language, a DSL designed to make it extremely easy to create rich and powerful chatbots
-
josa
Korean language josa selector
-
text-tables
A terminal/text table prettifier with no dependencies
-
gen-epub-book
Generate an ePub book from a simple plaintext descriptor
-
mdbook-playscript
Preprocessor for mdBook, which styles stage play scripts
-
pandoc
a library API that wraps calls to the pandoc 2.x executable
-
branchy
Provides tools for generating strings and sequences using context-free grammars
-
japanese-ruby-filter
Japanese ruby notation parser
-
vi
An input method library for vietnamese IME
-
imag-markdown
Part of the imag core distribution: imag-markdown command
-
pdf_forms
A library for programmatically filling out pdf forms
-
unicode-blocks
This crate contains a list of all unicode blocks and provides some functions to search across them
-
kana-converter
A simple converter for half-width/full-width Japanese language characters (katakana, hiragana, and ASCII)
-
secular
No Diacr!
-
html-auto-p
This library provides a function like wpautop in WordPress. It uses a group of regex replacements to identify text formatted with newlines and replace double line-breaks with HTML paragraph tags.
-
muchutils
A selection of command line tools, wow
-
grapheme_to_phoneme
A library for building text to speech applications. Turns graphemes into phonemes.
-
trashy-xml
Less than stellar xml parser, but does what I need in a “simple” way
-
mdsite
building blocks for markdown processing and static-site gen
-
mediawiki_parser
A strict parser for MediaWiki markdown
-
blockcounter
Counts the blocks in a stream
-
math-text-transform
Transform greek letters, latin letters, or decimal digits into certain variants from the mathematical alphanumeric symbols Unicode block (U+1D400–U+1D7FF). For example to bold, italic, script or double-struck.
-
unic-emoji-char
UNIC — Unicode Emoji — Emoji Character Properties
-
poetry-book
Create a poetry book in latex, starting from plain text
-
sudachiclone
sudachiclone-rs is a Rust version of Sudachi, a Japanese morphological analyzer
-
gecliht
A disparate collection of text manipulation and formatting algorithms
-
tagsearch
Filter plaintext files based on @keyword tags
-
latex
An ergonomic library for programmatically generating LaTeX documents and reports
-
whitespace_text_steganography
A steganography strategy that uses whitespace to hide text in other text
-
encoding_c
C API for encoding_rs
-
unescape
Unescapes strings with escape sequences written out as literal characters
-
uniaxe
A Rust crate to replace Unicode letters with Ascii equivalents
-
termbook
The library behind the termbook-cli
-
ogrep
Tool for searching in indentation-structured texts
-
maybe_utf8
Byte container optionally encoded as UTF-8
-
swappy
An anagram generator
-
change-case
Transform a string between camelCase, PascalCase, Capital Case, snake_case, param-case, CONSTANT_CASE and others
-
color-convert
Converts between RGB, RGBA, HEX, HSL, HSLA, HSV, and CMYK; written in Rust
-
koelner-phonetik
koelner_phonetik, or Cologne phonetics, is a phonetic algorithm like Soundex, but specialized for German words
-
csr
csr is a library for encrypting and decrypting many types using caesar cipher. This shouldn’t be used for any serious security but was more of an exercise in generics and creating a rust crate.
-
fountain
Parser and formatter for the Fountain screenplay markup language
-
glyph_brush_layout
Text layout for ab_glyph
-
halfcaps
tRaNslAtE aNy TeXt To ThIs
-
utf8_slice
Lightweight UTF8 Slice Utilities
-
titlecase
A tool and library that capitalizes text according to a style defined by John Gruber for post titles on his website Daring Fireball
-
mdbook-epub
An EPUB renderer for mdbook
-
rusty_word_builder
Syllable and Word generation library written fully in Rust
-
unicode_names
Map characters to and from their name given in the Unicode standard. This goes to great lengths to be as efficient as possible in both time and space, with the full bidirectional tables weighing barely 500 KB…
-
cabocha
Safe Rust wrapper for CaboCha, a Japanese language dependency structure analyzer library
-
unicode-canvas
A library for creating text-based drawings
-
bgrep
bgrep is a grep tailored to handle binary patterns and files
-
pdftotext
High-level library that binds to Poppler to extract text from a PDF
-
inflections
High performance inflection transformation library for changing properties of words like the case
-
html2runes
An HTML to Text converter
-
safe-regex-compiler
Regex compiler for the safe-regex crate
-
strcursor
Provides a string cursor type for seeking through a string whilst respecting grapheme cluster and code point boundaries
-
mdbook-open-git-repo
mdbook preprocessor to add an open-on-git-repo link on every page
-
convert_encoding
Convert encoding of text files in batch
-
cistring
A string type that preserves case, but compares case-insensitively
-
yeslogic-unicode-script
Fast lookup of the Unicode Script property
-
resast
Rusty-ECMAScript Abstract Syntax Tree
-
lindera-ipadic-neologd-builder
A Japanese morphological dictionary builder for IPADIC NEologd
-
const-str-proc-macro
compile-time string operations
-
sana_derive
The derive macro for Sana
-
aki-mline
match line, regex text filter like grep
-
textwrap-macros-impl
Simple procedural macros to use textwrap utilities at compile time
-
ascii-hangman-backend
customizable Hangman game with ASCII-art rewarding for children (backend)
-
ccase
Command line interface to convert strings into any case
-
unic-idna-mapping
UNIC — IDNA — IDNA Mapping Table
-
words-count
Count the words and characters, with or without whitespaces
-
unicode-general-category
Fast lookup of the Unicode General Category property for char
-
ucd-parse
A library for parsing data files in the Unicode character database
-
lingua-maori-language-model
The Māori language model for Lingua, an accurate natural language detection library
-
chinese_segmenter
Tokenize Chinese sentences using a dictionary-driven largest first matching approach
-
lindera-ko-dic-builder
A Korean morphological dictionary builder for ko-dic
-
markdown-heading-id
Filter for pulldown-cmark which converts headings with custom ID
-
skribo
A Rust library for low-level text layout
-
aki-gsub
substitute text command
-
yeslogic-ucd-parse
A library for parsing data files in the Unicode character database
-
encoding_c_mem
C API for encoding_rs::mem
-
unic-ucd-name
UNIC — Unicode Character Database — Name
-
sana_core
The core of Sana
-
yeslogic-fontconfig
RENAMED: use the fontconfig crate instead
-
lindera-dictionary
A morphological dictionary loader
-
gfx_glyph
Fast GPU cached text rendering using gfx-rs & ab_glyph
-
aki-xcat
concatenate files that are plain, gzip, xz and zstd
-
lingua-albanian-language-model
The Albanian language model for Lingua, an accurate natural language detection library
-
unic-common
UNIC — Common Utilities
-
lindera-unidic-builder
A Japanese morphological dictionary builder for UniDic
-
glyph-names
Mapping of characters to glyph names according to the Adobe Glyph List Specification
-
suffix_tree
Suffix trees
-
recode_rs
Command-line tool for converting between the character encodings defined in the Encoding Standard
-
unic-char-property
UNIC — Unicode Character Tools — Character Property taxonomy, contracts and build macros
-
aki-xtee
copy standard input to each file and standard output
-
yeslogic-fontconfig-sys
Raw bindings to Fontconfig without a vendored C library
-
lingua-tagalog-language-model
The Tagalog language model for Lingua, an accurate natural language detection library
-
lindera-core
A morphological analysis core library
-
unic-cli
UNIC Command-Line Tools
-
aki-unbody
output first or last n lines, like the Linux head and tail commands
-
lindera-ipadic
A Japanese morphological dictionary loader for IPADIC
-
lingua-serbian-language-model
The Serbian language model for Lingua, an accurate natural language detection library
-
chardetng_c
C bindings for chardetng
-
aki-resort
sort lines of text. You can use regex to specify the KEY.
-
lingua-ukrainian-language-model
The Ukrainian language model for Lingua, an accurate natural language detection library
-
unic-emoji
UNIC — Unicode Emoji
-
unic-ucd-core
UNIC - Unicode Character Database - Version
-
lingua-slovene-language-model
The Slovene language model for Lingua, an accurate natural language detection library
-
lingua-swahili-language-model
The Swahili language model for Lingua, an accurate natural language detection library
-
lingua-chinese-language-model
The Chinese language model for Lingua, an accurate natural language detection library
-
lingua-vietnamese-language-model
The Vietnamese language model for Lingua, an accurate natural language detection library
-
lingua-swedish-language-model
The Swedish language model for Lingua, an accurate natural language detection library
-
lingua-yoruba-language-model
The Yoruba language model for Lingua, an accurate natural language detection library
-
lingua-zulu-language-model
The Zulu language model for Lingua, an accurate natural language detection library
-
lingua-shona-language-model
The Shona language model for Lingua, an accurate natural language detection library
-
lingua-tamil-language-model
The Tamil language model for Lingua, an accurate natural language detection library
-
lingua-sotho-language-model
The Sotho language model for Lingua, an accurate natural language detection library
-
lingua-tsonga-language-model
The Tsonga language model for Lingua, an accurate natural language detection library
-
lingua-somali-language-model
The Somali language model for Lingua, an accurate natural language detection library