#spellcheck #spelling #cli

zspell

Native Rust library for spellchecking, with a command line interface

13 unstable releases (3 breaking)

0.3.3 Jan 1, 2023
0.3.2 Jan 1, 2023
0.3.1 Dec 30, 2022
0.2.2 Nov 4, 2022
0.0.1 Jul 21, 2022

#296 in Text processing


73 downloads per month

Custom license

275KB
3.5K SLoC

ZSpell

This project is a spellchecker written completely in Rust that maintains compatibility with the venerable Hunspell dictionary format. It is entirely native and does not rely on any other backends (Enchant, Hunspell, Aspell, etc.). The library also aims to be usable via WASM.
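
For readers unfamiliar with the Hunspell format, here is a minimal sketch of how the word-list (`.dic`) half of a dictionary can be read. It is an illustration only and not zspell's actual parser: the `DicEntry` type and the `parse_dic_line` and `parse_dic` functions are hypothetical names, and the sketch assumes single-character flags, no escaped slashes, and no spaces inside stems. The affix rules themselves live in the companion `.aff` file, which is not handled here.

```rust
/// One entry from a Hunspell-style `.dic` file (hypothetical type for this sketch).
#[derive(Debug, PartialEq)]
struct DicEntry {
    /// The stem as written in the word list.
    stem: String,
    /// Affix flags attached to the stem, e.g. `AB` in `work/AB`.
    flags: Vec<char>,
}

/// Parse a single `.dic` line of the form `stem` or `stem/FLAGS`.
/// Anything after the first whitespace (morphological fields) is ignored.
/// Assumes single-character flags and no escaped slashes.
fn parse_dic_line(line: &str) -> Option<DicEntry> {
    // Blank lines yield no token and therefore no entry.
    let token = line.split_whitespace().next()?;

    let (stem, flags): (&str, Vec<char>) = match token.split_once('/') {
        Some((stem, flags)) => (stem, flags.chars().collect()),
        None => (token, Vec::new()),
    };

    if stem.is_empty() {
        return None;
    }

    Some(DicEntry {
        stem: stem.to_string(),
        flags,
    })
}

/// Parse a whole `.dic` file. The first line holds an approximate entry
/// count, so it is skipped rather than treated as a word.
fn parse_dic(contents: &str) -> Vec<DicEntry> {
    contents.lines().skip(1).filter_map(parse_dic_line).collect()
}
```

Given the input `"3\nhello\ntry/B\nwork/AB po:verb\n"`, `parse_dic` yields three entries: `hello` with no flags, `try` with flag `B`, and `work` with flags `A` and `B`.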

The library side has a stabilized checker, but the suggestion API is not yet finalized. The CLI side is usable but not yet considered stable. See Feature Status for more information on what is available.


Interfaces

This project exposes multiple interfaces to its spellchecker, listed in this section.

CLI Interface

Just want to use this spellchecker from the command line? Check out the book at https://pluots.github.io/zspell/ for a more in-depth explanation of installation and usage.

If you don't want to read further, the easiest way to get started is to download a prebuilt binary from here: https://github.com/pluots/zspell/releases.

Rust Library Interface

This project also aims to provide a fully functional spellchecking library for easy programmatic use. See the library documentation at https://docs.rs/zspell/. It also includes a good deal of discussion on design methodology, for those who are interested.

Python Interface

There is a Python wrapper for this library with prebuilt wheels, available here: https://pypi.org/project/zspell/. Its source is located in the zspell-py crate.

Usage via WASM

The library API should work out of the box. Official WASM bindings will be added at some point.

Feature Status

Feature | Available via Library | Available via CLI | Tracking Issue
Basic spellcheck functionality | | |
Forbidden word handling | | | #17
Suggestions | | | #16
Compound word handling | | |
Full Morph/Phone Handling | | |
Python Interface | | | #18
Prebuilt WASM bindings | | | #19

Performance

This project prioritizes the most common usage, in which most of the words being checked are spelled correctly. With optimizations built around this assumption, and with modern computers able to hold entire compiled word lists in memory (~2 MiB), zspell tends to outperform other spellcheckers.
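
To illustrate the idea of optimizing for correct words, here is a minimal sketch that keeps a whole word list in an in-memory hash set. It is not zspell's actual implementation: `InMemoryChecker` and its methods are hypothetical names, and the sketch assumes the word list has already been fully expanded (no affix rules are applied at lookup time).

```rust
use std::collections::HashSet;

/// Hypothetical checker that holds the entire (already expanded) word list
/// in memory, stored in lowercase.
struct InMemoryChecker {
    words: HashSet<String>,
}

impl InMemoryChecker {
    /// Build the checker from any iterator of words.
    fn new<I, S>(words: I) -> Self
    where
        I: IntoIterator<Item = S>,
        S: AsRef<str>,
    {
        let words = words
            .into_iter()
            .map(|w| w.as_ref().to_lowercase())
            .collect();
        Self { words }
    }

    /// Check one word. A correctly spelled lowercase word, which is assumed
    /// to be the common case, is resolved by a single hash lookup with no
    /// allocation.
    fn check_word(&self, word: &str) -> bool {
        // Fast path: the word is present exactly as written.
        if self.words.contains(word) {
            return true;
        }
        // Slow path: retry in lowercase (allocates a new String).
        self.words.contains(&word.to_lowercase())
    }

    /// Return every word in `text` that is not in the list.
    fn misspelled<'a>(&self, text: &'a str) -> Vec<&'a str> {
        text.split(|c: char| !c.is_alphabetic() && c != '\'')
            .filter(|w| !w.is_empty())
            .filter(|w| !self.check_word(w))
            .collect()
    }
}
```

With a checker built from `["the", "quick", "brown", "fox"]`, calling `misspelled("The quikc brown fox.")` returns only `quikc`; the three correct words never reach any more expensive code path, which is the property the paragraph above relies on.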

License

See the LICENSE file for license information. The provided license does allow for proprietary use and adaptation; that being said, I kindly suggest that if you come up with an improvement, you submit a pull request and help us all out :)

Dictionary data license

The dictionaries provided in this repository for testing purposes have been obtained under license. These files have been sourced from here: https://github.com/wooorm/dictionaries

These dictionaries are licensed under various licenses, different from that of this project. Please see the applicable .license file within the dictionaries/ directory.

Dependencies

~3–9MB
~166K SLoC