6 releases

0.1.6 Sep 8, 2023
0.1.5 Apr 12, 2023
0.1.4 Nov 4, 2022
0.1.3 Mar 19, 2022
0.1.1 Aug 14, 2020

#265 in Text processing

Download history 589/week @ 2024-07-26 1017/week @ 2024-08-02 535/week @ 2024-08-09 435/week @ 2024-08-16 379/week @ 2024-08-23 933/week @ 2024-08-30 898/week @ 2024-09-06 508/week @ 2024-09-13 517/week @ 2024-09-20 388/week @ 2024-09-27 539/week @ 2024-10-04 302/week @ 2024-10-11 323/week @ 2024-10-18 343/week @ 2024-10-25 399/week @ 2024-11-01 434/week @ 2024-11-08

1,593 downloads per month
Used in 8 crates (5 directly)

MIT license

10KB
160 lines

Words Count

CI

Count the words and characters, with or without whitespaces.

The algorithm is roughly aligned with the way LibreOffice is counting words. This means that it does not exactly match the Unicode Text Segmentation standard.

Examples

use words_count::WordsCount;

assert_eq!(WordsCount {
    words: 20,
    characters: 31,
    whitespaces: 2,
    cjk: 18,
}, words_count::count("Rust是由 Mozilla 主導開發的通用、編譯型程式語言。"));
let result = words_count::count_separately("apple banana apple");

assert_eq!(2, result.len());
assert_eq!(Some(&2), result.get("apple"));

Crates.io

https://crates.io/crates/words-count

Documentation

https://docs.rs/words-count

License

MIT


lib.rs:

Words Count

Count the words and characters, with or without whitespaces.

The algorithm is roughly aligned with the way LibreOffice is counting words. This means that it does not exactly match the Unicode Text Segmentation standard.

Examples

use words_count::WordsCount;

assert_eq!(WordsCount {
words: 20,
characters: 31,
whitespaces: 2,
cjk: 18,
}, words_count::count("Rust是由 Mozilla 主導開發的通用、編譯型程式語言。"));
let result = words_count::count_separately("apple banana apple");

assert_eq!(2, result.len());
assert_eq!(Some(&2), result.get("apple"));

Dependencies

~68KB