4 releases

0.2.2 Jan 10, 2021
0.2.1 Dec 30, 2020
0.2.0 Dec 28, 2020
0.1.0 Dec 27, 2020

#1697 in Text processing

WTFPL license

20KB
381 lines

MyStem Rust Wrapper

Rust wrapper for the Yandex MyStem 3.1 morpholocial analyzer of the Russian language.

System Requrements

The wrapper was tested on Ubuntu Linux 18.04+, Windows 10. Mystem binary should be accessible via PATH so manual installation is required. MyStem Web Site

A Quick Example

let mut instance = mystem::MyStem::new()?;
for stem in instance.stemming("Связался с лучшим - подохни как все.".into())? {
    println!(
        "'{}' most likely is a '{}' and lexeme is '{}'.",
        stem.text,
        stem.lex[0].grammem.part_of_speech,
        stem.lex[0].lex
    )
}

//'Связался' most likely is a 'Verb' and lexeme is 'связываться'.
//'с' most likely is a 'Preposition' and lexeme is 'с'.
//'лучшим' most likely is a 'Adjective' and lexeme is 'хороший'.
//'подохни' most likely is a 'Verb' and lexeme is 'подыхать'.
//'как' most likely is a 'Conjunction' and lexeme is 'как'.
//'все' most likely is a 'AdjectivePronoun' and lexeme is 'весь'.

Dependencies

~0.6–1.2MB
~23K SLoC