#sensitive #multiple #data-source #word #filtering #algorithm #loading

rust_sensitive

Sensitive word filtering, support multiple data source loading, multiple filtering algorithms, multiple operation functions

3 stable releases

1.1.0 May 12, 2024
1.0.1 May 11, 2024
1.0.0 May 10, 2024

#253 in Algorithms

Download history 351/week @ 2024-05-06 56/week @ 2024-05-13

407 downloads per month

MIT license

17KB
310 lines

🚫 rust-sensitive

build

English | δΈ­ζ–‡

Sensitive word filtering, support multiple data source loading, multiple filtering algorithms, multiple operation functions

🌟 Feature

  • Supports a wide range of operating functions
    • filter() returns the filtered text.
    • replace() Returns the text after replacing sensitive words.
    • is_sensitive() Returns whether the text contains sensitive words.
    • find_one() Returns the first sensitive word matched.
    • find_all() returns all the sensitive words matched.
  • Support multiple data sources loading
    • βœ… Support memory storage
    • πŸ”² Support mysql storage
    • πŸ”² Support mongo storage
  • Support multiple filtering algorithms
    • βœ… DFA Use HashMap to match sensitive words.
    • βœ… AC automated machine

βš™ Usage

first use

    use rust_sensitive::model::DfaSensitiveWordMap;

    fn test(){
        // Initialize the sensitive_map
        let map =DfaSensitiveWordMap::init_dfa_dic_from_file("./data.txt");
        /* because use once_cell,you can use 
         DfaSensitiveWordMap::get_dfa_dic() 
         to get the sensitive_map 
        */

        /*
        if use ac algorithm,you can use
        use rust_sensitive::model::AcSensitiveWordMap;

        let map =AcSensitiveWordMap::init_ac_dic_from_file("./data.txt"); 
        and because use once_call,you can also use
        AcSensitiveWordMap::get_ac_dic()
        to get the sensitive_map
        */
    }
    fn test2(){
        let map = DfaSensitiveWordMap::get_dfa_dic();
        // let map = AcSensitiveWordMap::get_ac_dic();
    }

filter()

    fn filter(&self, text_copy: &str) -> String
    // "hello" is sensitive word
    fn test(){
        let map = DfaSensitiveWordMap::get_dfa_dic();
        let result = map.filter("hello,world!");
        println!("{}",result); //",world"

    }

replace()

    replace(&self, text_copy: &str, repl: &str) -> String
    // "hello" is sensitive word
    fn test(){
        let map = DfaSensitiveWordMap::get_dfa_dic();
        let result = map.replace("hello,world!","*");
        // repl is like "!"、"*".a single word
        println!("{}",result); //"*****,world"

    }

is_sensitive()

    fn is_sensitive(&self, text: &str) -> bool
    // "hello" is sensitive word
    fn test(){
        let map = DfaSensitiveWordMap::get_dfa_dic();
        let result = map.is_sensitive("hello,world!");
        println!("{}",result); //"true"

    }

find_one()

    fn find_one(&self, text: &str) -> String
    // "hello" is sensitive word
    fn test(){
        let map = DfaSensitiveWordMap::get_dfa_dic();
        let result = map.find_one("hello,world!");
        println!("{}",result); //"hello"

    }

find_all()

    find_all(&self, text: &str) -> Vec<String>
    // "hello" "world" are sensitive words
    fn test(){
        let map = DfaSensitiveWordMap::get_dfa_dic();
        let result = map.find_all("hello,world!");
        println!("{}",result); //"hello","world"

    }

βœ” Get

Run the following Cargo command in your project directory:

    cargo add rust_sensitive

Or add the following line to your Cargo.toml:

    [dependencies]
    rust_sensitive = "1.1.0"

πŸ“Œ TODO

  • add mongo data source support
  • add bloom algorithm

Dependencies

~625KB