#regex #csv #pattern #data-processing #partial #matches #file

csv_coincidence

Tool designed to efficiently search for and identify specific patterns within CSV files

1 unstable release

0.1.1 Nov 2, 2023
0.1.0 Nov 1, 2023

#8 in #matches

MIT/Apache

12KB
116 lines

csv_coincidence

Often in the realm of data processing, CSV files are used to store tabular data, and it is important to be able to efficiently search and analyze that data this is the motivation behind the csv_coincidence that is a library focused on the searches for partial matches in a CSV file using a customizable regular expression. This function is used to process CSV files and search for partial matches within the text strings found in the file.

Features

  • Finds partial matches in the CSV file based on the given regular expression pattern.
  • Counts the number of occurrences of a specific pattern in the CSV file.
  • Merges the records in a CSV file that matches a specific pattern and replaces those matches.

Usage

use csv_coincidence::find_partial_matches;

fn main() -> Result<(), Box<dyn Error>> {
    let file_path = "example.csv";  // Replace with the path of your CSV file
    let regex_pattern = r"^[A-Z][a-z]*";  // Replace with the regular expression

    match find_partial_matches(file_path, regex_pattern) {
        Ok(matches) => {
            println!("Partial Matches:");
            for match_str in matches {
                println!("{}", match_str);
            }
        }
        Err(err) => {
            eprintln!("Error: {}", err);
        }
    }

    Ok(())
}
use csv_coincidence::merge_coincidence;

fn main() -> Result<(), Box<dyn Error>> {
    let file_path = "example.csv";  // Replace with the path of your CSV file
    let regex_pattern = r"^[A-Z][a-z]*";  // Replace with the regular expression

    match merge_coincidence(file_path, regex_pattern) {
        Ok(merged_data) => println!("Merge concidences:\n{}", merged_data),
        Err(e) => eprintln!("Error: {}", e),
    }
}

License

This project is licensed under the MIT license.

Dependencies

~3.5–4.5MB
~71K SLoC