#logging #logstash #grok #etl #elastic

grok-rs

Rust port of elastic Grok processor

4 releases

0.1.3 Jun 9, 2024
0.1.2 Jun 9, 2024
0.1.1 Jun 7, 2024
0.1.0 Jun 7, 2024

#1 in #logstash

Download history 9/week @ 2024-12-08 1/week @ 2024-12-15 5/week @ 2024-12-22 1/week @ 2025-01-05 17/week @ 2025-01-12 47/week @ 2025-01-19 7/week @ 2025-01-26 1/week @ 2025-02-02 10/week @ 2025-02-16 19/week @ 2025-02-23 15/week @ 2025-03-02 55/week @ 2025-03-09 26/week @ 2025-03-16

116 downloads per month

Apache-2.0

39KB
745 lines

Build Status Version codecov

grok_rs

the grok_rs is a rust port of Elastic Grok processor, inspired by grok-go and grok

Usage

[dependencies]
grok-rs = "0.1.3"

Example

Only with default patterns

let grok = Grok::default();
let pattern = grok
    // USERNAME are defined in grok-patterns
    .compile("%{USERNAME}", false)
    .unwrap();
let result = pattern.parse("admin admin@example.com").unwrap();
println!("{:#?}", result);

the output is:

{
    "USERNAME": String(
        "admin",
    ),
}

With user-defined patterns

let mut grok = Grok::default();
grok.add_pattern("NAME", r"[A-z0-9._-]+");
let pattern = grok.compile("%{NAME}", false).unwrap();
let result = pattern.parse("admin").unwrap();
println!("{:#?}", result);

the output is:

{
    "NAME": String(
        "admin",
    ),
}

With named_capture_only is true

let grok = Grok::default();
let pattern = grok
    .compile("%{USERNAME} %{EMAILADDRESS:email}", true)
    .unwrap();
let result = pattern.parse("admin admin@example.com").unwrap();
println!("{:#?}", result);

the output is:

{
    "email": String(
        "admin@example.com",
    ),
}

With type

let mut grok = Grok::default();
grok.add_pattern("NUMBER", r"\d+");

let pattern = grok.compile("%{NUMBER:digit:int}", false).unwrap();
let result = pattern.parse("hello 123").unwrap();
println!("{:#?}", result);

the output is:

{
    "digit": Int(
        123,
    ),
}

Notice

grok_rs is based on regex crate, so lacks several features that are not known how to implement efficiently. This includes, but is not limited to, look-around and backreferences. In exchange, all regex searches in this crate have worst case O(m * n) time complexity, where m is proportional to the size of the regex and n is proportional to the size of the string being searched.

Elastic Grok compliance

This crate declares compatible with elastic grok patterns v8.14.0, which is tagged at 2024-06-05.

Dependencies

~2.3–3.5MB
~56K SLoC