38 releases

new 0.13.9 Feb 4, 2025
0.13.8 Nov 7, 2024
0.13.7 Jun 14, 2024
0.13.4 Jan 4, 2024
0.1.1 Dec 18, 2018

#33 in Parser tooling

Download history 1838/week @ 2024-10-22 1954/week @ 2024-10-29 2014/week @ 2024-11-05 1755/week @ 2024-11-12 2156/week @ 2024-11-19 1839/week @ 2024-11-26 2873/week @ 2024-12-03 2777/week @ 2024-12-10 2122/week @ 2024-12-17 1981/week @ 2024-12-24 4553/week @ 2024-12-31 4785/week @ 2025-01-07 4985/week @ 2025-01-14 5126/week @ 2025-01-21 2081/week @ 2025-01-28 3776/week @ 2025-02-04

16,810 downloads per month
Used in 10 crates (8 directly)

Apache-2.0/MIT

505KB
12K SLoC

lrlex

lrlex is a partial replacement for lex / flex. It takes an input string and splits it into lexemes based on a .l file. Unfortunately, many real-world languages have corner cases which exceed the power that lrlex can provide. However, when it is suitable, it is a very convenient way of expressing lexing.

lrlex also has a simple command-line interface, allowing you to check whether your lexing rules are working as expected:

$ cat C.java
class C {
    int x = 0;
}
$ cargo run --lrlex java.l /tmp/C.java
    Finished dev [unoptimized + debuginfo] target(s) in 0.18s
     Running `target/debug/lrlex ../grammars/java7/java.l /tmp/C.java`
CLASS class
IDENTIFIER C
LBRACE {
INT int
IDENTIFIER x
EQ =
INTEGER_LITERAL 0
SEMICOLON ;
RBRACE }

lib.rs:

lrlex is a partial replacement for lex / flex. It takes in a .l file and statically compiles it to Rust code. The resulting [LRNonStreamingLexerDef] can then be given an input string, from which it instantiates an [LRNonStreamingLexer]. This provides an iterator which can produce the sequence of lrpar::Lexemes for that input, as well as answer basic queries about cfgrammar::Spans (e.g. extracting substrings, calculating line and column numbers).

Dependencies

~5–13MB
~157K SLoC