14 releases (5 breaking)
0.6.3 | Dec 10, 2024 |
---|---|
0.6.2 | Dec 10, 2024 |
0.5.0 | Dec 10, 2024 |
0.4.2 | Dec 10, 2024 |
0.1.2 | Dec 7, 2024 |
#523 in Rust patterns
455 downloads per month
20KB
175 lines
Parsey
parsey
is a lightweight framework for creating custom parsers and abstract syntax trees (ASTs).
It provides two key traits: Parser
and Ast
, which together form the foundation
for building parsers and representing the structure of parsed data.
Key Features
- Generic Parsing Framework: Abstracts the process of parsing tokens into structured data.
- Customizable AST Nodes: Easily define nodes of your AST by implementing the
Ast
trait.
Getting Started
Let's implement a simple parser that parses a stream of zero and one tokens into groups of two bits!
Step 1: Implement the Parser
Trait
Define a struct that will serve as your parser. This struct must implement the Parser
trait,
which iterates over tokens and produces an AST.
use parsey::{parse, require_next_n, Ast, Parser, TokenStream};
#[derive(Debug, PartialEq)]
pub enum MyToken {
Zero,
One,
}
#[derive(Debug, PartialEq)]
pub struct MyError;
pub struct MyParser {
tokens: Vec<MyToken>,
}
impl Parser<MyToken, MyError> for MyParser {
type Root = Root;
fn expect(
token_stream: &mut TokenStream<Self, MyToken, MyError>,
token: MyToken,
) -> Result<(), MyError> {
if token_stream.peek() == Some(&token) {
token_stream.next();
Ok(())
} else {
Err(MyError)
}
}
}
impl Iterator for MyParser {
type Item = MyToken;
fn next(&mut self) -> Option<Self::Item> {
self.tokens.pop()
}
}
impl From<Vec<MyToken>> for MyParser {
fn from(mut value: Vec<MyToken>) -> Self {
value.reverse();
Self { tokens: value }
}
}
Step 2: Define the AST Nodes
Create the structure for your AST by implementing the Ast
trait for each node.
The root node must match the type defined in Parser::Root
.
#[derive(Debug, PartialEq)]
pub struct Root(Vec<TwoBit>);
#[derive(Debug, PartialEq)]
pub enum TwoBit {
ZeroZero,
ZeroOne,
OneZero,
OneOne,
}
impl Ast<MyToken, MyError> for Root {
fn parse<P>(token_stream: &mut TokenStream<P, MyToken, MyError>) -> Result<Self, MyError>
where
P: Parser<MyToken, MyError>,
{
let mut two_bits = vec![];
while !token_stream.is_empty() {
two_bits.push(TwoBit::parse(token_stream)?);
}
Ok(Self(two_bits))
}
}
impl Ast<MyToken, MyError> for TwoBit {
fn parse<P>(token_stream: &mut TokenStream<P, MyToken, MyError>) -> Result<Self, MyError>
where
P: parsey::Parser<MyToken, MyError>,
{
use MyToken::*;
use TwoBit::*;
match require_next_n!(token_stream, 2, MyError) {
[Zero, Zero] => Ok(ZeroZero),
[Zero, One] => Ok(ZeroOne),
[One, Zero] => Ok(OneZero),
[One, One] => Ok(OneOne),
}
}
}
Step 3: Parse Tokens
Use your parser to parse a sequence of tokens into an AST.
use MyToken::{One, Zero};
use TwoBit::{OneOne, OneZero, ZeroOne, ZeroZero};
let tokens = vec![Zero, Zero, Zero, One, One, Zero, One, One];
let ast = parse::<MyParser, MyToken, MyError>(tokens);
assert_eq!(ast, Ok(Root(vec![ZeroZero, ZeroOne, OneZero, OneOne])));
Dependencies
~185–610KB
~15K SLoC