2 unstable releases
0.2.0 | Aug 12, 2024 |
---|---|
0.1.0 | Aug 10, 2024 |
#994 in Parser implementations
22 downloads per month
44KB
1K
SLoC
rsedn
rsedn
is a crate that implements a subset(atm) of Extensible Data Notation
Supported Syntax
-
( )
lists -
[ ]
vectors -
{ }
maps -
#{ }
sets -
symbols
(the full edn symbol specification) -
:keywords
-
#user/tags
-
#_discard
(not being discarted, just parsed) - Integers (unsupported arbitrary precision integers)
- Floats
- Boolean
-
nil
- Strings (unsupported
\uNNNN
unicode sequences) - Characters
- Built-in tagged elements (
#inst
and#uuid
) - Comments
Usage
rsedn
usage is aplit into 4 steps:
- Build a
Source
from a&str
(usersedn::source_from_str
) - Lex
Source
to produce aVec<Lexeme>
(usersedn::lex_source
) - Parse each
Lexeme
to produce aToken
(usersedn::parse_lexeme
) - Create a
TokenStream
(useLinkedList::iter
) and consume it to produce aForm
(usersedn::consume_token_stream
)
Concepts
Source
A wrapper around the source code, we always refer to source code as &'source str
. It can be latter used to get the span (the actual text) of some Lexeme
Lexeme
Stores the coordinates of a piece of meaningful source code (just coordinates, no text), it doens't classifies it, just knows that the given piece of text may have a meaning.
For instance: (println)
has 3 lexemes: (
, println
, and )
, (def var 5)
has 5 lexemes: (
, def
, var
, 5
and )
Token
A wrapper around a lexeme that stores the span and what kind of token it is. It classifies by reading the span and checking the syntax of the corresponding piece of source code.
Producing a Token
may produce a TokenizationError
when the lexeme isn't syntatically right.
Form
The final step, it's built by one or more tokens and represents an edn
form: a list, a vector, a symbol, etc.
Almost no manipulation is done with the source text, except the parsing of text into values like: i64
, f64
, bool
and String
for the corresponding edn
forms.
Forms are what you may use out of this library.
Producing forms may produce a ParsingError
when the tokens in the token stream aren't the expected ones.
Dependencies
~1.5MB
~22K SLoC