4 releases
0.0.4 | Jun 18, 2024 |
---|---|
0.0.3 | Jun 14, 2024 |
0.0.2 | Jun 13, 2024 |
0.0.1 | Jun 12, 2024 |
#534 in Parser implementations
185KB
3.5K
SLoC
cbor-edn
Tools for processing CBOR Diagnostic Notation (EDN)
The parser used by this crate is a PEG (Parsing Expression Grammer) parser built from the ABNF used in the EDN specification.
The crate's main types represent not only the parsed items but also all the parts that have no bearing on the translation to CBOR (whitespace, comments, and optional trailing commas) and choices that may or may not influence the CBOR (encoding indicators). This allows detailed manipulation (for example inside comments) and a delayed processing of application oriented literals.
Parsed values are expected to round-trip to identical representations when serialized. Most manipulations of the values will ensure that their serialization output can also be round-tripped from the internal format to the EDN serialization and back into the internal format, but this can not be provided by all. (For example, removing all optional commas while retaining comments would make the previous distinction between whether a comment was before or after a comma indistinguishable).
Correct parsing does not guarantee that the value can also be encoded into CBOR. While there
are aspects that could be handled at parsing time and are not (eg. tag numbers exceeding the
encodable number space), there are cases that can not be handled by a library without further
context or privileges (eg. the e'' application oriented literal that needs application context,
or the ref'' application oriented literal that defers to relative files, accessing which can
involve file or network access). Consequentially, conversion to CBOR through the various
.to_cbor()
methods is inherently fallible.
Completeness
Known limitations are:
-
Support for inspecting and constructing CBOR items is incomplete. The most common types can be constructed; contructing or inspecting more exotic items is possible through parsing hand-crafted EDN/CBOR and using the generated serializations, respectively.
-
Options for attaching comments and space are limited; [
Item::set_comment()
] is currently the only way to get data in (apart from, again, parsing hand-crafted EDN). -
Indenting EDN works for the easy cases, but more exotic cases such as overflowing the limited width, long keys, or hash comments, easily disrupt the visual result.
Security
This library does not access network or file system in any surprising ways and does not
endanger memory safety on its own. The main threat in using it is not resource bound: even
without packed CBOR, heavy nesting can easily overflow the stack, and the float conversions are
costly in time. Unless resource usage per user is limited, it is recommended to limit untrusted
user input to the length of repeated {
characters that do not yet overflow the stack.
The crate has not been audited internally or externally. As the licenses state, the software is provided "as is".
CLI application
Some functionality is available through a binary included with this crate:
$ echo "[1, 2, 'x', ip'2001:db1::/64']" | cbor-edn diag2diag
[1, 2, 'x', ip'2001:db1::/64']
License: MIT OR Apache-2.0
Dependencies
~6–15MB
~267K SLoC