29 stable releases (4 major)
new 8.1.2 | Nov 20, 2024 |
---|---|
8.0.0 | Nov 4, 2024 |
6.2.1 | Oct 7, 2024 |
6.0.0 | Sep 22, 2024 |
3.1.1 | Sep 12, 2024 |
#140 in Encoding
162 downloads per month
115KB
2.5K
SLoC
merde
merde
aims to provide a simpler, lighter alternative to serde,
that might run slower, but compiles faster.
This is the "hub" crate, which re-exports all types from merde_core,
including Value
, Array
, and [Map
], and provides a declarative derive
macro that helps implement
traits like Deserialize
, IntoStatic
and Serialize
.
Support matrix
Format | Deserialize | Serialize | Production-ready? |
---|---|---|---|
JSON | 👍 | 👍 | lol no — please no |
YAML | 👍 | 🙅♀️ | still experimenting |
MsgPack | 👍 | 🙅♀️ | here be dragons etc. |
From serde
to merde
Deriving impls, serializing, deserializing
serde
has its own Serialize
and Deserialize
traits, which you can derive
with, well, derive macros:
use serde::{Serialize, Deserialize};
#[derive(Serialize, Deserialize, Debug)]
struct Point {
x: i32,
y: i32,
}
fn main() {
let point = Point { x: 1, y: 2 };
let serialized = serde_json::to_string(&point).unwrap();
println!("serialized = {}", serialized);
let deserialized: Point = serde_json::from_str(&serialized).unwrap();
println!("deserialized = {:?}", deserialized);
}
By contrast, merde
provides declarative macros — impls for traits
like Deserialize
, Serialize
can be generated with merde::derive!
:
#[derive(Debug)]
struct Point {
x: i32,
y: i32,
}
merde::derive! {
impl (Deserialize, Serialize) for struct Point { x, y }
}
fn main() {
let point = Point { x: 1, y: 2 };
// note: `merde_json` is re-exported as `merde::json` if `merde`'s `json` feature is enabled
let serialized = merde::json::to_string(&point).unwrap();
println!("serialized = {}", serialized);
let deserialized: Point = merde::json::from_str(&serialized).unwrap();
println!("deserialized = {:?}", deserialized);
}
This approach is less flexible, but because there's no proc-macro involved, or re-parsing of the struct definitions by the proc macro, it builds faster.
Copy-on-write types
Picture this: a large JSON documents, with large strings, that don't use escape sequences.
Instead of allocating a separate String
on the heap for each of these, serde
lets you
borrow from the input string, either automatically when you use a type like &str
:
use serde::{Serialize, Deserialize};
#[derive(Serialize, Deserialize, Debug)]
struct Name<'s> {
first: &'s str,
middle: &'s str,
last: &'s str,
}
// etc.
Or explicitly when you use a copy-on-write type like Cow<'s, str>
:
use serde::{Serialize, Deserialize};
use std::borrow::Cow;
#[derive(Serialize, Deserialize, Debug)]
struct Name<'s> {
#[serde(borrow)]
first: Cow<'s, str>,
#[serde(borrow)]
middle: Cow<'s, str>,
#[serde(borrow)]
last: Cow<'s, str>,
}
serde
is really flexible here, letting you have types with multiple lifetimes, not
all of which are related to the input string.
merde
only handles "simple" cases: structs without a lifetime parameter
are the simplest, since they're always owned / 'static
(by definition):
#[derive(Debug)]
struct Name {
first: String,
middle: String,
last: String,
}
merde::derive! {
impl (Deserialize, Serialize) for struct Name { first, middle, last }
}
But, as a treat, structs passed to merde::derive!
can have exactly one lifetime
parameter, so that you may use copy-on-write types, like merde's own CowStr
:
use merde::CowStr;
#[derive(Debug)]
struct Name<'s> {
first: CowStr<'s>,
middle: CowStr<'s>,
last: CowStr<'s>,
}
merde::derive! {
// 👇
impl (Deserialize, Serialize) for struct Name<'s> { first, middle, last }
}
Note that in the merde::derive!
invocation, we have to give it the lifetime parameter's
name — this ends up generating different code, that can borrow from the input.
Although you may use Cow<'s, str>
merde
recommends CowStr<'s>
type,
which dereferences to &str
like you'd expect, but instead of using
String
as its owned type, it uses
compact_str::CompactString
,
which stores strings of up to 24 bytes inline!
As of merde v5, "transparent" tuple structs are supported (aka "newtypes"):
use merde::CowStr;
#[derive(Debug)]
struct Wrapper<'s>(CowStr<'s>);
merde::derive! {
impl (Deserialize, Serialize)
for struct Wrapper<'s> transparent
}
fn main() {
let input = r#"["Hello, World!"]"#;
let wrapper: Vec<Wrapper> = merde::json::from_str(input).unwrap();
println!("Wrapped value: {:?}", wrapper);
}
Enums are also supported, only externally-tagged ones for now, and you need to specify field names and variant names:
#[derive(Debug)]
enum TestEvent {
MouseUp(MouseUp),
MouseDown(MouseDown),
}
merde::derive! {
impl (Serialize, Deserialize) for enum TestEvent
externally_tagged {
"mouseup" => MouseUp,
"mousedown" => MouseDown,
}
}
#[derive(Debug, PartialEq, Eq)]
struct MouseUp {
x: i32,
y: i32,
}
merde::derive! {
impl (Serialize, Deserialize) for struct MouseUp {
x,
y
}
}
#[derive(Debug, PartialEq, Eq)]
struct MouseDown {
x: i32,
y: i32,
}
merde::derive! {
impl (Serialize, Deserialize) for struct MouseDown {
x,
y
}
}
fn main() {
let input = r#"{"mouseup": {"x": 100, "y": 200}}"#;
let event: TestEvent = merde::json::from_str(input).unwrap();
println!("TestEvent: {:?}", event);
}
"string-like" enums are also supported, like so:
#[derive(Debug)]
enum Emergency {
Cuddle,
Smoothie,
Naptime,
Playtime,
}
merde::derive! {
impl (Serialize, Deserialize) for enum Emergency
string_like {
"cuddle" => Cuddle,
"smoothie" => Smoothie,
"naptime" => Naptime,
"playtime" => Playtime,
}
}
fn main() {
let input = r#"["cuddle", "smoothie", "playtime"]"#;
let emergencies: Vec<Emergency> = merde::json::from_str(input).unwrap();
println!("Emergencies: {:?}", emergencies);
}
Interlude: why not &'s str
?
You'll notice that Deserialize
is not implemented for &'s str
, ie.
this code does not compile:
#[derive(Debug)]
struct Name<'s> {
first: &'s str,
middle: &'s str,
last: &'s str,
}
merde::derive! {
impl (Deserialize, Serialize) for struct Nam<'s> { first, middle, last }
}
error[E0277]: the trait bound `&str: Deserialize<'_>` is not satisfied
--> merde/src/lib.rs:183:1
|
12 | / merde::derive! {
13 | | impl (Deserialize, Serialize) for struct Name<'s> { first, middle, last }
14 | | }
| |_^ the trait `Deserialize<'_>` is not implemented for `&str`
That's because it's not always possible to borrow from the input.
This JSON input would be fine:
{
"first": "Jane",
"middle": "Panic",
"last": "Smith"
}
But this JSON input would not:
{
"first": "Jane",
"middle": "\"The Rock\"",
"last": "Smith",
}
We could borrow "The Rock
, but then we'd have a problem: the next actual character is a double-quote,
but the next character from the input is the backslash (\
) used to escape the double-quote.
Such a string will end up being owned in a CowStr
:
use merde::{CowStr, Deserialize};
fn main() {
let input = r#"
["\"The Rock\""]
"#;
let v: Vec<CowStr<'_>> = merde::json::from_str(input).unwrap();
assert!(matches!(v.first().unwrap(), CowStr::Owned(_)));
}
Whereas something without escape sequences will end up being borrowed:
use merde::{CowStr, Deserialize};
fn main() {
let input = r#"
["Joever"]
"#;
let v: Vec<CowStr<'_>> = merde::json::from_str(input).unwrap();
assert!(matches!(v.first().unwrap(), CowStr::Borrowed(_)));
}
All this is pretty JSON-specific, but you get the idea.
Returning something you've serialized
Borrowing from the input (to avoid allocations and copies, in case you missed the memo) is all fun and games until you need to move something around, for example, returning it from a function.
use merde::CowStr;
struct Message<'s> {
kind: u8,
payload: CowStr<'s>,
}
merde::derive! {
impl (Deserialize, Serialize)
for struct Message<'s> { kind, payload }
}
// well this is already fishy, where does the `'s` come from?
fn recv_and_deserialize<'s>() -> Message<'s> {
let s: String = {
// pretend this reads from the network instead, or something
r#"{
"kind": 42,
"payload": "hello"
}"#.to_owned()
};
let message: Message = merde::json::from_str(&s).unwrap();
message
}
fn main() {
let _msg = recv_and_deserialize();
}
This fails to build with:
error[E0515]: cannot return value referencing local variable `s`
--> merde/src/lib.rs:366:9
|
365 | let message: Message = merde::json::from_str(&s).unwrap();
| -- `s` is borrowed here
366 | message
| ^^^^^^^ returns a value referencing data owned by the current function
That's where the IntoStatic
trait comes in! Which you get for free when "deriving" Deserialize
!
use merde::IntoStatic;
use merde::CowStr;
struct Message<'s> {
kind: u8,
payload: CowStr<'s>,
}
merde::derive! {
impl (Deserialize, Serialize)
for struct Message<'s> { kind, payload }
}
fn recv_and_deserialize() -> Message<'static> {
let s: String = {
// pretend this reads from the network instead, or something
r#"{
"kind": 42,
"payload": "hello"
}"#.to_owned()
};
let message: Message = merde::json::from_str(&s).unwrap();
message.into_static()
}
fn main() {
let _msg = recv_and_deserialize();
}
Et voilà! ✨
There might be something smarter to do based on the yoke crate for example, but for now, allocations it is.
Third-party types
Some crates don't have a merde
features. In fact, at the time of this writing,
no crates at all do. merde
is still moving fast (despite major features), so
in fact, I would encourage crate authors not to ship a merde
feature yet, it
would just create frustrating churn.
serde
lets you work around that by specifying a function that should be used to
deserialize some field:
use serde::{Serialize, Deserialize};
use time::OffsetDateTime;
#[derive(Serialize, Deserialize)]
struct Person {
name: String,
#[serde(with = "time::serde::rfc3339")]
birth: OffsetDateTime,
}
Which solves two problems at once:
- the crate may not know about serde at all (not true in this case, time does have a serde feature)
- there might be several ways to serialize/deserialize something (RFC-3339, ISO-8601, and many others etc.)
merde
solves both of these with wrapper types:
use merde::time::OffsetDateTime; // re-export from the time crate
use merde::CowStr;
use merde::time::Rfc3339;
#[derive(Debug)]
struct Person<'s> {
name: CowStr<'s>,
birth: Rfc3339<OffsetDateTime>,
}
merde::derive! {
impl (Deserialize, Serialize) for struct Person<'s> { name, birth }
}
fn main() {
let input = r#"
{
"name": "Jane Smith",
"birth": "1990-01-01T00:00:00Z"
}
"#;
let person: Person = merde::json::from_str(input).unwrap();
println!("person = {:?}", person);
}
You can of course make your own newtype wrappers to control how a field gets deserialized.
Conditional compilation
(As of merde 3.1), you never need to add cfg
gates to conditionally invoke the merde::derive!
macro, because, with default features disabled, merde
has zero dependencies.
There's two main ways to be conservative with the amount of generated code / the amount of dependencies pulled with merde.
Approach 1: "core" by default, "deserialize" on demand
Your manifest could look like this:
# in `Cargo.toml`
[dependencies]
merde = { version = "4.0.0", default-features = false, features = ["core"] }
And then you'd be able to use merde_provided types, like CowStr
:
use merde::CowStr;
#[derive(Debug)]
struct Person<'s> {
name: CowStr<'s>,
age: u8, // sorry 256-year-olds
}
merde::derive! {
impl (Deserialize, Serialize) for struct Person<'s> { name, age }
}
And the impl
blocks for Deserialize
, and Serialize
wouldn't actually
be generated unless crates downstream of yours enable merde/deserialize
or merde/json
.
Approach 2: zero-deps
If your manifest looks more like this:
# in `Cargo.toml`
[dependencies]
merde = { version = "4.0.0", default-features = false }
[features]
default = []
merde = ["merde/core"]
...with no merde
features enabled by default at all, then you have to stay
away from merde types, or use substitutes, for example, you could switch
CowStr<'s>
with std::borrow::Cow<'s, str>
and get largely the same API:
#[cfg(feature = "merde")]
use merde::CowStr;
#[cfg(not(feature = "merde"))]
pub type CowStr<'s> = std::borrow::Cow<'s, str>;
#[derive(Debug)]
pub struct Person<'s> {
name: CowStr<'s>,
age: u8, // sorry 256-year-olds
}
merde::derive! {
impl (Deserialize, Serialize) for struct Person<'s> { name, age }
}
(But not the same ABI! Careful if you use this in conjunction with something like rubicon).
With that configuration, users of your crate will only have to pay for downloading
merde
and evaluating a few derive!
macros which will produce empty token trees —
no extra dependencies, barely any extra build time.
See zerodeps-example
in the merde repository
for a demonstration:
❯ cargo tree --prefix none
zerodeps-example v0.1.0 (/Users/amos/bearcove/merde/zerodeps-example)
merde v3.0.0 (/Users/amos/bearcove/merde/merde)
❯ cargo tree --prefix none --features 'merde'
zerodeps-example v0.1.0 (/Users/amos/bearcove/merde/zerodeps-example)
merde v3.0.0 (/Users/amos/bearcove/merde/merde)
merde_core v3.0.0 (/Users/amos/bearcove/merde/merde_core)
compact_str v0.8.0
castaway v0.2.3
rustversion v1.0.17 (proc-macro)
cfg-if v1.0.0
itoa v1.0.11
rustversion v1.0.17 (proc-macro)
ryu v1.0.18
static_assertions v1.1.0
Dependencies
~0–4MB
~75K SLoC