#avro #serde #code-generation #command-line-tool #read-write

bin+lib rsgen-avro

Command line and library for generating Rust types from Avro schemas

49 releases

0.15.3 Oct 27, 2024
0.15.1 Sep 27, 2024
0.13.0 Sep 3, 2023
0.11.11 Apr 16, 2023
0.2.0 Nov 4, 2018

#159 in Encoding

Download history 1360/week @ 2024-08-18 1309/week @ 2024-08-25 1396/week @ 2024-09-01 1019/week @ 2024-09-08 1523/week @ 2024-09-15 1789/week @ 2024-09-22 1938/week @ 2024-09-29 2063/week @ 2024-10-06 2205/week @ 2024-10-13 1871/week @ 2024-10-20 2315/week @ 2024-10-27 2138/week @ 2024-11-03 2551/week @ 2024-11-10 2545/week @ 2024-11-17 2870/week @ 2024-11-24 3282/week @ 2024-12-01

11,265 downloads per month
Used in 2 crates

MIT license

99KB
2K SLoC

rsgen-avro   latest doc

A command line tool and library for generating serde-compatible Rust types from Avro schemas. The apache-avro crate, which is re-exported, provides a way to read and write Avro data with such types.

Command line usage

Download the latest release.

Available options rsgen-avro --help:

Generate Rust types from Avro schemas

Usage: rsgen-avro [OPTIONS] <GLOB_PATTERN> <OUTPUT_FILE>

Arguments:
  <GLOB_PATTERN>  Glob pattern to select Avro schema files
  <OUTPUT_FILE>   The file where Rust types will be written, '-' for stdout

Options:
      --fmt              Run rustfmt on the resulting <output-file>
      --nullable         Replace null fields with their default value when deserializing
      --precision <P>    Precision for f32/f64 default values that aren't round numbers [default: 3]
      --union-deser      Custom deserialization for apache-avro multi-valued union types
      --chrono-dates     Use chrono::NaiveDateTime for date/timestamps logical types
      --derive-builders  Derive builders for generated record structs
      --derive-schemas   Derive AvroSchema for generated record structs
      --extra_derives    Append extra derive macros list to the generated record structs
  -h, --help             Print help
  -V, --version          Print version

Library usage

As a library, the basic usage is:

use rsgen_avro::{Source, Generator};

let raw_schema = r#"
{
    "type": "record",
    "name": "test",
    "fields": [
        {"name": "a", "type": "long", "default": 42},
        {"name": "b", "type": "string"}
    ]
}
"#;

let source = Source::SchemaStr(&raw_schema);
let mut out = std::io::stdout();

let g = Generator::new().unwrap();
g.gen(&source, &mut out).unwrap();

This will generate the following output:

#[derive(Debug, PartialEq, Eq, Clone, serde::Deserialize, serde::Serialize)]
pub struct Test {
    #[serde(default = "default_test_a")]
    pub a: i64,
    pub b: String,
}

#[inline(always)]
fn default_test_a() -> i64 { 42 }

Various Schema sources can be used with Generator::gen(source, output) method:

pub enum Source<'a> {
    Schema(&'a rsgen_avro::Schema),    // Avro schema enum re-exported from `apache-avro`
    Schemas(&'a [rsgen_avro::Schema]), // A slice of Avro schema enums
    SchemaStr(&'a str),                // Schema as a json string
    GlobPattern(&'a str),              // Glob pattern to select schema files
}

Note also that the Generator can be customized with a builder:

let gen = rsgen_avro::Generator::builder()
    .precision(2)
    .build()
    .unwrap();

See GeneratorBuilder documentation for all available options.

Limitations

  • Avro schema namespace fields are ignored, therefore record names within a schema (and across schemas) must not conflict (i.e. must be unique).
  • Rust Option<T> are supported through Avro unions having "null" in their first position only (See #39)

Dependencies

~14–25MB
~384K SLoC