20 releases

new 0.8.0 Nov 15, 2024
0.7.4 Jul 23, 2024
0.7.3 Sep 28, 2023
0.7.2 Jul 29, 2023
0.1.0 Nov 23, 2021

#157 in WebAssembly

Download history 511/week @ 2024-07-27 580/week @ 2024-08-03 409/week @ 2024-08-10 491/week @ 2024-08-17 574/week @ 2024-08-24 719/week @ 2024-08-31 753/week @ 2024-09-07 321/week @ 2024-09-14 1186/week @ 2024-09-21 730/week @ 2024-09-28 693/week @ 2024-10-05 457/week @ 2024-10-12 371/week @ 2024-10-19 1029/week @ 2024-10-26 1230/week @ 2024-11-02 462/week @ 2024-11-09

3,151 downloads per month
Used in 7 crates (3 directly)

Apache-2.0

285KB
5.5K SLoC

Fluvio SmartModules

This crate provides types and macros for creating custom SmartModules, which are WebAssembly modules that may be used to modify the behavior of Fluvio streams to consumers. The currently supported SmartModule types are filters, which may describe records to keep in or discard from a stream.

Writing SmartModules

See the examples directory for full examples.

All SmartModules require adding crate-type = ['cdylib'] to the Cargo.toml. For a quick setup using cargo-generate, see the SmartModule template.

[package]
name = "fluvio-wasm-filter"
version = "0.1.0"
authors = ["Fluvio Contributors <team@fluvio.io>"]
edition = "2018"

[lib]
crate-type = ['cdylib']

[dependencies]
fluvio-smartmodule = "0.1.0"

Filtering

For filtering, write your smartmodule using #[smartmodule(filter)] on your top-level function. Consider this the "main" function of your SmartModule.

use fluvio_smartmodule::{smartmodule, Record, Result};

#[smartmodule(filter)]
pub fn filter(record: &Record) -> Result<bool> {
    let string = std::str::from_utf8(record.value.as_ref())?;
    Ok(string.contains('a'))
}

This filter will keep only records whose contents contain the letter a.

Mapping

Mapping functions use #[smartmodule(map)], and are also a top-level entrypoint.

use fluvio_smartmodule::{smartmodule, Record, RecordData, Result};

#[smartmodule(map)]
pub fn map(record: &Record) -> Result<(Option<RecordData>, RecordData)> {
    let key = record.key.clone();

    let string = std::str::from_utf8(record.value.as_ref())?;
    let int = string.parse::<i32>()?;
    let value = (int * 2).to_string();

    Ok((key, value.into()))
}

This SmartModule will read each input Record as an integer (i32), then multiply it by 2.

Aggregate

Aggregate functions are a way to combine the data from many input records. Each time the aggregate function is called, it receives an "accumulated" value as well as the value of the current record in the stream, and is expected to combine the accumulator with the value to produce a new accumulator. This new accumulator value will be passed to the next invocation of aggregate with the next record value. The resulting stream of values is the output accumulator from each step.

use fluvio_smartmodule::{smartmodule, Result, Record, RecordData};

#[smartmodule(aggregate)]
pub fn aggregate(accumulator: RecordData, current: &Record) -> Result<RecordData> {
    let mut acc = String::from_utf8(accumulator.as_ref().to_vec())?;
    let next = std::str::from_utf8(current.value.as_ref())?;
    acc.push_str(next);
    Ok(acc.into())
}

This SmartModule reads each record as a string and appends it to the accumulator string.

ArrayMap

ArrayMap functions are used to take one input record and create zero to many output records. This can be used to chop up input records that logically represent more than one data point and turn them into independent records. Below is an example where we take JSON arrays and convert them into a stream of the inner JSON objects.

use fluvio_smartmodule::{smartmodule, Result, Record, RecordData};

#[smartmodule(array_map)]
pub fn array_map(record: &Record) -> Result<Vec<(Option<RecordData>, RecordData)>> {
    // Read the input record as a JSON array
    let array = serde_json::from_slice::<Vec<serde_json::Value>>(record.value.as_ref())?;
    
    // Convert each individual value from the array into its own JSON string
    let strings = array
        .into_iter()
        .map(|value| serde_json::to_string(&value))
        .collect::<core::result::Result<Vec<String>, _>>()?;
        
    // Return a list of records to be flattened into the output stream
    let kvs = strings
        .into_iter()
        .map(|s| (None, RecordData::from(s)))
        .collect::<Vec<_>>();
    Ok(kvs)
}

License

This project is licensed under the Apache license.

Contribution

Unless you explicitly state otherwise, any contribution intentionally submitted for inclusion in Fluvio by you, shall be licensed as Apache, without any additional terms or conditions.

Dependencies

~4MB
~79K SLoC