7 unstable releases (3 breaking)

0.4.1 Dec 7, 2023
0.3.3 Oct 12, 2023
0.2.2 Oct 10, 2023
0.1.1 Oct 7, 2023

#609 in Command line utilities

37 downloads per month

MIT license

23KB
489 lines

plumber

unix process pipelines made easy

The goal of this project was to create simple, modular, and powerful pipelines using the standard tools of the unix operating system.

pull_from_stream | transformer_1 | transformer_2 | transformer_3 | push_to_stream

Shell users may recognize the process pipeline language used and ask why even use plumber?

The purpose of plumber is to ensure that pipelines are robust against reboots and interrupts, easily defined and managed through files (for IaC use cases), and are highly configurable.

features

  • modular pipeline building
  • robust signal handling
  • extremely simple implementation

behavior

  • pipes imply that stdout is redirected to stdin of following program
  • plumber run defaults stderr logs to /tmp/plumber/log/<plumber file name>/<cmd>.stderr.log
  • termination signals will be caught, sent to the FIRST program in the pipeline, and wait for completion

example

create a test file with a pipeline of processes:

echo "exec = 'tail -n 100 -f /usr/share/dict/words | grep a | wc'" > test_pipeline.plumb

run the pipeline with plumber run <PATH>

plumber run test_pipeline.plumb
[2023-12-07T19:31:43Z INFO  plumber::pipeline] (test_pipeline) executing pipeline: 'tail -n 100 -f /usr/share/dict/words | grep a | wc'
[2023-12-07T19:31:43Z INFO  plumber::pipeline] (test_pipeline) logging command stderr to: '/tmp/plumber/log/test_pipeline/*.stderr.log'
[2023-12-07T19:31:43Z INFO  plumber::pipeline] (test_pipeline) pid of first job in pipeline is: 30251
[2023-12-07T19:31:43Z INFO  plumber::pipeline] (test_pipeline) gracefully stop pipeline with: 'plumber stop test_pipeline'

gracefully stop the pipeline with plumber stop <NAME> or hitting Ctrl+C in the main process window

^C[2023-12-07T19:31:57Z INFO  plumber::pipeline] (test_pipeline) exiting gracefully...

stdout of the last command is streamed to the stdout of plumber to allow for chaining of pipelines. In this case we see the output of the wc command:

      28      28     334

find stderr logs in /tmp/plumber/log/test_pipeline/tail.stderr.log, grep.stderr.log, wc.stderr.log

try rerunning the pipeline simply through your regular shell and hitting ctrl-c.

tail -n 100 -f /usr/share/dict/words | grep 'a' | wc

notice how there is no output as all the commands received the interrupt. With plumber you can be confident that data held in the buffers of intermediate processes will never be lost like this.

modules

Building a module / transformer / process for the pipeline is easy in almost any programming language (performance is up to the programmer of course). All you need is a program that reads from stdin, does a transformation, and outputs to stdout.

Note: This is different for the first process in a pipeline - it is expected that it will handle SIGTERM and SIGINT in a responsible way.

daemonizing

use your system's daemon / service manager to daemonize plumber pipelines. Here is an example systemd unit file:

[Unit]
Description=Example Plumber Pipeline

[Service]
Environment=RUST_LOG=debug
ExecStart=plumber run /opt/plumber-example
ExecStop=plumber stop /opt/plumber-example
User=plumber
Restart=always

[Install]
WantedBy=default.target

Dependencies

~6–18MB
~206K SLoC