10 stable releases
Version | Released |
---|---|
2.0.0 | Jan 20, 2024 |
1.3.4 | Nov 1, 2023 |
1.3.3 | Jul 4, 2023 |
1.3.2 | Oct 16, 2022 |
0.1.1 | Jul 11, 2018 |
#712 in Command line utilities
21KB
293 lines
xstream
A command line tool to split a stream by a delimiter and pipe each section to a child process.
Each chunk can be piped to a new process, with limited parallelism, or, for embarrassingly parallel processing, processes can be reused.
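To make the splitting behaviour concrete, here is a minimal sequential sketch in plain bash. It is not how xstream is implemented: `each_chunk` is a hypothetical helper, it buffers each chunk in a shell variable, and it runs one child at a time with no parallelism or process reuse. `awk` stands in for the `paste -sd+ | bc` pipeline used in the benchmark.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of xstream): run "$@" once per
# NUL-delimited chunk of stdin, one chunk at a time.
each_chunk() {
  local chunk
  # read -d '' stops at each NUL byte; IFS= and -r keep the chunk verbatim.
  # The "|| [[ -n $chunk ]]" clause also handles a final chunk with no trailing NUL.
  while IFS= read -r -d '' chunk || [[ -n $chunk ]]; do
    printf '%s' "$chunk" | "$@"
  done
}

# Two chunks, "1\n2\n3" and "4\n5", each summed by its own awk process.
printf '%s\0' $'1\n2\n3' $'4\n5' | each_chunk awk '{ s += $1 } END { print s }'
# prints 6, then 9
```

Every chunk here pays for a fresh child process and waits for the previous one to finish, which is the overhead the parallel and process-reuse modes described above are meant to reduce.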
Installation
cargo install xstream-util
Benchmarks
To illustrate the speed-up for reasonably sized streams, the following simple benchmark generates 1001 streams of integers and sums each one with bc, once through xstream and once through xargs.
First, generate a null-delimited set of streams with
time for I in {10000..11000}; do seq $I; echo -ne '0\0'; done
The combined output is roughly 50M, so each of the 1001 streams is roughly 50k.
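The size estimate can be checked without writing the data to disk. The sketch below assumes bash and a `seq` that prints one integer per line; `total_bytes` is a hypothetical helper that adds up the byte count of each `seq` output plus the two bytes (`0` and the NUL) that follow it in the generator loop.

```shell
#!/usr/bin/env bash
# Hypothetical helper: total bytes produced by the generator loop for
# streams "seq FIRST" .. "seq LAST", counting the 2-byte "0\0" after each.
total_bytes() {
  local i total=0
  for (( i = $1; i <= $2; i++ )); do
    total=$(( total + $(seq "$i" | wc -c) + 2 ))
  done
  echo "$total"
}

total_bytes 10000 11000   # 51947896 bytes in total
```

Dividing by 1001 streams gives about 51.9k per stream, in line with the rough figures quoted above.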
I then piped this into xstream as
| time xstream -0 -w '' -- bash -c 'paste -sd+ | bc' > /dev/null
and xargs as
| time xargs -0I@ bash -c '<<< "@" head -n-1 | paste -sd+ | bc' > /dev/null
which on my system gives:
Program | User | System | Elapsed |
---|---|---|---|
xstream | 10.21s | 1.67s | 0:09.58 |
xargs | 15.72s | 2.85s | 0:14.52 |
This benchmark is a toy example, but xstream already cuts elapsed time by roughly a third (0:09.58 versus 0:14.52) when each stream is only 50k.
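For reference, the relative improvement can be worked out from the elapsed times in the table. This is a small sketch using `awk` for the floating-point arithmetic; `speedup` is a hypothetical helper that takes the faster and slower times in seconds.

```shell
#!/usr/bin/env bash
# Hypothetical helper: compare two elapsed times given in seconds.
speedup() {
  awk -v fast="$1" -v slow="$2" 'BEGIN {
    printf "time saved: %.1f%%\n", (1 - fast / slow) * 100
    printf "speed-up: %.2fx\n", slow / fast
  }'
}

# Elapsed times from the table: 0:09.58 for xstream, 0:14.52 for xargs.
speedup 9.58 14.52
# prints "time saved: 34.0%" and "speed-up: 1.52x"
```

These numbers come from a single run on one machine, so they are best read as an order-of-magnitude indication.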
Dependencies
~0–8.5MB
~84K SLoC