25 releases (2 stable)
1.0.1 | Jul 14, 2023 |
---|---|
1.0.0 | Oct 14, 2022 |
0.9.2 | May 19, 2022 |
0.9.1 | Dec 18, 2021 |
0.4.3 |
|
#329 in Machine learning
125KB
2.5K
SLoC
bigml-parallel
: A CLI tool for running WhizzML scripts in parallel
This is a tool for parallel processing of BigML resources (typically sources or datasets) using WhizzML scripts. It's still somewhat experimental. To install, download binaries from the releases page.
This tool will output the resulting BigML execution objects as JSON structs, one per line, in no particular order. It runs up to --max-tasks
BigML executions at a time.
Execute WhizzML script in parallel over one or more BigML resources
USAGE:
bigml-parallel [OPTIONS] --script <script>
FLAGS:
-h, --help Prints help information
-V, --version Prints version information
OPTIONS:
-i, --input <inputs>...
Extra inputs to our WhizzML script, specified as
"name=value". These will be parsed as JSON if possible,
or treated as strings otherwise
-J, --max-tasks <max-tasks>
How many BigML tasks should we use at a time? [default:
2]
-n, --name <name>
The name to use for our execution objects
-o, --output <outputs>...
Expected outputs to our WhizzML script, specified as
"name"
-R, --resource-input-name <resource-input-name>
The input name used to pass the dataset [default:
resource]
-r, --resource <resources>...
The resource IDs to process. (Alternatively, pipe
resource IDs on standard input, one per line.)
--retry-count <retry-count>
How many times should we retry a failed execution
matching --retry-on? [default: 0]
--retry-on <retry-on>
A regular expression specifying which WhizzML script
errors should be retried
-s, --script <script>
The WhizzML script ID to run
--tag <tags>...
Apply a tag to the BigML resources we create
Maintenance status
This project has been moved to the "back burner." Issues that affect Faraday will still be addressed. No guarantees about anything else. Please contact the authors if you want to make heavy use of BigML from Rust.
Dependencies
~16–32MB
~513K SLoC