#test-cases #input-file #testing #filesize #case #find #minimize

app minimizer

Minimize files to find minimal test case

10 stable releases

2.0.3 Nov 13, 2024
2.0.1 Nov 4, 2024
1.3.1 Oct 29, 2024
1.2.1 Aug 11, 2024

#53 in Text processing

Download history 143/week @ 2024-09-17 219/week @ 2024-09-24 118/week @ 2024-10-01 294/week @ 2024-10-08 218/week @ 2024-10-15 542/week @ 2024-10-22 657/week @ 2024-10-29 376/week @ 2024-11-05 658/week @ 2024-11-12 449/week @ 2024-11-19 275/week @ 2024-11-26 467/week @ 2024-12-03 447/week @ 2024-12-10 400/week @ 2024-12-17 513/week @ 2024-12-24 446/week @ 2024-12-31

1,904 downloads per month

MIT license

46KB
1K SLoC

Minimizer

Minimizer is a program that is able to minimize the size of files so that they still meet the set requirements.

It is the best suited for minimizing files for fast app, which one iteration takes less than second.

Currently it works only on Linux.

How to use

  • install rust on linux, clone repo and build project
cargo install --path .

or just compile it with crates.io

cargo install minimizer
  • run minimizer
minimizer --input-file input.txt --output-file output.txt --command "echo {}" --attempts 300 --broken-info "BROKEN"

to get info about each argument, read source code or run

minimizer --help

Test it

echo "ABCDEFGH" > input.txt
echo "gABCDEFFGH" >> input.txt
echo "BCDERF" >> input.txt
echo "ABCD" >> input.txt
echo "BDCE" >> input.txt

running

minimizer --input-file input.txt --output-file output.txt --command "cat {}" --attempts 300 --broken-info "AB" -e -v

will probably give you output.txt with content

AB

algorithms are not deterministic so not always the same result will be achieved

Using bigger number of attempts will increase the chance of getting smaller output file and will enable additional mode which rely on removing line/byte/char one by one.

How it works

At start minimizer reads file and checks if this file returns expected output.

If yes, then app continue to run.

At first app checks if file contains valid utf-8 characters, if yes, then two additional modes are enabled, which works on lines and characters.

Each mode(which works on Vec<> of lines, chars and bytes) at start, tries to remove items from start/end of file.

Later in loop random elements from middle/start/end are removed to check if file still returns expected output.

Different strategies

Basing on different files, different strategies can be used to minimize file.

In repo only one general strategy is implemented, which should be good for most of the files.

But if you have some specific file, you can implement your own strategy

Typical commands

Ruff

minimizer --input-file /home/rafal/Desktop/RunEveryCommand/C/PY_FILE_TEST_25518.py --output-file a.py --command "red_knot" --attempts 1000 --broken-info "RUST_BACKTRACE" -z "not yet implemented" -z "failed to parse" -z "SyntaxError" -z "Sorry:" -z "IndentationError" -k "python3 -m compileall {}" -r -v

or shorter

minimizer -i /home/rafal/Desktop/RunEveryCommand/C/PY_FILE_TEST_25518.py -o a.py -c "red_knot" -a 1000 -b "RUST_BACKTRACE" -z "not yet implemented" -z "failed to parse" -z "SyntaxError" -z "Sorry:" -z "IndentationError" -k "python3 -m compileall {}" -r -v

Red Knot

minimizer --input-file /home/rafal/Desktop/RunEveryCommand/C/PY_FILE_TEST_25518.py --output-file a.py --command "red_knot" --attempts 1000 --broken-info "RUST_BACKTRACE" -z "not yet implemented" -z "failed to parse" -z "SyntaxError" -z "Sorry:" -z "IndentationError" -k "python3 -m compileall {}" -r -v

Lofty

minimizer --input-file input.mp3 --output-file output.mp3 --command "lofty {}" --attempts 100000 -r --broken-info "RUST_BACKTRACE" -v --max-time 200 --strategy pedantic

or sho

minimizer -i input.mp3 -o output.mp3 -c "lofty {}" -a 100000 -r -b "RUST_BACKTRACE" -v -t 200 -s pedantic

Why

I just needed this - I doubt that it will be useful for anyone else, but feel free to use this.

License

MIT License

Dependencies

~1.5–8.5MB
~74K SLoC