PerfTacho

PerfTacho is a small application for computing performance statistics of an executable.

The executable can be any binary file or a shell script. PerfTacho measures the total execution time and can also capture execution times reported in the executable's output. PerfTacho can run the executable multiple times and compute statistics from the recorded performance data.

Usage

    perftacho [-tachoOptions] <command> [params]

    Available options:
     -tachoRepeat=<n>       repeats the execution n times and calculates statistics
     -tachoShowDetails      together with -tachoRepeat: shows each individual duration
     -tachoShowOutput       displays the output of the executed command
     -tachoASCII            together with -tachoShowOutput: filters out non-ASCII characters in the output
     -tachoTag=<tag>        adds an informational tag to the output
     -tachoRegEx[=<re>]     captures durations from the output (see documentation below)

Example

     perftacho -tachoTag=MyTest -tachoRepeat=5 -tachoShowDetails curl https://www.google.com
     perftacho -tachoShowOutput ls -l

Calculated Statistics

When using PerfTacho in repeated execution mode (-tachoRepeat=n) the following statistics are calculated:

    - avg:                  the arithmetic average of the execution time

    - 95% conf. interval:   the interval [avg-d, avg+d] that contains the true average 
                            with 95% probability
    
    - min:                  minimum execution time

    - max:                  maximum execution time

    - std_dev:              the standard deviation of the execution time

    - n_recommended:        the recommended number of repetitions needed to get a confidence interval 
                            smaller than 5% of the avg value, i.e. [avg-5%, avg+5%]

Some mathematical background: the goal is to estimate the "true" average execution time. What is the true average and why "true"? Try the following with a sample executable: run PerfTacho with, let's say, 5 repetitions (-tachoRepeat=5). It calculates the average execution time for this sample of 5 executions. Now repeat the experiment again and again. You will see that the calculated average execution time is roughly similar for each experiment, but not exactly the same. The statistics calculated from a given sample of executions are only estimates of the "true" values.

The variation of the calculated average depends on the variation of the individual measurements, which is expressed by the standard deviation std_dev. A small standard deviation means you only need a few measurements to get a reasonable approximation of the average execution time; a high value means you need more measurements. The 95% confidence interval is a range that contains the "true" average execution time with 95% confidence. If the measured data varies strongly, this range will be quite broad. To narrow it down, increase the number of measurements (-tachoRepeat). n_recommended estimates the number of measurements you will probably need to get a confidence interval of +/- 5% around the average value.
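
As an illustration, here is a minimal Java sketch of how these statistics can be derived, assuming the usual normal approximation with z = 1.96. The class name and input values are made up for the example (the durations match the first column of the example output further below); this illustrates the definitions and is not PerfTacho's actual implementation:

    import java.util.Arrays;

    public class TachoStats {
        public static void main(String[] args) {
            // Hypothetical total durations in ms, taken from the example output below
            double[] ms = {1012, 1014, 1014, 1013, 1014};
            int n = ms.length;
            double avg = Arrays.stream(ms).average().orElse(0);
            // Sample standard deviation
            double variance = Arrays.stream(ms).map(x -> (x - avg) * (x - avg)).sum() / (n - 1);
            double stdDev = Math.sqrt(variance);
            // Half-width d of the 95% confidence interval [avg-d, avg+d], normal approximation (z = 1.96)
            double d = 1.96 * stdDev / Math.sqrt(n);
            // Smallest n for which the half-width drops below 5% of avg
            long nRecommended = Math.max(1L, (long) Math.ceil(Math.pow(1.96 * stdDev / (0.05 * avg), 2)));
            System.out.printf("avg: %.2fms / 95%% conf. interval %.2f / stddev %.2f ms / n_recommended %d%n",
                    avg, d, stdDev, nRecommended);
        }
    }

Running this reproduces the first statistics line of the example output below: avg 1013.40ms, confidence interval 0.78, stddev 0.89 ms, n_recommended 1.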

See https://en.wikipedia.org/wiki/Confidence_interval for more information.

Typical use cases

Evaluating performance of an executable and its algorithms

A software developer wants to evaluate the performance of an executable that comprises several algorithms. Typically, the developer wants to measure both the total execution time and the individual execution times of the algorithms, to track where the elapsed time is spent. For this purpose the developer just needs to surround the execution of each algorithm with execution-time logging and write the measured execution times to stdout.

Example (Java):

    long startTime = System.currentTimeMillis();
    runMyAlgorithm();
    long duration = System.currentTimeMillis() - startTime;
    // The bracketed "[<value> <unit>]" format is what PerfTacho's default regex captures
    System.out.printf("Duration MyAlgorithm [%d ms]%n", duration);

The executable should produce the following output:

    Duration MyAlgorithm [321 ms]

If multiple algorithms are to be measured, just extend the output accordingly, e.g.:

    Algo1 [321 ms] ... some other output ... Algo2 [456 ms]
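
For example, instrumenting two algorithms could look like this (runAlgo1 and runAlgo2 are hypothetical placeholders for your own code):

    long t1 = System.currentTimeMillis();
    runAlgo1();                               // hypothetical first algorithm
    System.out.printf("Algo1 [%d ms] ", System.currentTimeMillis() - t1);
    System.out.print("... some other output ... ");
    long t2 = System.currentTimeMillis();
    runAlgo2();                               // hypothetical second algorithm
    System.out.printf("Algo2 [%d ms]%n", System.currentTimeMillis() - t2);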

PerfTacho parses the output of the executable and captures the performance data with a regular expression. With the current default regular expression

    "\[(\-?\d+[\.,]?\d*)\s?(s|ms|ns)\]"

it matches the following output examples:

    [123.0 ms] [123.0ms] [123 s] [123,45 s] [12345ns]
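
To see what the two capture groups contain, here is a small self-contained Java sketch applying an equivalent pattern (the redundant escapes of the default pattern are dropped); group 1 is the numeric value, group 2 the unit. This is an illustration only, not part of PerfTacho:

    import java.util.regex.Matcher;
    import java.util.regex.Pattern;

    public class RegexDemo {
        public static void main(String[] args) {
            // Equivalent to the default pattern: value in group 1, unit in group 2
            Pattern p = Pattern.compile("\\[(-?\\d+[.,]?\\d*)\\s?(s|ms|ns)\\]");
            Matcher m = p.matcher("Algo1 [321 ms] ... some other output ... Algo2 [456 ms]");
            while (m.find()) {
                System.out.println("value=" + m.group(1) + " unit=" + m.group(2));
            }
        }
    }

This prints value=321 unit=ms followed by value=456 unit=ms.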

The following command runs the executable in multiple passes and computes statistics from the recorded performance data:

    perftacho -tachoRepeat=5 -tachoShowDetails -tachoRegEx MyProgram

The output might look like this:

    Tacho : duration in ms
    1012.00         102.00  100.02  119.00
    1014.00         116.00  100.01  107.00
    1014.00         114.00  100.02  115.00
    1013.00         105.00  100.01  103.00
    1014.00         112.00  100.02  105.00
    Tacho : avg: 1013.40ms / 95% conf. interval 0.78 / min: 1012ms / max: 1014ms / stddev 0.89 ms / n_recommended 1
    Tacho : avg: 109.80ms / 95% conf. interval 5.27 / min: 102ms / max: 116ms / stddev 6.02 ms / n_recommended 5
    Tacho : avg: 100.01ms / 95% conf. interval 0.00 / min: 100.009ms / max: 100.018ms / stddev 0.00 ms / n_recommended 1
    Tacho : avg: 109.80ms / 95% conf. interval 6.02 / min: 103ms / max: 119ms / stddev 6.87 ms / n_recommended 6

In this example the executable MyProgram was run 5 times (-tachoRepeat=5). PerfTacho captured 3 performance data items from the output. From this data, the table shown above is generated and displayed (-tachoShowDetails).

The first column shows the total execution time; the last 3 columns show the execution times captured from the output. All data is converted to milliseconds. For each column, a row of statistics is computed and displayed below the table.

Installation

PerfTacho is a self-contained binary without any dependencies. You have the following options:

Build and install the binary from GitHub sources

You need to have Rust and Cargo installed. Please consult https://doc.rust-lang.org/cargo/getting-started/installation.html

Get the sources and build:

    git clone https://github.com/qrider71/tacho.git
    cd tacho
    cargo build --release
    cd target/release

Copy the perftacho binary to your binary folder (which should be in your PATH), e.g. on Linux:

    sudo cp perftacho /usr/local/bin/

Get and build the latest release from crates.io

You need to have Rust and Cargo installed. Please consult https://doc.rust-lang.org/cargo/getting-started/installation.html

    cargo install perftacho

Cargo installs the compiled binary into its bin folder (typically ~/.cargo/bin), which should be in your PATH.

macOS

You can install from sources as described above or download the binary from GitHub: https://github.com/qrider71/tacho/releases

You should pick the file perftacho-osx-x.y.z and copy it to your bin folder (which should be in your PATH)

Alternatively, you can install it with Homebrew (https://brew.sh/):

    brew tap qrider71/perftacho
    brew install perftacho

Linux

You can install from sources as described above or download the binary from GitHub: https://github.com/qrider71/tacho/releases

You should pick the file perftacho-linux-x.y.z and copy it to your bin folder (which should be in your PATH)

Windows

You can install from sources as described above or download the binary from GitHub: https://github.com/qrider71/tacho/releases

You should pick the file perftacho-windows-x.y.z.exe and copy it to your bin folder (which should be in your PATH)

Roadmap

The following roadmap is planned. Please contact me if you have any comments or ideas for useful features:

V.1.0

Version 1.0 provides all command-line features for measuring the performance of a single executable:

  • Simple measurement of the runtime duration (implemented)
  • Statistical analysis of multiple runs (implemented)
  • Grabbing performance data from the output of the executable with regular expressions (implemented)

V.2.0

Version 2.0 provides features for comparing the performance of several executables. It can be used for benchmarking different implementations. The execution plans can be specified in a configuration file.

  • Configuration file (YAML) based performance test plans for several executables
  • Exporting the results in different formats

V.3.0

Version 3.0 provides multi-threading options for running the performance tests. The focus is on measuring performance under multi-threaded access and stress-test conditions.

  • Multi-threading options for running the performance tests
