bench-show: Show, plot and compare benchmark results

[ benchmarking, bsd3, library, performance, program ] [ Propose Tags ] [ Report a vulnerability ]

Generate text reports and graphical charts from the benchmark results generated by gauge or criterion and stored in a CSV file. This tool is especially useful when you have many benchmarks or if you want to compare benchmarks across multiple packages. You can generate many interesting reports including:

Show individual reports for all the fields measured e.g. time taken, peak memory usage, allocations, among many other fields measured by gauge
Sort benchmark results on a specified criterion e.g. you may want to see the biggest cpu hoggers or biggest memory hoggers on top
Across two benchmark runs (e.g. before and after a change), show all the operations that resulted in a regression of more than x% in descending order, so that we can quickly identify and fix performance problems in our application.
Across two (or more) packages providing similar functionality, show all the operations where the performance differs by more than 10%, so that we can critically analyze the packages and choose the right one.

Quick Start: Use gauge or criterion to generate a results.csv file, and then use either the bench-show executable or the library APIs to generate textual or graphical reports.

$ bench-show report results.csv
$ bench-show graph results.csv output

report "results.csv"  Nothing defaultConfig
graph  "results.csv" "output" defaultConfig

There are many ways to present the reports, for example, you can show can show % regression from a baseline in descending order textually as follows:

(time)(Median)(Diff using min estimator)
Benchmark streamly(0)(μs)(base) streamly(1)(%)(-base)
--------- --------------------- ---------------------
zip                      644.33                +23.28
map                      653.36                 +7.65
fold                     639.96                -15.63

To show the same graphically:

See the README and the BenchShow.Tutorial module for comprehensive documentation.

[Skip to Readme]

Modules

[Index] [Quick Jump]

BenchShow
- Internal
- BenchShow.Tutorial

Flags

Manual Flags

Name	Description	Default
no-charts	Don't build the modules that provide charting functionality	Disabled
no-colors	Use pretty printing without colors	Disabled

Use -f <flag> to enable a flag, or -f -<flag> to disable that flag. More info

Downloads

bench-show-0.3.2.tar.gz [browse] (Cabal source package)
Package description (revised from the package)

Note: This package has metadata revisions in the cabal description newer than included in the tarball. To unpack the package including the revisions, use 'cabal get'.

Maintainer's Corner

Package maintainers

harendra, adithyaov

For package maintainers and hackage trustees

edit package information

Candidates

No Candidates

Versions [RSS]	0.2.0, 0.2.1, 0.2.2, 0.3.0, 0.3.1, 0.3.2
Change log	Changelog.md
Dependencies	ansi-wl-pprint (>=0.6 && <1.1), base (>=4.8 && <5), bench-show, Chart (>=1.6 && <2), Chart-diagrams (>=1.6 && <2), csv (>=0.1 && <0.2), directory (>=1.2 && <1.4), filepath (>=1.3 && <1.6), mwc-random (>=0.13 && <0.16), optparse-applicative (>=0.14.2 && <0.19), optparse-simple (>=0.1.0 && <0.2), split (>=0.2 && <0.3), statistics (>=0.15 && <0.17), transformers (>=0.4 && <0.7), vector (>=0.10 && <0.14) [details]
Tested with	ghc ==9.2.8, ghc ==9.4.8, ghc ==9.6.4, ghc ==9.8.1, ghc ==9.10.1
License	BSD-3-Clause
Copyright	2017, 2018 Composewell Technologies
Author	Harendra Kumar
Maintainer	harendra.kumar@gmail.com
Uploaded	by adithyaov at 2022-02-02T23:10:03Z
Revised	Revision 2 made by harendra at 2025-02-17T19:20:16Z
Category	Performance, Benchmarking
Home page	https://github.com/composewell/bench-show
Bug tracker	https://github.com/composewell/bench-show/issues
Source repo	head: git clone https://github.com/composewell/bench-show
Distributions	LTSHaskell:0.3.2, NixOS:0.3.2
Reverse Dependencies	1 direct, 0 indirect [details]
Executables	bench-show
Downloads	3825 total (31 in the last 30 days)
Rating	2.0 (votes: 1) [estimated by Bayesian average]
Your Rating	λ λ λ
Status	Docs available [build log] Last success reported on 2022-02-03 [all 1 reports]

Readme for bench-show-0.3.2

[back to package description]

bench-show

Show individual reports for all the fields measured e.g. time taken, peak memory usage, allocations, among many other fields measured by gauge
Sort benchmark results on a specified criterion e.g. you may want to see the biggest cpu hoggers or biggest memory hoggers on top
Across two benchmark runs (e.g. before and after a change), show all the operations that resulted in a regression of more than x% in descending order, so that we can quickly identify and fix performance problems in our application.
Across two (or more) packages providing similar functionality, show all the operations where the performance differs by more than 10%, so that we can critically analyze the packages and choose the right one.

Quick Start

Use gauge or criterion to generate a results.csv file, and then use either the bench-show executable or the library APIs to generate textual or graphical reports.

Executable

Use bench-show executable with report and graph sub-commands:

$ bench-show report results.csv
$ bench-show graph results.csv output

For advanced usage, control the generated report by the CLI flags.

Library

Use report and graph library functions:

report "results.csv"  Nothing defaultConfig
graph  "results.csv" "output" defaultConfig

For advanced usage, control the generated report by modifying the defaultConfig.

Reports and Charts

report with Fields presentation style generates a multi-column report. We can select many fields from a gauge raw report. Units of the fields are automatically determined based on the range of values:

$ bench-show --presentation Fields report results.csv

report "results.csv" Nothing defaultConfig { presentation = Fields }

Benchmark     time(μs) maxrss(MiB)
------------- -------- -----------
vector/fold     641.62        2.75
streamly/fold   639.96        2.75
vector/map      638.89        2.72
streamly/map    653.36        2.66
vector/zip      651.42        2.58
streamly/zip    644.33        2.59

graph generates one bar chart per field:

$ bench-show --presentation Fields graph results.csv

graph "results.csv" "output" defaultConfig

When the input file contains results from a single benchmark run, by default all the benchmarks are placed in a single benchmark group named "default".

Grouping

Let's write a benchmark classifier to put the streamly and vector benchmarks in their own groups:

   classifier name =
       case splitOn "/" name of
           grp : bench -> Just (grp, concat bench)
           _          -> Nothing

Now we can show the two benchmark groups as separate columns. We can generate reports comparing different benchmark fields (e.g. time and maxrss) for all the groups:

   report "results.csv" Nothing
     defaultConfig { classifyBenchmark = classifier }

(time)(Median)
Benchmark streamly(μs) vector(μs)
--------- ------------ ----------
fold            639.96     641.62
map             653.36     638.89
zip             644.33     651.42

We can do the same graphically as well, just replace report with graph in the code above. Each group is placed as a cluster on the graph. Multiple clusters are placed side by side (i.e. on the same scale) for easy comparison. For example:

Regression, Percentage Difference and Sorting

We can append benchmarks results from multiple runs to the same file. These runs can then be compared. We can run benchmarks before and after a change and then report the regressions by percentage change in a sorted order:

Given a results file with two runs, this code generates the report that follows:

   report "results.csv" Nothing
     defaultConfig
         { classifyBenchmark = classifier
         , presentation = Groups PercentDiff
         , selectBenchmarks = \f ->
              reverse
              $ map fst
              $ sortBy (comparing snd)
              $ either error id $ f (ColumnIndex 1) Nothing
         }

(time)(Median)(Diff using min estimator)
Benchmark streamly(0)(μs)(base) streamly(1)(%)(-base)
--------- --------------------- ---------------------
zip                      644.33                +23.28
map                      653.36                 +7.65
fold                     639.96                -15.63

It tells us that in the second run the worst affected benchmark is zip taking 23.28 percent more time compared to the baseline.

Graphically:

Full Documentation and examples

See the haddock documentation on Hackage
See the comprehensive tutorial module in the haddock docs
For examples see the test directory in the package

Contributions and Feedback

Contributions are welcome! Please see the TODO.md file or the existing issues if you want to pick up something to work on.

Any feedback on improvements or the direction of the package is welcome. You can always send an email to the maintainer or raise an issue for anything you want to suggest or discuss, or send a PR for any change that you would like to make.