Background
As part of this long-term goal, we want to develop a benchmarking tool that, given two Consensus versions, compares the cost of performing the five main ledger operations between them. These five ledger operations are:
Forecast.
Header tick.
Header application.
Block tick.
Block application.
These operations combined constitute the bulk of the time used for block adoption.
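To make the measurements concrete, here is a minimal sketch of how one timing sample per block could be modelled. All names are illustrative assumptions, not the actual db-analyser or ouroboros-consensus API:

```python
from dataclasses import dataclass

@dataclass
class LedgerOpTimings:
    """Per-block timing sample (in microseconds) for the five ledger
    operations; field names are hypothetical, chosen for illustration."""
    forecast: float
    header_tick: float
    header_application: float
    block_tick: float
    block_application: float

    def total(self) -> float:
        # Combined time the five operations contribute to adopting
        # this one block.
        return (self.forecast + self.header_tick
                + self.header_application + self.block_tick
                + self.block_application)
```

Comparing two Consensus versions then reduces to comparing, per operation, the series of such samples produced by each version over the same range of blocks.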
We want this tool to be usable in the development process.
Motivation
We want to provide a means for the Consensus and Ledger developers, as well as the release engineers, to be able to spot performance regressions early on.
Definition of done
Produce a tool that makes it possible to compare the cost of the main ledger operations across two Consensus versions. The comparison can be carried out by inspecting the artefacts produced by the tool, described below. No automation in the detection of performance regressions is required.
The tool should:
Allow specifying the versions of Consensus to compare.
Allow specifying the GHC version used to build each Consensus version under comparison.
Allow specifying the RTS options used to run db-analyser.
Produce a plot per ledger operation, which shows the execution time of both versions (see this example).
TODO: Produce a report/table with <which values?> and <which format?>.
Make each report traceable by storing data like "build information".
Be properly documented so that other developers can use it.
Yield results that are consistent with the system-level benchmarks.
Additionally, we should:
Provide the developers with infrastructure (eg AWS instances) and data that they can use to run the benchmark comparison tool.
As future steps, we could consider running these benchmarks on CI, if that adds value.
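As an illustration of what a report artefact could look like, the sketch below renders per-operation mean execution times of two versions as a Markdown table. The column choice and format are assumptions for illustration only; the actual values and format remain to be decided:

```python
def markdown_report(op_means_a: dict, op_means_b: dict) -> str:
    """Render per-operation mean execution times (ms) of two Consensus
    versions as a Markdown table, with the relative change from A to B.
    Both dicts map operation name -> mean time; columns are illustrative."""
    lines = [
        "| Ledger operation | A (ms) | B (ms) | change |",
        "|---|---|---|---|",
    ]
    for op in op_means_a:
        a, b = op_means_a[op], op_means_b[op]
        change = (b - a) / a * 100.0  # percentage change relative to A
        lines.append(f"| {op} | {a:.2f} | {b:.2f} | {change:+.1f}% |")
    return "\n".join(lines)
```

A negative change would indicate that version B is faster than version A for that operation.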
We’ve developed the existing prototype into an automatable, self-contained benchmark called beacon, as well as systematized workloads and run structure for it. Moreover, we’ve demonstrated the usefulness of the metrics and their reproducibility and identified domains that are viable for QTAs with system-level benchmarks.
#161 created a tool for comparing benchmarks. We can use it as a starting point. Additional improvements to this tool include (in no particular order):
Make analyseFromSlot and numBlocksToProcess optional.
Add support for command line argument parsing.
Replace A and B in the plot title with the names of versions A and B.
Render output data in a more legible format (e.g. Markdown).
Round benchmarking metrics to two or three decimal places.
Compute the distance between the metric vectors (per data point).
Perform statistical analysis of the outliers detected during the first benchmarking pass.
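The last two items above could be sketched as follows. This is a minimal illustration under assumed choices: the Euclidean distance as the vector metric and a z-score cutoff as the outlier criterion, neither of which is a settled design decision:

```python
import math
from statistics import mean, stdev

def per_point_distance(xs, ys):
    """Absolute per-data-point difference between two equally long
    metric vectors (one entry per slot/block)."""
    assert len(xs) == len(ys)
    return [abs(x - y) for x, y in zip(xs, ys)]

def euclidean_distance(xs, ys):
    """Single-number summary: Euclidean distance between the vectors."""
    return math.sqrt(sum(d * d for d in per_point_distance(xs, ys)))

def zscore_outliers(xs, threshold=3.0):
    """Indices of values lying more than `threshold` sample standard
    deviations from the mean; a common, simple outlier criterion."""
    m, s = mean(xs), stdev(xs)
    if s == 0:
        return []
    return [i for i, x in enumerate(xs) if abs(x - m) / s > threshold]
```

Flagged outliers from a first benchmarking pass could then be re-measured or excluded before the versions are compared.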
Subtasks
db-analyser ledger ops result. #223
beacon SlotDataPoint and Metadata. #918
beacon upgrade. #919