
Plugins for the most common Continuous Integration Systems #54

Open

adamsitnik opened this issue Nov 29, 2015 · 13 comments

@adamsitnik
Member

I would like to start a discussion; this is not a complete vision yet.

Problem: in terms of Continuous Integration, software performance is usually ignored, or at best not measured properly. Most developers profile and optimize only when clients start to complain. Performance is not treated as a feature.

As a user, I would like to know when code changes decrease performance.

The basic idea is to compare benchmark results with previous results. If the difference is big enough, the CI build should go red, just as it would with a broken test.

@smudge202

I don't think there's a good way of doing this due to persistence (or the lack thereof). I've never come across a CI pipeline that needs, or even has access to, previous builds.

The only way I can imagine it working is if benchmarks are committed to source control. CI would fail the build if the difference between the committed benchmark and the CI benchmark fell outside a given tolerance, requiring a user to run the benchmarks and commit the updated results if they're happy with them. However, such a process implies that the developer runs the benchmarks locally, which the CI would then compare to running them on a server. There are very few situations, if any, where a developer's local machine resembles a CI build agent, so the benchmarks would either be unlikely to match or would require such a huge tolerance that the check becomes pointless.
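For what it's worth, the comparison step in that workflow is easy enough to sketch. Here is a minimal version, assuming both the committed baseline and the CI run have been exported as a flat name-to-mean-nanoseconds JSON map; the file names and the 20% tolerance are illustrative, not anything BenchmarkDotNet defines:

```csharp
// Hypothetical comparison step for the committed-baseline workflow above.
// "baseline.json" (committed to source control) and "current.json" (from
// the CI run) are assumed to be flat {"BenchmarkName": meanNanoseconds}
// maps produced by some earlier step; neither is a BenchmarkDotNet format.
using System;
using System.Collections.Generic;
using System.IO;
using System.Text.Json;

class BaselineGate
{
    const double Tolerance = 0.20; // fail on a >20% slowdown (illustrative)

    static int Main()
    {
        var baseline = Load("baseline.json");
        var current  = Load("current.json");

        var failed = false;
        foreach (var (name, oldMean) in baseline)
        {
            if (!current.TryGetValue(name, out var newMean)) continue;
            var change = (newMean - oldMean) / oldMean;
            if (change > Tolerance)
            {
                Console.Error.WriteLine($"{name} is {change:P0} slower than the baseline");
                failed = true;
            }
        }
        return failed ? 1 : 0; // a non-zero exit code turns the build red
    }

    static Dictionary<string, double> Load(string path) =>
        JsonSerializer.Deserialize<Dictionary<string, double>>(File.ReadAllText(path))!;
}
```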

I'd love to have such a feature; I just can't imagine how it could work. How do you envision it, @adamsitnik?

@mattwarren
Contributor

The basic idea is to compare benchmark results with previous results. If the difference is big enough, the CI build should go red, just as it would with a broken test.

Yeah, the persistence side of things is tricky and I'm not sure how, or even whether, BenchmarkDotNet can handle that. The best thing I can think of is a plugin that knows where to look for previous results and can compare the current run against them.

But it would probably be the responsibility of the CI infrastructure to ensure that previous Benchmark results were stored somewhere after BenchmarkDotNet had finished running.

Just for reference, at the moment these are the kinds of things BenchmarkDotNet can help you with inside a regular unit test (I know these aren't strictly unit tests, more like integration tests, but it's very easy to reuse the existing tooling, xUnit in this case):

  1. Check that Benchmark A runs faster than Benchmark B
  2. Ensure that a Benchmark runs within a certain time tolerance

So it's fairly primitive, but it's something at least.
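To make those two checks concrete, here is a rough sketch written as an xUnit test against the present-day BenchmarkDotNet API; MyBenchmarks and its A/B methods are invented for illustration, and exact property names have shifted between BenchmarkDotNet versions:

```csharp
// Rough sketch of the two checks inside an xUnit test. MyBenchmarks and
// its A/B methods are invented; the Summary/Reports/ResultStatistics
// properties are from the current BenchmarkDotNet API.
using System.Linq;
using BenchmarkDotNet.Attributes;
using BenchmarkDotNet.Running;
using Xunit;

public class MyBenchmarks
{
    [Benchmark] public int A() => Enumerable.Range(0, 100).Sum();
    [Benchmark] public int B() => Enumerable.Range(0, 1_000).Sum();
}

public class PerfTests
{
    [Fact]
    public void A_IsFasterThanB_AndWithinBudget()
    {
        var summary = BenchmarkRunner.Run<MyBenchmarks>();

        double MeanOf(string method) => summary.Reports
            .Single(r => r.BenchmarkCase.Descriptor.WorkloadMethod.Name == method)
            .ResultStatistics!.Mean; // mean time in nanoseconds

        // 1. Check that Benchmark A runs faster than Benchmark B.
        Assert.True(MeanOf("A") < MeanOf("B"));

        // 2. Ensure that a benchmark runs within a certain time tolerance.
        Assert.True(MeanOf("A") < 1_000_000); // < 1 ms, an illustrative budget
    }
}
```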

@smudge202

Perhaps two or three extensibility points could help achieve this, @mattwarren. For example, let's say BDN allows consumers to write their own benchmark result importers, as you've suggested. As a consumer, I would want to utilise the BDN API to specify that the imported benchmark should be compared to the current benchmark to determine the pass/fail of a test.

Similarly, I'd want to be able to override the comparison process so that any default tolerance mechanisms can be superseded.

We can already override the logger mechanisms as part of export, but (correct me if I'm wrong) what we don't have is a benchmark DTO that we could simply import/export. Instead, in order to export and persist benchmarks, I think we'd have to extrapolate (deserialize) the content of the log in order to build up such a DTO. Essentially, it would be better (hopefully not confusing?) if both the log and the DTO were available for export, and the ability to import said DTOs for comparison existed.
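To illustrate what I mean (all of these names are invented; nothing like them exists in BDN today):

```csharp
// None of these types exist in BenchmarkDotNet; this sketches the kind of
// DTO and extension points being proposed (all names are invented).
using System.Collections.Generic;

public class BenchmarkResultDto
{
    public string Name { get; set; } = "";
    public double MeanNs { get; set; }
    public double StdDevNs { get; set; }
    public string Environment { get; set; } = ""; // OS, CPU, runtime, ...
}

// Consumers supply where previous results come from...
public interface IBenchmarkImporter
{
    IReadOnlyList<BenchmarkResultDto> Import();
}

// ...and how old vs. new results decide pass/fail, so that any default
// tolerance mechanism can be superseded.
public interface IBenchmarkComparer
{
    bool Passes(BenchmarkResultDto previous, BenchmarkResultDto current);
}
```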

@adamsitnik
Member Author

I have just found a dotTrace plugin for TeamCity that does something very similar to what I wanted to get: link

"It’s also possible to compare measured values against values from previous successful builds."

I know that the persistence side would be tricky. Do you think that custom plugins don't have access to the CI's database, or cannot create their own during the installation process?

I have an initial idea for a simple proof of concept for TeamCity.

  • TC can run a .NET executable with given parameters (Runner type: .NET Process Runner).
  • It is possible to configure a TC build to publish artifacts (any files that our process has created, in our case benchmark results) & to get the address of the latest successful build's artifacts.

We could create a new BDN runner that would extend the existing one in the following way (a rough sketch follows the list):

  • Download the files from a given HTTP address (TeamCity artifacts are just hosted by the web server), passed to our new runner as a parameter.
  • Parse the files (as in the given unit test examples) to get the exact results.
  • Compare old results with new results.
  • Fail the process if the difference is bigger than a given threshold (which could also be passed as a parameter).
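A minimal sketch of that runner, assuming the previous run's results were published as a TeamCity artifact in a flat name-to-mean-nanoseconds JSON file; the URL, file names, and threshold argument all belong to this hypothetical tool, not to BenchmarkDotNet itself:

```csharp
// Minimal sketch of the proposed runner. The artifact URL, the JSON shape,
// and the threshold argument are assumptions of this hypothetical tool.
using System;
using System.Collections.Generic;
using System.IO;
using System.Net.Http;
using System.Text.Json;
using System.Threading.Tasks;

class CiBenchmarkRunner
{
    static async Task<int> Main(string[] args)
    {
        var artifactUrl = args[0];               // latest successful build's artifacts
        var threshold   = double.Parse(args[1]); // e.g. "0.2" for 20%

        // 1. Download the previous results from the TeamCity web server.
        using var http = new HttpClient();
        var oldJson = await http.GetStringAsync(artifactUrl);

        // 2. Parse old and new results (the new ones would come from the
        //    BenchmarkDotNet run that just finished; a file name is assumed).
        var oldResults = JsonSerializer.Deserialize<Dictionary<string, double>>(oldJson)!;
        var newResults = JsonSerializer.Deserialize<Dictionary<string, double>>(
            File.ReadAllText("BenchmarkRun-current.json"))!;

        // 3 + 4. Compare, and fail the process if the regression is too big.
        foreach (var (name, newMean) in newResults)
        {
            if (!oldResults.TryGetValue(name, out var oldMean)) continue;
            if ((newMean - oldMean) / oldMean > threshold)
            {
                Console.Error.WriteLine($"Regression in {name}");
                return 1; // non-zero exit code makes TeamCity fail the build
            }
        }
        return 0;
    }
}
```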

@TimirP

TimirP commented Oct 14, 2019

I'm looking for something so that I can run BenchmarkDotNet as an Azure WebJob or through an Azure Build Pipeline. I would like to benchmark the performance of my application running on Azure and accessing Azure resources. I can run it from my development machine.

@oformaniuk

I was looking for GitHub Actions support and found a PR to the Continuous Benchmark action: benchmark-action/github-action-benchmark/pull/34. I'll leave it here for further tracking.

@tomatopunk

tomatopunk commented Nov 23, 2020

Hello @adamsitnik

Now at the tail end of 2020, is there any good product or solution recommended by the community?

As described in your post, I would like to know when code changes decrease performance.

Ideally something as convenient as a unit test pipeline.

@p-bojkowski

Has anyone tried this?

Continuous Benchmark (.NET)

@oformaniuk

Has anyone tried this?

Continuous Benchmark (.NET)

Yep, I'm currently using it in Handlebars.Net.

@lipchev

lipchev commented May 3, 2021

Has anyone tried this?
Continuous Benchmark (.NET)

Yep, I'm currently using it in Handlebars.Net.

We're also exploring this in UnitsNet. I've also added a few steps for storing and comparing against an existing baseline (using dotnet/performance/tools/ResultsComparer, which I hope will find its place in a future release of BenchmarkDotNet).

However, unless you're running this in a stable self-hosted environment, the results are just way too noisy for any meaningful comparison (you could maybe use it to track overall performance across releases, but know that on any other commit you might end up on a different DSv2-series machine).

PS: In our scenario, I'm thinking of introducing something like a 'StableBaseline': a test method that benchmarks the performance of some immutable piece of code (a suitable candidate for our library might be something like Math.Min) that could be used to re-scale the other results. But that's just a hack...
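Something like the following, assuming the results are available as a name-to-mean map; the Rescaler step is hypothetical post-processing, not a BenchmarkDotNet feature:

```csharp
// Sketch of the 'StableBaseline' idea: benchmark an immutable workload and
// express every other result as a multiple of its mean, so that comparisons
// survive landing on differently sized CI machines. All names are invented.
using System;
using System.Collections.Generic;
using System.Linq;
using BenchmarkDotNet.Attributes;

public class StableBaselineBenchmark
{
    private readonly int[] _data = Enumerable.Range(0, 1_000).ToArray();

    [Benchmark]
    public int MathMinLoop() // the immutable piece of code, e.g. Math.Min
    {
        var min = int.MaxValue;
        foreach (var x in _data) min = Math.Min(min, x);
        return min;
    }
}

public static class Rescaler
{
    // Divide each mean by the stable baseline's mean; compare these ratios
    // across runs instead of raw nanoseconds.
    public static Dictionary<string, double> Rescale(
        IReadOnlyDictionary<string, double> meansNs, double stableBaselineMeanNs) =>
        meansNs.ToDictionary(kv => kv.Key, kv => kv.Value / stableBaselineMeanNs);
}
```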

@ghost

ghost commented Jun 27, 2022

An Azure DevOps/Pipelines task with a proper HTML report would be very nice.

@gioce90

gioce90 commented Jul 14, 2022

This issue was opened back in 2015. So old, and yet still so relevant. It is really strange that there is nothing out of the box to easily integrate BenchmarkDotNet with Azure DevOps Pipelines. Or, if it exists, it is not easy to find.

Reading this thread I found that there is something for GitHub Actions. Can anyone point me to something equivalent for Azure Pipelines?

@turowicz

An Azure DevOps/Pipelines task with a proper HTML report would be very nice.

this
