Improve Benchmark Accuracy #2336

timcassell · 2023-06-21T07:58:00Z

Fixes #1133
Fixes #2305
Fixes #1802
Fixes #2530

Drastically simplified benchmark type code gen.
Overhead return type is always void for a fair baseline for all benchmark return types.
Benchmark method is called directly instead of wrapped in a delegate.
Return value is popped off the stack instead of passed to Consumer or a NoInlining method.
Added an assembly weaver that automatically applies MethodImplOptions.NoInlining to benchmark methods.
Added additional tests.
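
As a rough illustration of the points above, the sketch below writes by hand what the description amounts to: a benchmark method marked `MethodImplOptions.NoInlining`, called directly (no delegate) in a count-down loop, with its return value discarded. The names `SampleBenchmarks`, `Increment`, `Sketch`, and `WorkloadLoop` are hypothetical and are not taken from BenchmarkDotNet's generated code; in the PR the attribute is applied by the weaver rather than written in source.

```csharp
using System;
using System.Reflection;
using System.Runtime.CompilerServices;

public class SampleBenchmarks
{
    private int _counter;

    // Hypothetical benchmark method. The PR's weaver is described as adding
    // NoInlining automatically; here it is written by hand for illustration.
    [MethodImpl(MethodImplOptions.NoInlining)]
    public int Increment() => ++_counter;
}

public static class Sketch
{
    // Calls the method directly (no delegate) and discards the result,
    // counting down to zero instead of counting up.
    public static void WorkloadLoop(SampleBenchmarks instance, long invokeCount)
    {
        for (; invokeCount > 0; invokeCount--)
        {
            _ = instance.Increment();
        }
    }

    // Reports whether the NoInlining flag is set in the method's metadata.
    public static bool IsNoInlining(MethodInfo method) =>
        (method.GetMethodImplementationFlags() & MethodImplAttributes.NoInlining) != 0;

    public static void Main()
    {
        MethodInfo method = typeof(SampleBenchmarks).GetMethod(nameof(SampleBenchmarks.Increment))
            ?? throw new InvalidOperationException("Method not found.");
        Console.WriteLine($"NoInlining set: {IsNoInlining(method)}");
        WorkloadLoop(new SampleBenchmarks(), 1000);
    }
}
```

A reflection check like `IsNoInlining` is one way to confirm, after a build, that the flag actually ended up on a given method.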

tests/BenchmarkDotNet.IntegrationTests.ManualRunning/ExpectedBenchmarkResultsTests.cs

timcassell · 2025-02-13T21:56:38Z

Unfortunately, I no longer have an Intel CPU to test this, but I think the new results from this PR should address the issues you had in #2334 @AndreyAkinshin (results look good on my AMD 9800X3D CPU in .NET 8).

timcassell · 2025-05-29T15:30:05Z

@AndyAyersMS Adam mentioned you are the expert I should tag to get your opinion on things like this. Do you have any thoughts or concerns about this approach?

AndyAyersMS · 2025-05-29T17:58:50Z

> Do you have any thoughts or concerns about this approach?

I am not that familiar with how robust the weaving process is... maybe it would be good (if possible) to validate it on say the dotnet/performance suite of 5000ish benchmarks. I guess it also means there is no simple faithful source representation for the benchmark assembly, which might trip up some users that keep the sources around and use those for investigations?

What happens if you just compile and run those sources?

Do you have examples showing the impact on reported results?

> Overhead return type is always void for a fair baseline for all benchmark return types.

Can you explain this a bit more? The overhead is now some "fixed" workload measurement regardless of benchmark method signature?

timcassell · 2025-05-29T21:08:41Z

> I am not that familiar with how robust the weaving process is... maybe it would be good (if possible) to validate it on say the dotnet/performance suite of 5000ish benchmarks.

I can give it a try.

> I guess it also means there is no simple faithful source representation for the benchmark assembly, which might trip up some users that keep the sources around and use those for investigations?

Could you elaborate? The weaver just adds NoInlining to [Benchmark]-annotated methods. Yes, the result is not an exact 1:1 match of the source code, but C# doesn't provide a way for us to do this ourselves, even with source generators. It's a small change, and I'm not sure how it would affect users doing that kind of deep investigation. I would expect anyone doing that to be a power user anyway.

> What happens if you just compile and run those sources?

What do you mean? If the source project has a reference to BenchmarkDotNet.Weaver (transitive or otherwise) and it's built with msbuild, the assembly weaver will kick in whether they actually run the benchmarks or not. If they manually instantiate and use a benchmark class, it will just not inline the methods. All other behavior will be unaffected.

> Do you have examples showing the impact on reported results?

With the benchmarks from #1133, there is virtually no impact on my Ryzen 9800X3D, and results are slightly more accurate on Apple M3 (a CPU architecture for which BDN seems to have overhead measurement issues with nano benchmarks). It seems to mostly impact Intel CPUs and older (pre-Zen) AMD CPUs (see the results I obtained previously in #2334; unfortunately, I no longer have those old CPUs to test again).

Apple M3 results

master:

| Method | Mean | Error | StdDev |
|---|---|---|---|
| Increment01 | 0.6632 ns | 0.0378 ns | 0.0711 ns |
| Increment02 | 0.0000 ns | 0.0000 ns | 0.0000 ns |
| Increment03 | 0.0000 ns | 0.0000 ns | 0.0000 ns |
| Increment04 | 0.0000 ns | 0.0000 ns | 0.0000 ns |
| Increment05 | 0.2523 ns | 0.0002 ns | 0.0002 ns |
| Increment06 | 0.5138 ns | 0.0003 ns | 0.0003 ns |

PR:

| Method | Mean | Error | StdDev |
|---|---|---|---|
| Increment01 | 0.0000 ns | 0.0000 ns | 0.0000 ns |
| Increment02 | 0.0000 ns | 0.0000 ns | 0.0000 ns |
| Increment03 | 0.0000 ns | 0.0000 ns | 0.0000 ns |
| Increment04 | 0.1300 ns | 0.0009 ns | 0.0007 ns |
| Increment05 | 0.3994 ns | 0.0002 ns | 0.0005 ns |
| Increment06 | 0.6680 ns | 0.0010 ns | 0.0009 ns |

As for returning a value (like the issue #2305), I'm not sure if my 9800X3D is the best representative CPU compared to what dotnet/performance runs on, but I got these results:

[Benchmark] public Vector3 Vector3() => default;

master:

| Method | Mean | Error | StdDev |
|---|---|---|---|
| Vector3 | 0.0146 ns | 0.0026 ns | 0.0022 ns |

PR:

| Method | Mean | Error | StdDev | Median |
|---|---|---|---|---|
| Vector3 | 0.0000 ns | 0.0001 ns | 0.0001 ns | 0.0000 ns |

> > Overhead return type is always void for a fair baseline for all benchmark return types.
>
> Can you explain this a bit more? The overhead is now some "fixed" workload measurement regardless of benchmark method signature?

It's a fix for #2305. Not quite: it still passes all the same arguments; only the return type is fixed to void. My previous attempt in #2309 was a failure. Essentially, the overhead measurement disregards the cost of the return type, so the reported result includes that cost. Practically, you could reliably measure the cost of returning a large struct, ~~which is currently impossible~~ which currently gives weird results.
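
To make the baseline idea concrete, here is a minimal sketch under stated assumptions: the workload returns a large struct, while the overhead method takes the same arguments but returns `void`, so subtracting overhead from workload leaves the cost of producing and returning the value in the result. `BigStruct`, `OverheadSketch`, `Workload`, and `Overhead` are hypothetical names, and the naive `Stopwatch` timing here is only for illustration; BenchmarkDotNet's generated code and measurement engine are considerably more involved.

```csharp
using System;
using System.Diagnostics;
using System.Runtime.CompilerServices;

// Hypothetical 64-byte struct standing in for a "large" return type.
public struct BigStruct
{
    public long A, B, C, D, E, F, G, H;
}

public static class OverheadSketch
{
    // Workload: returns the struct by value.
    [MethodImpl(MethodImplOptions.NoInlining)]
    public static BigStruct Workload(long seed) => new BigStruct { A = seed, H = seed };

    // Overhead: same parameter list as the workload, but always void.
    [MethodImpl(MethodImplOptions.NoInlining)]
    public static void Overhead(long seed) { }

    public static TimeSpan MeasureWorkload(long invokeCount)
    {
        var stopwatch = Stopwatch.StartNew();
        for (; invokeCount > 0; invokeCount--)
        {
            _ = Workload(invokeCount); // result discarded, not consumed
        }
        return stopwatch.Elapsed;
    }

    public static TimeSpan MeasureOverhead(long invokeCount)
    {
        var stopwatch = Stopwatch.StartNew();
        for (; invokeCount > 0; invokeCount--)
        {
            Overhead(invokeCount);
        }
        return stopwatch.Elapsed;
    }

    public static void Main()
    {
        const long n = 10_000_000;
        TimeSpan workload = MeasureWorkload(n);
        TimeSpan overhead = MeasureOverhead(n);
        // The difference still contains the cost of returning BigStruct.
        double nsPerOp = (workload - overhead).TotalMilliseconds * 1_000_000.0 / n;
        Console.WriteLine($"approx. ns/op after overhead subtraction: {nsPerOp}");
    }
}
```

Because both methods share a parameter list, argument passing cancels out in the subtraction and only the return-value cost (plus the method body) remains.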

timcassell · 2025-05-31T16:51:53Z

@AndyAyersMS Ignoring net462 (dotnet/performance#4869), I tried a build with these changes, and I got this error:

Mono.Cecil.ResolutionException: Failed to resolve System.IO.FileOptions

I can't find anything about System.IO.FileOptions, except an enum, which I think is weird because it's not an assembly. Any ideas?

AndyAyersMS · 2025-05-31T16:57:32Z

> @AndyAyersMS Ignoring net462 (dotnet/performance#4869), I tried a build with these changes, and I got this error.
>
> Mono.Cecil.ResolutionException: Failed to resolve System.IO.FileOptions
>
> I can't find anything about System.IO.FileOptions, except an enum, which I think is weird because it's not an assembly. Any ideas?

@LoopedBard3 any ideas? Android support?

Copilot

Pull Request Overview

A concise summary:

Simplifies benchmark type code generation by removing legacy helpers, unifying loop IL generation, and using direct stack operations.
Streamlines declarations providers to just Sync/Async variants, dropping multiple specialized classes.
Introduces a Weaver task (WeaveAssemblyTask) in BenchmarkDotNet.Weaver to automatically apply NoInlining to all [Benchmark] methods.

Reviewed Changes

Copilot reviewed 45 out of 45 changed files in this pull request and generated no comments.

| File | Description |
|---|---|
| src/BenchmarkDotNet/Helpers/Reflection.Emit/MethodBuilderExtensions.cs | Removed unused `GetParameterTypes` extension and `System` using. |
| src/BenchmarkDotNet/Helpers/Reflection.Emit/IlGeneratorStatementExtensions.cs | Refactored loop emission from index-based to decrement-based (`EmitLoopBeginFromArgToZero` / `EmitLoopEndFromArgToZero`). |
| src/BenchmarkDotNet/Helpers/Reflection.Emit/IlGeneratorEmitOpExtensions.cs | Replaced large indirect store switch with `EmitStarg` helper. |
| src/BenchmarkDotNet/Helpers/Reflection.Emit/IlGeneratorDefaultValueExtensions.cs | Deleted obsolete default-value IL emission helpers. |
| src/BenchmarkDotNet/Helpers/Reflection.Emit/IlGeneratorCallExtensions.cs | Removed unused `DeclareOptionalLocalForInstanceCall`. |
| src/BenchmarkDotNet/Engines/Consumer.cs | Stripped out unused `SupportedTypes` and helper methods. |
| src/BenchmarkDotNet/Code/DeclarationsProvider.cs | Collapsed multiple provider subclasses into `SyncDeclarationsProvider` and `AsyncDeclarationsProvider`. |
| src/BenchmarkDotNet/Code/CodeGenerator.cs | Removed conditional compilation and wiring to old providers; updated provider selection logic. |
| src/BenchmarkDotNet.Weaver/src/WeaveAssemblyTask.cs | Added `WeaveAssemblyTask` MSBuild task and custom resolver to apply `NoInlining`. |

Comments suppressed due to low confidence (3)

src/BenchmarkDotNet/Helpers/Reflection.Emit/IlGeneratorStatementExtensions.cs:123

[nitpick] The inline IL comment references IL_0002, which no longer matches the updated loopStartLabel. Update or remove this outdated offset comment to avoid confusion.

//                IL_0011: bge.s IL_0002

src/BenchmarkDotNet/Code/DeclarationsProvider.cs:51

SyncDeclarationsProvider does not implement the abstract members ReturnsDefinition, OverheadMethodReturnType, and OverheadImplementation from DeclarationsProvider, which will cause compilation errors.

internal class SyncDeclarationsProvider : DeclarationsProvider

src/BenchmarkDotNet/Code/DeclarationsProvider.cs:56

AsyncDeclarationsProvider is missing overrides for the abstract properties ReturnsDefinition and OverheadImplementation from DeclarationsProvider, leading to incomplete implementation.

internal class AsyncDeclarationsProvider : DeclarationsProvider

timcassell mentioned this pull request Jun 21, 2023

Overhead match workload #2309

Closed

AndreyAkinshin force-pushed the master branch from 6cb423a to 6291a7e July 5, 2023 19:27

timcassell added the Area:CodeGen label Jul 19, 2023

timcassell marked this pull request as draft July 24, 2023 15:45

timcassell force-pushed the fair-types branch from 24cdd34 to bd88b1e August 1, 2023 07:26

timcassell marked this pull request as ready for review August 1, 2023 07:26

This was referenced Aug 9, 2023

[Perf] Linux/arm64: 268 Regressions on 7/29/2023 7:04:01 PM dotnet/runtime#89940

Closed

JIT produces different asm from IL emit than from source dotnet/runtime#89685

Open

timcassell added this to the v0.14.0 milestone Jan 14, 2024

timcassell mentioned this pull request Jan 22, 2024

Call benchmark method directly #2334

Closed

timcassell modified the milestones: v0.15.x, v0.14.0 Mar 6, 2024

AndreyAkinshin reviewed Mar 7, 2024

tests/BenchmarkDotNet.IntegrationTests.ManualRunning/ExpectedBenchmarkResultsTests.cs (outdated)

timcassell force-pushed the fair-types branch 2 times, most recently from 48266cd to 4abbb1f March 11, 2024 14:08

timcassell modified the milestones: v0.14.0, v0.15.x Aug 6, 2024

timcassell force-pushed the fair-types branch from 4abbb1f to 213549a August 30, 2024 01:48

timcassell force-pushed the fair-types branch from 213549a to 7080e63 January 11, 2025 03:06

timcassell mentioned this pull request Jan 11, 2025

Support for F# anonymous records #2530

Closed

timcassell marked this pull request as draft January 13, 2025 00:58

timcassell mentioned this pull request Feb 13, 2025

Constant stack size #2688

Merged

timcassell force-pushed the fair-types branch from 3e4f122 to e3e774b February 13, 2025 20:24

timcassell changed the title ~~Fair Return Types~~ Improve Benchmark Accuracy Feb 13, 2025

timcassell marked this pull request as ready for review February 13, 2025 21:53

timcassell requested a review from AndreyAkinshin February 13, 2025 22:36

timcassell force-pushed the fair-types branch from e3e774b to f1a89ed February 13, 2025 22:39

timcassell force-pushed the fair-types branch from d2b612f to 374055e March 4, 2025 08:26

timcassell force-pushed the fair-types branch from 374055e to ddc2b98 March 4, 2025 08:29

timcassell added 10 commits May 29, 2025 15:53

Call benchmark method directly instead of via delegate.

0b2d290

Removed Consumer from benchmark actions.

c59b5f2

Overhead always returns `void`.

Removed workload return types from code gen.

bea7d64

Apply NoOptimization instead of AggressiveOptimization to loop methods. Count down loops instead of count up. Added IntroSmokeStringBuilder. Added more return type test cases.

87dc695

Added assembly weaver.

3fdf09d

Reverted loop methods back to `AggressiveOptimization`. Added `NoInlining` to `__Overhead` to match weaved benchmark method. Updated ExpectedBenchmarkResultsTests.

Remove unused constant.

d6153a1

Remove InProcess check from validation.

1eb3006

Update Weaver.

Fix comment.

b9acb5c

Zero threshold.

7920bef

Update built package version.

82e9d49

timcassell force-pushed the fair-types branch from c6855aa to 82e9d49 May 29, 2025 20:17

Fix test.

12176a6

timcassell force-pushed the fair-types branch from ddf03a6 to 12176a6 May 30, 2025 00:37

AndreyAkinshin requested a review from Copilot June 1, 2025 10:02

Copilot AI reviewed Jun 1, 2025
