Benchmark noise

I am despairing at the nosiness of our benchmarks. For example, here are two different runs from *identical* code (I forgot to run pystats, so I re-ran the benchmark, and since yesterday's run one commit was added and then reverted):

- [yesterday](https://github.com/faster-cpython/benchmarking/blob/main/results/bm-20230207-3.12.0a5%2B-9595e01/bm-20230207-linux-x86_64-gvanrossum-call_family-3.12.0a5%2B-9595e01-vs-base.png)
- [today](https://github.com/faster-cpython/benchmarking/blob/main/results/bm-20230208-3.12.0a5%2B-cd69634/bm-20230208-linux-x86_64-gvanrossum-call_family-3.12.0a5%2B-cd69634-vs-base.png)

The [commit merge base](https://github.com/gvanrossum/cpython/commit/a9f01448a99c6a2ae34d448806176f2df3a5b323) is the same too.

How can we derive useful signal from this data?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Benchmark noise #551

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Benchmark noise #551

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions