Regressions in System.MathBenchmarks.Double #85985

performanceautofiler · 2023-05-09T11:26:52Z

Run Information

Name	Value
Architecture	x64
OS	Windows 10.0.18362
Queue	TigerWindows
Baseline	3695b6ddd869e53eb663f7674e0f130eaf895b03
Compare	3e8f17a65a068fca3d19fa5cd43a7e1cd414a5ae
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in System.MathBenchmarks.Double

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio	Baseline ETL	Compare ETL
Sqrt - Duration of single invocation	9.35 μs	28.22 μs	3.02	0.00	True	30934.800080369703	32743.496672716275	1.0584680226685648)	Trace	Trace

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

Payloads

Baseline
Compare

git clone https://github.com/dotnet/performance.git
py .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.MathBenchmarks.Double*'

Payloads

Baseline
Compare

Histogram

System.MathBenchmarks.Double.Sqrt

Description of detection logic

IsRegressionBase: Marked as regression because the compare was 5% greater than the baseline, and the value was not too small.
IsRegressionChecked: Marked as regression because the three check build points were 0.05 greater than the baseline.
IsImprovementBase: Marked as not an improvement because the compare was not 5% less than the baseline, or the value was too small.
IsRegressionBase: Marked as regression because the compare was 5% greater than the baseline, and the value was not too small.
IsRegressionChecked: Marked as regression because the three check build points were 0.05 greater than the baseline.
IsRegressionWindowed: Marked as regression because 28.217955081921694 > 9.81972192923399.
IsChangePoint: Marked as a change because one of 5/1/2023 6:56:14 PM, 5/9/2023 7:24:34 AM falls between 4/30/2023 6:17:41 PM and 5/9/2023 7:24:34 AM.
IsRegressionStdDev: Marked as regression because -2456.13357226457 (T) = (0 -28234.404781782963) / Math.Sqrt((0.02128795533441095 / (14)) + (1359.3234780046578 / (23))) is less than -2.0301079282477414 = MathNet.Numerics.Distributions.StudentT.InvCDF(0, 1, (14) + (23) - 2, .025) and -2.0190451457621967 = (9352.097573438183 - 28234.404781782963) / 9352.097573438183 is less than -0.05.
IsImprovementBase: Marked as not an improvement because the compare was not 5% less than the baseline, or the value was too small.
IsChangeEdgeDetector: Marked as regression because Edge Detector said so.

JIT Disasms

Baseline
Compare
Diff

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

The text was updated successfully, but these errors were encountered:

cincuranet · 2023-05-09T16:23:38Z

The commit range is 6f19e37...da0aa0c.

tannergooding · 2023-05-09T16:38:01Z

There is some new T0 code in the diffs for the regression (such as LdelemaRef): https://perfsupport.azurewebsites.net/diff?old=https%3A%2F%2Fpvscmdupload.blob.core.windows.net%2Fautofilereport%2Fjitdasms%2F05_09_2023%2FSystem_MathBenchmarks_Double_Sqrt_baseline_a81af296-a311-4f09-861a-365111575051.log&new=https%3A%2F%2Fpvscmdupload.blob.core.windows.net%2Fautofilereport%2Fjitdasms%2F05_09_2023%2FSystem_MathBenchmarks_Double_Sqrt_compare_a81af296-a311-4f09-861a-365111575051.log

But for the actual test code in question, there is no assembly diffs, just a couple small disasm formatting diffs

cincuranet · 2023-05-09T16:52:59Z

More instances:

[Perf] Linux/x64: 9 Regressions on 5/2/2023 12:34:35 AM perf-autofiling-issues#17573

ghost · 2023-05-10T15:35:10Z

Tagging subscribers to this area: @dotnet/area-system-runtime
See info in area-owners.md if you want to be subscribed.

Issue Details

Run Information

Name	Value
Architecture	x64
OS	Windows 10.0.18362
Queue	TigerWindows
Baseline	3695b6ddd869e53eb663f7674e0f130eaf895b03
Compare	3e8f17a65a068fca3d19fa5cd43a7e1cd414a5ae
Diff	Diff
Configs	CompilationMode:tiered, RunKind:micro

Regressions in System.MathBenchmarks.Double

Benchmark	Baseline	Test	Test/Base	Test Quality	Edge Detector	Baseline IR	Compare IR	IR Ratio	Baseline ETL	Compare ETL
Sqrt - Duration of single invocation	9.35 μs	28.22 μs	3.02	0.00	True	30934.800080369703	32743.496672716275	1.0584680226685648)	Trace	Trace

Test Report

Repro

General Docs link: https://github.com/dotnet/performance/blob/main/docs/benchmarking-workflow-dotnet-runtime.md

Payloads

Baseline
Compare

git clone https://github.com/dotnet/performance.git
py .\performance\scripts\benchmarks_ci.py -f net8.0 --filter 'System.MathBenchmarks.Double*'

Payloads

Baseline
Compare

Histogram

System.MathBenchmarks.Double.Sqrt

Description of detection logic

IsRegressionBase: Marked as regression because the compare was 5% greater than the baseline, and the value was not too small.
IsRegressionChecked: Marked as regression because the three check build points were 0.05 greater than the baseline.
IsImprovementBase: Marked as not an improvement because the compare was not 5% less than the baseline, or the value was too small.
IsRegressionBase: Marked as regression because the compare was 5% greater than the baseline, and the value was not too small.
IsRegressionChecked: Marked as regression because the three check build points were 0.05 greater than the baseline.
IsRegressionWindowed: Marked as regression because 28.217955081921694 > 9.81972192923399.
IsChangePoint: Marked as a change because one of 5/1/2023 6:56:14 PM, 5/9/2023 7:24:34 AM falls between 4/30/2023 6:17:41 PM and 5/9/2023 7:24:34 AM.
IsRegressionStdDev: Marked as regression because -2456.13357226457 (T) = (0 -28234.404781782963) / Math.Sqrt((0.02128795533441095 / (14)) + (1359.3234780046578 / (23))) is less than -2.0301079282477414 = MathNet.Numerics.Distributions.StudentT.InvCDF(0, 1, (14) + (23) - 2, .025) and -2.0190451457621967 = (9352.097573438183 - 28234.404781782963) / 9352.097573438183 is less than -0.05.
IsImprovementBase: Marked as not an improvement because the compare was not 5% less than the baseline, or the value was too small.
IsChangeEdgeDetector: Marked as regression because Edge Detector said so.

JIT Disasms

Baseline
Compare
Diff

Docs

Profiling workflow for dotnet/runtime repository
Benchmarking workflow for dotnet/runtime repository

Author:	performanceautofiler[bot]
Assignees:	-
Labels:	`area-System.Runtime`, `os-windows`, `arch-x64`, `untriaged`, `runtime-coreclr`, `needs-area-label`
Milestone:	-

tannergooding · 2023-07-24T23:36:34Z

No codegen differences. Only difference is BDN changed how many operations its executing

CC. @adamsitnik

adamsitnik · 2023-07-25T07:40:41Z

Only difference is BDN changed how many operations its executing

I am not sure if I follow. BDN estimates how many invocations should be performed per iteration (250ms), it almost always changes between runs but this should not have a big impact on the reported time.

tannergooding · 2023-07-25T15:13:13Z

There were 0 changes to codegen for the actual benchmark itself between base and diff, the only actual change is that we have some more methods (on the general startup path) that start in T0, rather than starting in T1 (were no longer marked AggressiveOptimization).

We semi-regularly see cases like this in the weekly triage. We also see cases where BDN doesn't work as expected with functions that have very small execution times (dotnet/BenchmarkDotNet#1802), which in turn can impact the overhead measurement of an empty call.

You can see the asm diff here: https://perfsupport.azurewebsites.net/diff?old=https%3A%2F%2Fpvscmdupload.blob.core.windows.net%2Fautofilereport%2Fjitdasms%2F05_09_2023%2FSystem_MathBenchmarks_Double_Sqrt_baseline_a81af296-a311-4f09-861a-365111575051.log&new=https%3A%2F%2Fpvscmdupload.blob.core.windows.net%2Fautofilereport%2Fjitdasms%2F05_09_2023%2FSystem_MathBenchmarks_Double_Sqrt_compare_a81af296-a311-4f09-861a-365111575051.log

The methods that are actually being measured can be seen by searching for sqrts.

performanceautofiler bot assigned AndyAyersMS May 9, 2023

performanceautofiler bot added arch-x64 os-windows runtime-coreclr specific to the CoreCLR runtime untriaged New issue has not been triaged by the area owner labels May 9, 2023

cincuranet changed the title ~~[Perf] Windows/x64: 1 Regression on 5/2/2023 12:34:35 AM~~ Regressions in System.MathBenchmarks.Double May 9, 2023

cincuranet removed the untriaged New issue has not been triaged by the area owner label May 9, 2023

cincuranet unassigned AndyAyersMS May 9, 2023

cincuranet transferred this issue from dotnet/perf-autofiling-issues May 9, 2023

dotnet-issue-labeler bot added the needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners label May 9, 2023

ghost added the untriaged New issue has not been triaged by the area owner label May 9, 2023

jeffschwMSFT added the area-System.Runtime label May 10, 2023

vcsjones removed the needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners label May 11, 2023

tannergooding removed the untriaged New issue has not been triaged by the area owner label May 26, 2023

ericstj added this to the 8.0.0 milestone Jul 24, 2023

tannergooding closed this as completed Jul 24, 2023

ghost locked as resolved and limited conversation to collaborators Aug 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Regressions in System.MathBenchmarks.Double #85985

Regressions in System.MathBenchmarks.Double #85985

performanceautofiler bot commented May 9, 2023

Payloads

Histogram

System.MathBenchmarks.Double.Sqrt

Description of detection logic

JIT Disasms

Docs

cincuranet commented May 9, 2023

tannergooding commented May 9, 2023

cincuranet commented May 9, 2023

ghost commented May 10, 2023

Run Information

Regressions in System.MathBenchmarks.Double

Repro

Payloads

Payloads

Histogram

System.MathBenchmarks.Double.Sqrt

Description of detection logic

JIT Disasms

Docs

tannergooding commented Jul 24, 2023

adamsitnik commented Jul 25, 2023

tannergooding commented Jul 25, 2023

Regressions in System.MathBenchmarks.Double #85985

Regressions in System.MathBenchmarks.Double #85985

Comments

performanceautofiler bot commented May 9, 2023

Run Information

Regressions in System.MathBenchmarks.Double

Repro

Payloads

Payloads

Histogram

System.MathBenchmarks.Double.Sqrt

Description of detection logic

JIT Disasms

Docs

cincuranet commented May 9, 2023

tannergooding commented May 9, 2023

cincuranet commented May 9, 2023

ghost commented May 10, 2023

Run Information

Regressions in System.MathBenchmarks.Double

Repro

Payloads

Payloads

Histogram

System.MathBenchmarks.Double.Sqrt

Description of detection logic

JIT Disasms

Docs

tannergooding commented Jul 24, 2023

adamsitnik commented Jul 25, 2023

tannergooding commented Jul 25, 2023