Port yield normalization from CoreCLR to Native AOT #103675

eduardo-vp · 2024-06-18T23:21:34Z

Porting the current way yield normalization is done to Native AOT.

The CoreCLR implementation was moved to src/coreclr/vm/yieldprocessornormalizedshared.cpp.

Both the CoreCLR file (src/coreclr/vm/yieldprocessornormalized.cpp) and the Native AOT file (src/coreclr/nativeaot/Runtime/yieldprocessornormalized.cpp) now share the same implementation.

dotnet-policy-service · 2024-06-18T23:30:29Z

Tagging subscribers to this area: @mangod9
See info in area-owners.md if you want to be subscribed.

src/coreclr/inc/yieldprocessornormalized.h

src/coreclr/vm/finalizerthread.h

src/coreclr/inc/yieldprocessornormalized.h

jkotas · 2024-06-22T01:30:36Z

src/coreclr/nativeaot/Runtime/FinalizerHelpers.cpp

@@ -46,9 +46,6 @@ uint32_t WINAPI FinalizerStart(void* pContext)

    g_pFinalizerThread = PTR_Thread(pThread);

-    // We have some time until the first finalization request - use the time to calibrate normalized waits.
-    EnsureYieldProcessorNormalizedInitialized();


How is the measurement going to be triggered when this is deleted?

I'm still trying to figure this out, I'm not very familiar with Native AOT in general so I'd appreciate any suggestions

It looks like we would need to call YieldProcessorNormalization::PerformMeasurement() from here or add a EnsureYieldProcessorNormalizedInitialized() entry point to the new code that simply calls YieldProcessorNormalization::PerformMeasurement()

Do you happen to know if this function is called every ~4 seconds or faster than that? Currently we let YieldProcessorNormalization::PerformMeasurement() run every ~4 s so if that's the case, I believe we may add here the same call as in CoreCLR

if (YieldProcessorNormalization::IsMeasurementScheduled()) { GCX_PREEMP(); YieldProcessorNormalization::PerformMeasurement(); }

FinalizerStart function is called once per process. It is equivalent of FinalizerThreadStart function in regular CoreCLR.

I think you want to follow the same structure as in regular CoreCLR: Trigger the measurement from ScheduleMeasurementIfNecessary by calling RhEnableFinalization (it is equivalent of FinalizerThread::EnableFinalization in regular CoreCLR) and then add the measurement to loop in ProcessFinalizers().

Do you happen to know if this function is called every ~4 seconds or faster than that?

I am not sure. The whole deal with measuring duration of something that is proportional to CPU cycle is not very precise, since the CPU cycle can change drastically and many times per second and will be different for every core. Unless machine is configured into HighPerformance power plan, every measurement is a bit of a coin toss and will produce the same result with the same error margins.

The main purpose of calibration is to continue using historically hard-coded spin counts in numerous places where we spinwait while allowing that to work on systems with vastly different pause durations (i.e. on post-skylake intel CPUs pause takes ~140 cycles, pre-skylake is about ~10 cycles). For such purpose the callibration is precise enough.

I am not sure about the value of redoing the measurement over and over.
Perhaps to support scenarios where a VM is migrated between pre/post skylake machines.

I guess we can add a periodic call PerformMeasurement in NativeAOT and see what happens.

My guess - nothing will change, just a bit more time spent in PerformMeasurement.

There is value in having the same behavior though.
If the re-measuring (or the whole calibration deal) could be somehow avoided or improved, it would make sense to do for both runtimes.

IIRC there's a good reason to keep re-doing measurements, so probably keeping this behaviour in Native AOT would be better, I believe @kouvel or @mangod9 may elaborate better

The measurements done are very short and can be perturbed by CPU activity, the rolling min helps to stabilize it over time.

src/coreclr/nativeaot/Runtime/yieldprocessornormalized.cpp

src/coreclr/vm/yieldprocessornormalizedshared.cpp

src/coreclr/inc/yieldprocessornormalized.h

src/coreclr/nativeaot/Runtime/FinalizerHelpers.cpp

src/coreclr/nativeaot/Runtime/windows/PalRedhawkInline.h

src/coreclr/vm/yieldprocessornormalizedshared.cpp

src/coreclr/vm/synch.h

src/coreclr/nativeaot/Runtime/FinalizerHelpers.cpp

src/coreclr/vm/yieldprocessornormalizedshared.cpp

src/coreclr/utilcode/yieldprocessornormalized.cpp

src/coreclr/nativeaot/Runtime/MiscHelpers.cpp

src/coreclr/nativeaot/Runtime/startup.cpp

src/coreclr/nativeaot/Runtime/windows/PalRedhawkInline.h

kouvel

LGTM, thanks!

jkotas · 2024-07-04T02:38:00Z

What kind of testing you have done on the change to validate that it works as expected? Do we expect improvements in any perf benchmarks?

kouvel · 2024-07-04T02:58:50Z

I don't think there would be any changes to benchmarks. I would expect that the CPU time spent during startup in the measurements would be a lot less (the new scheme measures lazily, and in narrower windows), that's about it. It would be good to measure that.

eduardo-vp · 2024-07-17T22:25:02Z

I tested the following snippet and I checked that the 8 initial measurements were done and subsequent measurements every ~4 seconds were done as well.

using System;
using System.Threading;

int minutesToSpin = 10;
int startTicks = Environment.TickCount;
while (Environment.TickCount - startTicks < minutesToSpin * 60 * 1000)
{
    Thread.SpinWait(1000);
    Thread.Sleep(2000);
}

…03675)" This reverts commit d35f302.

Initial commit

392c652

dotnet-issue-labeler bot added the needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners label Jun 18, 2024

dotnet-policy-service bot assigned eduardo-vp Jun 18, 2024

eduardo-vp added area-System.Threading and removed needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners labels Jun 18, 2024

This was referenced Jun 19, 2024

The Operation will be canceled. The next steps may not contain expected logs. dotnet/dnceng#3008

Open

GC/Regressions/v2.0-beta2/452950 failed in CI #103494

Closed

LibraryTests (mostly) timing out #103674

Closed

Eduardo Manuel Velarde Polar added 3 commits June 19, 2024 14:08

Use PalGetTickCount64

9237b9f

Add limits.h

830d8d0

Declare g_pFinalizerThread for Windows only

d0a884c

jkotas reviewed Jun 19, 2024

View reviewed changes

src/coreclr/inc/yieldprocessornormalized.h Outdated Show resolved Hide resolved

jkotas reviewed Jun 19, 2024

View reviewed changes

src/coreclr/vm/finalizerthread.h Outdated Show resolved Hide resolved

jkotas reviewed Jun 19, 2024

View reviewed changes

src/coreclr/vm/finalizerthread.h Outdated Show resolved Hide resolved

Eduardo Manuel Velarde Polar added 4 commits June 19, 2024 16:19

PR comments

165bbb9

Fix build/x86

b089bac

Remove finalizer thread from native aot

f4ed8e8

Remove unnecessary code

b646568

build-analysis bot mentioned this pull request Jun 22, 2024

System.Numerics.Tensors.Tests.TensorSpanTests test failure #103525

Closed

jkotas reviewed Jun 22, 2024

View reviewed changes

jkotas requested review from VSadov and kouvel June 22, 2024 01:34

Eduardo Manuel Velarde Polar added 2 commits June 21, 2024 19:23

PR comments + Fix InterlockedExchange

4127158

Add TODOs

a862782

jkotas reviewed Jun 24, 2024

View reviewed changes

src/coreclr/vm/yieldprocessornormalizedshared.cpp Outdated Show resolved Hide resolved

jkotas reviewed Jun 24, 2024

View reviewed changes

src/coreclr/vm/yieldprocessornormalizedshared.cpp Outdated Show resolved Hide resolved

jkotas reviewed Jun 24, 2024

View reviewed changes

src/coreclr/inc/yieldprocessornormalized.h Outdated Show resolved Hide resolved

Eduardo Manuel Velarde Polar added 2 commits June 24, 2024 22:49

Use max/min and RhEnableFinalization

73d3d71

Remove TODO

0d226da

jkotas reviewed Jun 27, 2024

View reviewed changes

src/coreclr/nativeaot/Runtime/FinalizerHelpers.cpp Show resolved Hide resolved

Move PerformMeasurement

6519b0b

build-analysis bot mentioned this pull request Jun 28, 2024

The job running on agent NetCore-Public ran longer than the maximum time #104044

Closed

jkotas reviewed Jun 29, 2024

View reviewed changes

src/coreclr/nativeaot/Runtime/windows/PalRedhawkInline.h Outdated Show resolved Hide resolved

jkotas reviewed Jun 29, 2024

View reviewed changes

src/coreclr/nativeaot/Runtime/windows/PalRedhawkInline.h Outdated Show resolved Hide resolved

eduardo-vp force-pushed the port-yield-norm-to-aot branch from af5ceaa to 6519b0b Compare July 2, 2024 18:48

kouvel reviewed Jul 2, 2024

View reviewed changes

jkotas reviewed Jul 2, 2024

View reviewed changes

src/coreclr/nativeaot/Runtime/windows/PalRedhawkInline.h Outdated Show resolved Hide resolved

This was referenced Jul 2, 2024

Build failure: Static graph-based restore failed with exit code .* but did not log an error. #103526

Open

Build failure: Static graph-based restore failed with exit code .* but did not log an error. dotnet/dnceng#3139

Closed

Eduardo Manuel Velarde Polar added 2 commits July 2, 2024 14:40

Fix PalInterlockedExchange64

9606eb9

PR comments

234d61b

eduardo-vp marked this pull request as ready for review July 2, 2024 23:09

eduardo-vp requested a review from MichalStrehovsky as a code owner July 2, 2024 23:09

Eduardo Manuel Velarde Polar added 2 commits July 2, 2024 17:04

Fix build

51c4573

Fix PalInterlocked

e8e1290

eduardo-vp requested review from jkotas, kouvel and VSadov July 3, 2024 21:30

kouvel approved these changes Jul 3, 2024

View reviewed changes

eduardo-vp merged commit d35f302 into dotnet:main Jul 17, 2024
87 of 89 checks passed

MichalStrehovsky added a commit to MichalStrehovsky/runtime that referenced this pull request Jul 26, 2024

Revert "Port yield normalization from CoreCLR to Native AOT (dotnet#1…

45de37b

…03675)" This reverts commit d35f302.

MichalStrehovsky added a commit to MichalStrehovsky/runtime that referenced this pull request Jul 26, 2024

Revert "Port yield normalization from CoreCLR to Native AOT (dotnet#1…

5208698

…03675)" This reverts commit d35f302.

MichalStrehovsky mentioned this pull request Jul 26, 2024

EventListenerThreadPool test failing in nativeaot outerloop runs #105556

Closed

eduardo-vp pushed a commit to eduardo-vp/runtime that referenced this pull request Aug 13, 2024

Revert "Port yield normalization from CoreCLR to Native AOT (dotnet#1…

5bc15f8

…03675)" This reverts commit d35f302.

eduardo-vp pushed a commit to eduardo-vp/runtime that referenced this pull request Aug 14, 2024

Revert "Port yield normalization from CoreCLR to Native AOT (dotnet#1…

9f46870

…03675)" This reverts commit d35f302.

github-actions bot locked and limited conversation to collaborators Aug 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Port yield normalization from CoreCLR to Native AOT #103675

Port yield normalization from CoreCLR to Native AOT #103675

eduardo-vp commented Jun 18, 2024 •

edited

Loading

dotnet-policy-service bot commented Jun 18, 2024

jkotas Jun 22, 2024

eduardo-vp Jun 22, 2024

VSadov Jun 22, 2024

eduardo-vp Jun 24, 2024

jkotas Jun 24, 2024

VSadov Jun 24, 2024 •

edited

Loading

VSadov Jun 24, 2024

VSadov Jun 24, 2024 •

edited

Loading

eduardo-vp Jun 24, 2024

kouvel Jul 2, 2024

kouvel left a comment

jkotas commented Jul 4, 2024

kouvel commented Jul 4, 2024 •

edited

Loading

eduardo-vp commented Jul 17, 2024

Port yield normalization from CoreCLR to Native AOT #103675

Port yield normalization from CoreCLR to Native AOT #103675

Conversation

eduardo-vp commented Jun 18, 2024 • edited Loading

dotnet-policy-service bot commented Jun 18, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

VSadov Jun 24, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

VSadov Jun 24, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kouvel left a comment

Choose a reason for hiding this comment

jkotas commented Jul 4, 2024

kouvel commented Jul 4, 2024 • edited Loading

eduardo-vp commented Jul 17, 2024

eduardo-vp commented Jun 18, 2024 •

edited

Loading

VSadov Jun 24, 2024 •

edited

Loading

VSadov Jun 24, 2024 •

edited

Loading

kouvel commented Jul 4, 2024 •

edited

Loading