-
Notifications
You must be signed in to change notification settings - Fork 4.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[QUIC] Move ByteMixingOrNativeAVE_MinimalFailingTest to OuterLoop #55595
[QUIC] Move ByteMixingOrNativeAVE_MinimalFailingTest to OuterLoop #55595
Conversation
Tagging subscribers to this area: @dotnet/ncl Issue DetailsDisable ByteMixingOrNativeAVE_MinimalFailingTest test by ActiveIssue #55588
|
Tagging subscribers to this area: @dotnet/ncl Issue DetailsDisable ByteMixingOrNativeAVE_MinimalFailingTest test by ActiveIssue #55588
|
can we just move just to outer loop? That may be good enough as it won't block others. |
Yep, I was not sure what will be better, so did both 😂 |
Let's try it. Outerloop would still probably catch crashes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
We shouldn't just move this to Outerloop, because it's just going to keep timing out there. This tests takes a long time to run by design. We should either reduce the time it takes (e.g. reduce iterations from 20 to ??), or increase the timeout. |
How much time you think we need @geoffkizer ? |
Am I reading the code correctly, that the current timeout is 16 minutes? (1000 seconds?) How is it possible that a test that runs in 10s on my dev box takes more than 16 minutes on a CI machine? |
I think that is overall timeout for all the test. Each test may fail faster. So as each operation within the test can fail sooner. |
10s is a timeout for 1 iteration as @wfurt said |
Shall we pass 50s as an iteration timeout then? @geoffkizer |
Or, merge it as it is to unblock CI and discuss timeouts or other ways to fix it later |
This is blocking everyone, we need to disable it or move to Outerloop ASAP. @CarnaViire can you please merge your PR? We can discuss proper solution later. Thanks! |
From my experience, if it gets stuck for 16 minutes on a CI machine and eventually killed by CI infra, it's a deadlock (in tests mostly, rarely in production code). The CI is slow enough, or has a different timing, so that different races happen and we end up with something like this. Some of these problem I've successfully debugged in the past by crashing the process on purpose after a certain amount of time and then getting a dump from which you can (in ~90% of times) figure out where it got stuck. |
It didn't stuck for 16 mins, it timeouted from one iteration timeout inside RunClientServer, which is 10s by default |
That being said, we still need to investigate why 10s was not enough, it is possible CI machine is just slow and this test is big, so we just need to increase the timeout for the iteration, but some deadlock is also possible |
Disable ByteMixingOrNativeAVE_MinimalFailingTest test by ActiveIssue #55588Move ByteMixingOrNativeAVE_MinimalFailingTest to outerloop. Fixes #55588