Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

System.Diagnostics.Tests.DiagnosticSourceTest.AllSubscriberStress blocking test runs on ARM64 #28772

Closed
wfurt opened this issue Feb 23, 2019 · 11 comments · Fixed by #56500
Closed
Assignees
Labels
arch-arm64 area-System.Diagnostics.Tracing disabled-test The test is disabled in source code against the issue os-linux Linux OS (any supported distro)
Milestone

Comments

@wfurt
Copy link
Member

wfurt commented Feb 23, 2019

We get catastrophic failure in CI.

~/dotnetbuild/work/6271876f-6ed7-4469-828e-f8c62abc33df/Work/4b9b88e6-1a8f-468f-8386-88f45204d46c/Exec ~/dotnetbuild/work/6271876f-6ed7-4469-828e-f8c62abc33df/Work/4b9b88e6-1a8f-468f-8386-88f45204d46c/Exec
�[37m  Discovering: System.Diagnostics.DiagnosticSource.Tests (method display = ClassAndMethod, method display options = None)
�[m�[37m  Discovered:  System.Diagnostics.DiagnosticSource.Tests (found 50 test cases)
�[m�[37m  Starting:    System.Diagnostics.DiagnosticSource.Tests (parallel test collections = on, max threads = 2)
�[m�[33;1m   System.Diagnostics.DiagnosticSource.Tests: [Long Running Test] 'System.Diagnostics.Tests.DiagnosticSourceTest.AllSubscriberStress', Elapsed: 00:02:36
�[m�[33;1m   System.Diagnostics.DiagnosticSource.Tests: [Long Running Test] 'System.Diagnostics.Tests.DiagnosticSourceTest.AllSubscriberStress', Elapsed: 00:04:38
�[m�[33;1m   System.Diagnostics.DiagnosticSource.Tests: [Long Running Test] 'System.Diagnostics.Tests.DiagnosticSourceTest.AllSubscriberStress', Elapsed: 00:06:38
�[m�[33;1m   System.Diagnostics.DiagnosticSource.Tests: [Long Running Test] 'System.Diagnostics.Tests.DiagnosticSourceTest.AllSubscriberStress', Elapsed: 00:08:38
�[mKilled
@stephentoub
Copy link
Member

Most likely hitting the 10 minute timeout.

@wfurt
Copy link
Member Author

wfurt commented Feb 23, 2019

I'll try to run it without limit and see how long does it take.
For now, I was primarily trying to add tracking for tests failing daily in CI.

@wfurt
Copy link
Member Author

wfurt commented Feb 25, 2019

I did run on container similar to CI and it took 34 minutes for this test set alone to finish. (outerloop)

=== TEST EXECUTION SUMMARY ===
   System.Diagnostics.DiagnosticSource.Tests  Total: 54, Errors: 0, Failed: 0, Skipped: 0, Time: 2050.368s

cc: @safern @danmosemsft

I'm not sure what is best strategy here: either disabling biggest offenders (or moving them to stress category) or increasing timeout quite a bit. Second option will of course have impact on PR as it will take much more time to get feedback.

@safern
Copy link
Member

safern commented Feb 25, 2019

Note that the engineering team just let me know last week that Ubuntu ARM tests are now going to run within a docker container limited to 2 cores only just as official builds do. I guess we should also, ask them to provide more cores, since the servers have 48 cores, specially as this cores are slower than intel ones -- at least get 4 cores for each container would be nice.

@stephentoub
Copy link
Member

Note that the engineering team just let me know last week that Ubuntu ARM tests are now going to run within a docker container limited to 2 cores only just as official builds do.

Do you know when this change is taking effect?

@wfurt wfurt self-assigned this Feb 26, 2019
@safern
Copy link
Member

safern commented Feb 26, 2019

Do you know when this change is taking effect?

Just asked and it happened some time ago today. I've been looking at some random PR failures due to this. Working with the engineering team on getting them fixed.

cc: @ulisesh

@wfurt
Copy link
Member Author

wfurt commented Feb 26, 2019

Did we end-up using 4 cores or did we get more? I can re-run tests to get new timing

@safern
Copy link
Member

safern commented Feb 26, 2019

Did we end-up using 4 cores or did we get more? I can re-run tests to get new timing

Not yet. I just spoke with @Chrisboh and he said that they can probably do that, but first they want to get ARM stuff stable and afterwards do this kind of changes.

@wfurt wfurt removed their assignment Feb 27, 2019
@msftgits msftgits transferred this issue from dotnet/corefx Feb 1, 2020
@msftgits msftgits added this to the 5.0 milestone Feb 1, 2020
@maryamariyan maryamariyan added the untriaged New issue has not been triaged by the area owner label Feb 23, 2020
@tommcdon tommcdon added p1 and removed untriaged New issue has not been triaged by the area owner labels Mar 7, 2020
@tommcdon tommcdon removed the p1 label Jul 30, 2020
@tommcdon tommcdon modified the milestones: 5.0.0, 6.0.0 Jul 30, 2020
@josalem
Copy link
Contributor

josalem commented Jul 27, 2021

Is this still an issue? The last comments on this are from over 2 years ago at this point.

@hoyosjs
Copy link
Member

hoyosjs commented Jul 27, 2021

Test is still disabled in outerloop

@ghost ghost added the in-pr There is an active PR which will close this issue when it is merged label Jul 28, 2021
@noahfalk
Copy link
Member

Trial by fire, I re-enabled it to see what happens.

@noahfalk noahfalk self-assigned this Jul 28, 2021
@ghost ghost removed the in-pr There is an active PR which will close this issue when it is merged label Jul 29, 2021
@ghost ghost locked as resolved and limited conversation to collaborators Aug 28, 2021
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
arch-arm64 area-System.Diagnostics.Tracing disabled-test The test is disabled in source code against the issue os-linux Linux OS (any supported distro)
Projects
None yet
Development

Successfully merging a pull request may close this issue.

9 participants