Some macos agents are slow. More than 2x slower #3885
Comments
Hey, @andreineculau |
I modified my workflow a bit in order to better separate network from disk I/O.
You can see, though, how the numbers point towards disk I/O, e.g. |
One last comment before I put this to rest: I wanted to print some system info, and hit a very big difference, 1m26s vs 4m43s: https://github.com/andreineculau/actions-ve-repro-2707/runs/3299223287?check_suite_focus=true vs https://github.com/andreineculau/actions-ve-repro-2707/runs/3299223358?check_suite_focus=true A command like |
Hey, @andreineculau |
I haven't noticed flimsy performance for a while but I'm wondering if a cause has been found and a permanent fix pushed so this issue could be closed. Thanks. |
@andreineculau some environments were fixed and some are still in progress, so the performance can be flaky. |
@miketimofeev thanks for the update! Just to be sure, there isn't anything one can do as part of the job configuration as a local fix, right? |
@andreineculau unfortunately, yes. It's all about underlying infrastructure at the moment. |
Performance is so slow that any CPU-bound task, like caching, is rendered totally useless. Unpacking a 120 MB file takes longer than actually running all the different |
Is it possible that the fact that the runners use the CSharp runtime makes it not perform so well in a Mac environment? |
On top of the observed fluctuations in performance (sometimes a Homebrew installation takes 10 seconds, sometimes the same installation takes 5 minutes), I also observed file I/O to be very slow recently and the Actions UI itself being unresponsive: The runtime counter just stops, no visible log output is displayed and it can take minutes until any output is shown - I do however experience the same issue with other runners and the UI on the whole: It gets "stuck" showing a single job as active even though it has finished and doesn't update the state of other jobs. |
We are on Azure and have been having the same issues recently. The performance of the macOS VMs has never been great, but recently (the last 1-2 weeks) it dropped to a new low. We have runners that run into the 60-minute timeout, runners that don't update the UI anymore (even after a refresh), and jobs that never get a runner (they time out too, even though runners are available). I/O performance is at a new low as well: build tasks for an app that took around 10 min now take around 15-20 min, if they don't just die in the middle of the task. Please MS/GH fix your macOS setups, this is not usable anymore. |
Random ~100x regression on CPU heavy tasks. Here's an example, where this only happened for one Mac job (the others have normal performance): https://github.com/connorjclark/ZeldaClassic/actions/runs/3690258663/jobs/6247167952 Hope this information helps. |
@miketimofeev - is there any update on this? I use |
Our macOS tests fail more than half the time, while Linux and Windows pass. The errors are always due to timeouts. While we can improve the performance of our tests (particularly test/messages.ts), sometimes the timeouts happen when testing against the local SMTPServer. As of now, we can't get insight into whether our tests are passing or failing, as the majority of test runs fail when macOS is included. GitHub is aware of the issue: actions/runner-images#3885
@miketimofeev Any updates on this? |
We are still experiencing performance issues with Xcode builds on GitHub runners. |
Any updates on this issue? It is still a problem, and seems to get worse if you use Xcode 15 for testing and compiling. We have seen cache download tasks that take more than double the time it would take to compile every dependency. |
Hey all! We have done a lot of work on image optimisation and have received many positive reports about image speed. The most significant speed-up can now be seen on OS13 runners (both Intel and M1), and OS12 also got faster thanks to some hardware modifications. We hope most customers will find the current situation suitable for their needs. We are going to close this ticket now, but we are always glad to hear feedback. |
Separately, I should note that the performance of Xcode 15 on macOS-13 is tracked in a separate issue, #7971, and is not related to this problem or to hardware in general. |
Nothing seems to be fixed. For many things that need I/O performance, macOS Runner still has the worst performance. |
Description
Similar to #2707, I am noticing slower macOS agents. It's random but frequent: 10-25% of the runs are slow.
A good run can "uninstall homebrew" (i.e. mostly disk i/o) in 184 seconds https://github.com/rokmoln/support-firecloud/runs/3295490478?check_suite_focus=true#step:3:190
A slow run can "uninstall homebrew" roughly 2.6 times slower, in 483 seconds
https://github.com/rokmoln/support-firecloud/runs/3295490501?check_suite_focus=true#step:3:190
Similarly, reinstalling homebrew (disk, network and cpu bound) happens 1.8 times slower, in 593 seconds https://github.com/rokmoln/support-firecloud/runs/3295490501?check_suite_focus=true#step:3:447 instead of 333 seconds https://github.com/rokmoln/support-firecloud/runs/3295490478?check_suite_focus=true#step:3:456
Overall, my builds time out even with a timeout of about twice the normal duration (normally 23 minutes, timeout 45 minutes).
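For reference, the slowdown factors can be recomputed from the Homebrew timings quoted above. This is only a minimal sketch: the `timings` dictionary and the `slowdown` helper are hypothetical names introduced for illustration, and the numbers are copied by hand from the linked runs.

```python
# Timings in seconds, copied by hand from the runs linked above.
# The dictionary layout and the helper name are illustrative assumptions.
timings = {
    "uninstall homebrew": {"good": 184, "slow": 483},
    "reinstall homebrew": {"good": 333, "slow": 593},
}


def slowdown(good: float, slow: float) -> float:
    """Return how many times longer the slow run took than the good run."""
    if good <= 0:
        raise ValueError("good timing must be positive")
    return slow / good


for step, t in timings.items():
    ratio = slowdown(t["good"], t["slow"])
    print(f"{step}: {t['slow']}s vs {t['good']}s, {ratio:.1f}x slower")
```

With these inputs the two steps come out at roughly 2.6x and 1.8x.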
As requested by @miketimofeev, I have a repro workflow here https://github.com/andreineculau/actions-ve-repro-2707, the same as @smorimoto used in #2707.
You can see a good run ending after 1m30s and running
brew install systembench
in 29s https://github.com/andreineculau/actions-ve-repro-2707/runs/3298362909?check_suite_focus=true as opposed to a slow run ending after 3m09s and running
brew install systembench
in 2m13s https://github.com/andreineculau/actions-ve-repro-2707/runs/3298362887?check_suite_focus=true .
Virtual environments affected
Image version and build link
20210801.1
Is it regression?
No response
Expected behavior
Consistent times. I guess a <25% deviation is expected, but not >100%.
Actual behavior
More than 2x slower run times.
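To make the expected-versus-actual comparison concrete, the sketch below parses durations written the way the Actions UI shows them (for example 1m30s) and labels a slow run against the two thresholds mentioned in this report: under 25% deviation is expected, over 100% is not. The function names and the label for the band in between are assumptions made for illustration, not something GitHub provides.

```python
import re


def parse_duration(text: str) -> int:
    """Convert a duration such as '3m09s' or '29s' to whole seconds."""
    match = re.fullmatch(r"(?:(\d+)m)?(\d+)s", text)
    if match is None:
        raise ValueError(f"unrecognised duration: {text!r}")
    minutes = int(match.group(1) or 0)
    seconds = int(match.group(2))
    return minutes * 60 + seconds


def classify(good: str, slow: str) -> str:
    """Label a slow run relative to a good run.

    Thresholds follow the report: under 25% deviation is expected,
    over 100% means more than 2x slower. The middle label is an assumption.
    """
    deviation = parse_duration(slow) / parse_duration(good) - 1.0
    if deviation < 0.25:
        return "expected"
    if deviation <= 1.0:
        return "noticeable"
    return "more than 2x slower"


# Timings from the repro workflow runs linked in the description.
print("whole job:", classify("1m30s", "3m09s"))
print("brew install systembench:", classify("29s", "2m13s"))
```

Both repro comparisons fall in the "more than 2x slower" band: 189s against 90s for the whole job, and 133s against 29s for the install step.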
Repro steps
https://github.com/andreineculau/actions-ve-repro-2707/blob/85c642afe6c8d19f3e30bfa51de7d2075d6a5414/.github/workflows/workflow.yml