-
-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Make Distributed.Worker threadsafe [Take 2] #38134
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah yes, the dreaded timedwait
function. Yes we should always try to avoid that in preference for more responsible implementations.
The I'm on vacation this week and don't have a laptop at hand. |
I was looking at https://build.julialang.org/#/builders/52/builds/4632/steps/5/logs/stdio
Which definitely looks like a time out. |
Still might be failing? https://build.julialang.org/#/builders/4/builds/5236 |
yeah...., much rarer so this might be a case of the actual code change not being sufficient. |
Maybe we should replace |
I don't know if it is much rarer or a different issue, but almost 25% of builds are crashing or segfaulting or failing in this test (ie. https://build.julialang.org/#/builders/52/builds/4884) |
That looks like a peculiar aarch64 issue. |
Co-authored-by: Jonas Schulze <jonas.schulze@st.ovgu.de>
Co-authored-by: Jonas Schulze <jonas.schulze@st.ovgu.de>
Co-authored-by: Jonas Schulze <jonas.schulze@st.ovgu.de>
Co-authored-by: Jonas Schulze <jonas.schulze@st.ovgu.de>
Co-authored-by: Jonas Schulze <jonas.schulze@st.ovgu.de>
Co-authored-by: Jonas Schulze <jonas.schulze@st.ovgu.de>
Co-authored-by: Jonas Schulze <jonas.schulze@st.ovgu.de>
Co-authored-by: Jonas Schulze <jonas.schulze@st.ovgu.de>
Co-authored-by: Jonas Schulze <jonas.schulze@st.ovgu.de>
cc: @jonas-schulze
If my understanding is correct, the issue was that on the slower CI machines the
timedwait
could feasibly timeout and cause the subsequent test to fail. This replaces thetimedwait
with actual waits.Should be squashed on merge.
Fixes JuliaLang/Distributed.jl#73