-
-
Notifications
You must be signed in to change notification settings - Fork 719
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
@gen_cluster
has become 1s slower; CI takes 50% longer
#6632
Comments
the issue is the distributed/distributed/utils_test.py Lines 992 to 994 in c82bba5
the workers are already closed so there should be no remaining comm_handlers left: distributed/distributed/core.py Lines 881 to 886 in c82bba5
however looking at the tasks in self._ongoing_background_tasks._ongoing_tasks :distributed/distributed/scheduler.py Lines 4279 to 4292 in c82bba5
there's a pair of tasks waiting to call remove_worker_from_events, these get a 1 second grace period to stop and are then cancelled distributed/distributed/scheduler.py Lines 4279 to 4294 in c82bba5
I think there should not be a grace period of 1s for background tasks in |
I'd be open to dropping the grace period and directly cancelling tasks. Are there any tasks for which we'd actually prefer graceful shutdown? In that case we could adjust the signature of |
If this causes big problems we can drop it. Some information about entering/leaving workers is also stored in the Is there a problem with just canceling this task? |
I agree that we don't necessarily want a grace period for when we're closing. I'm fine with cancelling stuff right away |
Since #6603, a test decorated with
@gen_cluster
:has increased in runtime on my host from 220ms to 1220ms.
With the same PR, the CI test suites (ci1) have increased:
Ubuntu: 19m23s -> 34m18s
Windows: 23m36s -> 48m55s
MacOSX: 32m15s -> 46m39s
CC @graingert
The text was updated successfully, but these errors were encountered: