
Ensuring a group of tasks are scheduled together. #4485

Closed
trivialfis opened this issue Feb 5, 2021 · 9 comments · Fixed by #4503

Comments

trivialfis commented Feb 5, 2021

Hi all,

This is a feature request for applications that depend on an MPI-like communication framework, such as XGBoost. I will use XGBoost as an example in the following description.

Motivation

During training, XGBoost runs one process per worker and needs to perform an allreduce across all processes. When training one model at a time that's fine; we just submit a task for each worker:

def train(...):
    futures = []
    # workers are obtained from input data
    for i, worker in enumerate(workers):
        f = client.submit(train_on_worker, ..., workers=[worker])
        futures.append(f)

    results = client.gather(futures)
    return results

# User side:
output = train(...)

This works, as every train_on_worker task will get a chance to run eventually. But if users launch multiple training sessions simultaneously, XGBoost might hang. For example, suppose there are 2 available workers and each call to train uses both of them:

# user side:
output_0 = client.submit(train, Xy_0)
output_1 = client.submit(train, Xy_1)
# Notice that the sub-tasks launched inside the two calls to `train` might interleave due to parallel execution.

distributed might schedule 1 worker for each call to train. Due to the allreduce call, the scheduled worker will wait for the unscheduled worker to synchronize. And since both calls to train will be waiting (remember that we only have 2 workers), the remaining tasks might never be scheduled, resulting in a hang.

Feature request

Inside the train function, where we submit a task for each worker, is there a way to ensure that the tasks submitted there are scheduled as a whole?

madsbk (Contributor) commented Feb 5, 2021

Maybe explicit-comms in dask-cuda could be useful here. Using CommsContext.run() you can schedule a task on each worker, and those tasks are guaranteed to run in parallel.
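
A rough sketch of what that could look like is below. Note that the import path, the CommsContext constructor, and the run() signature are assumptions about dask-cuda's explicit-comms module rather than a verified API; check the dask-cuda documentation before relying on them.

# Sketch only: names and signatures here are assumptions, not the verified
# dask-cuda API.
from dask.distributed import Client
from dask_cuda.explicit_comms.comms import CommsContext  # assumed import path

async def train_on_worker(session_state, *args):
    # Coroutine executed on every worker participating in the communicator.
    ...

client = Client("scheduler-address:8786")
comms = CommsContext(client)          # sets up a communicator across the workers
results = comms.run(train_on_worker)  # one task per worker, started together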

quasiben (Member) commented Feb 5, 2021

Relaxing the worker pinning on the submit might help here. I believe this can be done with allow_other_workers: https://distributed.dask.org/en/latest/locality.html#user-control .
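
For reference, that would mean loosening the workers= restriction in the submit call from the original example; allow_other_workers is an existing keyword argument of Client.submit documented at the link above:

# Prefer the pinned worker, but let the scheduler fall back to other workers
# when it is unavailable.
f = client.submit(train_on_worker, ..., workers=[worker], allow_other_workers=True)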

trivialfis (Author) commented Feb 5, 2021

Thanks for the reply @madsbk @quasiben. I'm looking into the CommsContext class; it seems quite complicated, so it might take some time to see whether I can adapt it to this use case. I'm not sure allow_other_workers is the right way to go, since relaxing worker pinning will cause data copies. Also, in the example there is no other available worker even if I don't pin any worker. Let me look into it a bit more.

madsbk mentioned this issue Feb 10, 2021

madsbk (Contributor) commented Feb 10, 2021

@trivialfis I am working on a general solution to this problem: #4503

jrbourbeau (Member) commented:

I'm probably just missing something here, but @madsbk why is a MultiLock needed instead of our existing Lock, which could be acquired inside train to coordinate when we submit new tasks in train?

madsbk (Contributor) commented Feb 11, 2021

> I'm probably just missing something here, but @madsbk why is a MultiLock needed instead of our existing Lock, which could be acquired inside train to coordinate when we submit new tasks in train?

The thing is, we want to wait until we can get exclusive access to all workers that have input data. In the following example, train() only submits jobs once all of its workers are ready. Notice that the set of workers will differ between calls to train(), and we want multiple calls on non-overlapping sets of workers to run in parallel.

def train(...):
    futures = []
    # workers are obtained from input data
    with MultiLock(lock_names=workers):  # blocks until the lock for every worker is held
        for i, worker in enumerate(workers):
            f = client.submit(train_on_worker, ..., workers=[worker])
            futures.append(f)
        results = client.gather(futures)
        return results

In principle this could be implemented using the Lock and Variable extensions, but it would be very inefficient. MultiLock is a generalization of Lock -- the semantics of Lock and MultiLock are the same when len(lock_names) == 1.
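
For comparison, here is a rough sketch of one way the per-worker Lock emulation could be approximated (without using Variable); acquire_all_or_nothing is a hypothetical helper, not part of distributed, and the back-off-and-retry loop is exactly what makes this approach inefficient compared to MultiLock:

from distributed import Lock

def acquire_all_or_nothing(workers, timeout=1):
    # Hypothetical emulation of MultiLock semantics with per-worker Locks:
    # either hold the lock for every worker or hold none, backing off and
    # retrying otherwise.
    while True:
        acquired = []
        for w in workers:
            lock = Lock(w)
            if lock.acquire(timeout=timeout):
                acquired.append(lock)
            else:
                # Could not take them all -- release what we hold and retry.
                for held in acquired:
                    held.release()
                break
        else:
            return acquired  # caller must release every lock when done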

madsbk (Contributor) commented Feb 11, 2021

Check out rapidsai/dask-cuda#523 for a use case.

trivialfis (Author) commented:

Thanks @madsbk. So if I'm not mistaken, this is similar to creating a single lock with all of the used workers as the name?

madsbk (Contributor) commented Feb 12, 2021

> Thanks @madsbk. So if I'm not mistaken, this is similar to creating a single lock with all of the used workers as the name?

Not exactly, because different worker sets that overlap will also block each other.
It is the same as doing Lock(w) for w in workers, with the condition that no lock is acquired until all of the locks can be acquired.
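
A small illustration of that behaviour, reusing the lock_names keyword from the example above (the parameter name may differ in the merged implementation):

from distributed import MultiLock

# Session A holds the locks for workers w1 and w2.
with MultiLock(lock_names=["w1", "w2"]):
    ...
    # A concurrent MultiLock(lock_names=["w2", "w3"]) blocks until session A
    # releases, because the two sets overlap on w2, while
    # MultiLock(lock_names=["w3", "w4"]) can be acquired right away.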
