dask scheduler num_workers #2403

bjlittle · 2017-02-22T15:58:51Z

See here for context.

We need to consider how we easily control the number of threads, processes or workers that the dask scheduler uses - particularly for users targeting a shared resource such as a server or cluster.

Ping @marqh

AlexHilson · 2017-02-23T11:37:17Z

Fwiw if a Distributed scheduler is defined then that will be used by default, so the current implementation will work with either the threaded / distributed scheduler depending on what the user has done.

Defining a way to pass arguments into the scheduler is definitely useful. I feel like I would want the default settings to use 100% of available resources and let dask / my cpu worry about the consequences, but maybe that's naive.

AlexHilson · 2017-02-23T16:52:15Z

I briefly discussed this with @marqh, and he mentioned some of the multi-user systems where Iris is deployed and used.

I agree that we should test how dask behaves on these systems before going 'live', but I still believe that if you're running a multi-user system it's your responsibility to ensure that your users cpu access is managed. Making our code slower by default seems wrong to me, but I appreciate that we may need to be pragmatic...

DPeterK · 2017-03-24T09:39:20Z

My feeling about this is that Iris should run single thread/process by default, with users needing to opt in if they wish to use the multiprocess goodness that dask offers. This needn't be the default long-term, but I think it is the most appropriate introductory approach.

I have a couple of reasons to back up this assertion:

many Iris users may not be used to parallel processing and may be put out if an innocent processing command given to Iris uses up all the available compute resource and physical memory on their machine, causing it to crash.
Python multiprocessing seems to disregard the amount of resource offered by slurm, so if we open up unrestricted multiprocess capability (both in terms of being on by default in Iris and in terms of not heeding slurm limits) then we may enter a realm where Iris is regularly taking out scalable compute resource through numerous, concurrent, resource unrestricted uses of Iris. This would look bad.

Of course once the two concerns above are resolved then we can reconsider the default multiprocess behaviour in Iris.

DPeterK · 2017-03-24T10:50:58Z

In #2457 @bjlittle suggested the pattern of having iris.options. I wonder if we could make use of that here, both for setting the number of workers and setting the scheduler. Consider:

with iris.options.parallel(num_workers=6, scheduler='multiprocessing'):
    iris.load('my_dataset.nc')

Or

iris.options.parallel(num_workers=6, scheduler='multiprocessing')
iris.load('my_dataset.nc')

We can get allowed values for scheduler from the options available in dask. We could even set this to point to the IP address and port of a running distributed scheduler and use that in all following user code:

iris.options.parallel(scheduler='192.168.0.219:8786')

In _lazy_data we could interface with these options as follows:

def as_concrete_data(array):
    if is_lazy_data(array):
        num_workers = iris.options.parallel.get('num_workers')
        scheduler = iris.options.parallel.get('scheduler')
        result = array.compute(num_workers=num_workers, get=scheduler.get)
        ...

Alternatively, we could use dask.set_options to apply these options and call this in iris.options to globally set the state of dask for the lifetime of this session.

Thoughts please people!

marqh · 2017-04-07T08:36:43Z

My feeling about this is that Iris should run single thread/process by default, with users needing to opt in if they wish to use the multiprocess goodness that dask offers. This needn't be the default long-term, but I think it is the most appropriate introductory approach.

I think this represents a reasonable approach

I would advocate a '1' being explicitly set somewhere and I would like to make time to investigate the potential implications of selecting a different, suitably small number, such as 'Three' (it's a magic number ;)

as this might provide some neat benefit with limited risk

Alternatively, we could use dask.set_options to apply these options and call this in iris.options to globally set the state of dask for the lifetime of this session.

I don't know how clear this would be

As part of the documentation for the release, I think we must provide a page on dask. As part of this, we should explore scenarios where I want to reconfigure dask in a certain way. A couple come to mind:

I want to have dask use all available resources on this host
I want to have dask use most of the available resources on this host, but it is my laptop, so leave enough to keep my browser session and chat windows working smoothly

So, all in favour, keen to contribute to the thought process and implementation

DPeterK · 2017-04-10T08:40:41Z

such as 'Three' (it's a magic number ;)

Unbelievable 🎵

Alternatively, we could use dask.set_options

I don't know how clear this would be

The methodology preferred by dask to set runtime options is dask.set_options, so I think using it is worthwhile. It is the approach I ended up following in #2462, which does hide calling dask.set_options behind an Iris API, but I'm not convinced that's a bad thing, especially if we document what Iris does and how users can also make use of this. I also like the idea of the documented parallel-processing examples 👍

bjlittle added the dask label Feb 22, 2017

bjlittle added this to the dask milestone Feb 22, 2017

DPeterK mentioned this issue Mar 28, 2017

Iris options (dask parallel processing) #2462

Closed

DPeterK mentioned this issue Apr 11, 2017

Ensure docs still build (2 man days) #2460

Closed

bjlittle mentioned this issue Apr 20, 2017

**User documentation page on dask options (8 man days)** #2494

Closed

This was referenced Apr 24, 2017

Safe dask (2 man days' effort) #2499

Closed

Dask options for Iris processing #2511

Merged

DPeterK closed this as completed May 8, 2017

QuLogic modified the milestones: dask, v2.0 Aug 2, 2017

pp-mo mentioned this issue Oct 27, 2017

Use default Dask scheduler settings #2879

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dask scheduler num_workers #2403

dask scheduler num_workers #2403

bjlittle commented Feb 22, 2017

AlexHilson commented Feb 23, 2017

AlexHilson commented Feb 23, 2017

DPeterK commented Mar 24, 2017

DPeterK commented Mar 24, 2017

marqh commented Apr 7, 2017

DPeterK commented Apr 10, 2017

dask scheduler num_workers #2403

dask scheduler num_workers #2403

Comments

bjlittle commented Feb 22, 2017

AlexHilson commented Feb 23, 2017

AlexHilson commented Feb 23, 2017

DPeterK commented Mar 24, 2017

DPeterK commented Mar 24, 2017

marqh commented Apr 7, 2017

DPeterK commented Apr 10, 2017