
Allow limiting the number of concurrent processes #5

Closed

Conversation

dhrrgn commented May 1, 2021

Love this package. However, when testing it to speed up some of our tooling, I noticed that performance degraded quickly with a larger number of processes. That makes sense, since forking a process hundreds or thousands of times is an expensive operation.

This feature allows limiting that concurrency, so fewer processes are forked at once. It drastically reduces CPU usage, since the package no longer has to manage so many processes and sockets.

During my testing, running 1000 concurrent tasks (each simply returning foo) took ~20 seconds. Limiting them to 100 concurrent processes reduced that to 14 seconds.

Possible Improvements

As implemented, it waits until every task in a concurrent group has completed before starting the next group. It may be better to kick off a group and, as each task completes, start the next task until all are done. However, that complicates things, and I am not convinced it would be any faster; that's a gut feeling, though, not backed by any tests. A sketch of the batching behavior is below.
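To make the batching behavior concrete, here is a minimal sketch in Python (the package itself is PHP, so this is illustrative only; `commands` is a hypothetical list of argv arrays for `subprocess.Popen`):

```python
import subprocess

def run_in_batches(commands, limit):
    """Start up to `limit` child processes, wait for the whole
    batch to exit, then fork the next batch."""
    exit_codes = []
    for start in range(0, len(commands), limit):
        batch = [subprocess.Popen(cmd) for cmd in commands[start:start + limit]]
        # The entire group must finish before the next group starts,
        # so one slow task holds up every idle slot.
        exit_codes.extend(proc.wait() for proc in batch)
    return exit_codes
```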

Open to suggestions.

PuffyZA commented May 3, 2021

You beat me to it. Was going to play around with adding process limits to the package this week, since I think it's a pretty important feature to have.

I would, however, definitely change it so that it always keeps the maximum number of processes running, starting a new one as each finishes. For instance, in a use case I want to test, we have billing code that bills each client, but clients vary wildly in how long their bills take to generate (some clients have 1 package, others have hundreds or thousands). My gut feel is that waiting for one large client to finish before running more will be much slower than starting new tasks while it's still running, though I'd have to do some benchmarks to confirm.
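For contrast with the batching sketch above, here is a minimal sketch of this refill strategy, under the same illustrative assumptions (Python rather than the package's PHP; `commands` is a hypothetical list of argv arrays):

```python
import subprocess
import time

def run_with_refill(commands, limit):
    """Keep up to `limit` children running; as soon as one exits,
    start the next queued command."""
    queue = list(commands)
    running, exit_codes = [], []
    while queue or running:
        # Top up to the limit whenever a slot is free.
        while queue and len(running) < limit:
            running.append(subprocess.Popen(queue.pop(0)))
        # Reap any finished children to free their slots.
        still_running = []
        for proc in running:
            if proc.poll() is None:
                still_running.append(proc)
            else:
                exit_codes.append(proc.returncode)
        if len(still_running) == len(running):
            time.sleep(0.01)  # nothing finished; avoid a busy loop
        running = still_running
    return exit_codes
```

With this scheme, one long-running bill no longer blocks the remaining slots, which is exactly the variable-duration case described above.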

dhrrgn (author) commented May 3, 2021

Ya, I agree, and I have an idea for a simple way to implement that. I just haven't had time to work on it.

I should have time later today.

brendt (contributor) commented May 3, 2021

> I would however definitely change it so that it always spins up the max processes as each one finishes.

That was my feedback on the discussion thread as well. If you can change your PR, I'm happy to merge it.

brendt (contributor) commented May 4, 2021

@dhrrgn I've got some time right now, so I'm going to implement this myself so we can get it tagged quickly :)

brendt closed this May 4, 2021
brendt (contributor) commented May 4, 2021

This is my PR, feel free to share your feedback: #10

dhrrgn (author) commented May 10, 2021

@brendt Awesome, thank you! Sorry for disappearing on this for a bit. Got slammed at work and haven't been feeling well.
