train in ipynb with accelerator="auto" failed #12774

Borda · 2022-04-15T15:37:13Z

🐛 Bug

Is the PL Trainer's auto mode expected to run in an IPython notebook on a machine that has two GPUs?

To Reproduce

Run a Trainer with

Trainer(accelerator="auto", devices="auto")

in a Jupyter notebook cell. It fails with:

MisconfigurationException: `Trainer(strategy='ddp_spawn')` or `Trainer(accelerator='ddp_spawn')` is not compatible with an interactive environment. Run your code as a script, or choose one of the compatible strategies: Trainer(strategy=None|dp|tpu_spawn). In case you are spawning processes yourself, make sure to include the Trainer creation inside the worker function.

Expected behavior

The Trainer should pick the best supported strategy and run without crashing.

Environment

PyTorch Lightning Version (e.g., 1.5.0):
PyTorch Version (e.g., 1.10):
Python version (e.g., 3.9):
OS (e.g., Linux):
CUDA/cuDNN version:
GPU models and configuration:
How you installed PyTorch (conda, pip, source):
If compiling from source, the output of torch.__config__.show():
Any other relevant information:

Additional context

cc @justusschock @kaushikb11 @awaelchli @ninginthecloud @akihironitta @rohitgr7


awaelchli · 2022-04-15T16:53:02Z

For accelerator="auto", devices="auto" we didn't consider that Jupyter notebooks need a different default strategy. In that sense, the strategy should also be selected automatically. For multi-GPU training in notebooks, only strategy="dp" works at the moment. If we want to support that, we would need to add a condition to the AcceleratorConnector.
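The condition described above could look roughly like the following. This is only a minimal sketch and not the actual AcceleratorConnector code: `is_interactive` and `pick_default_strategy` are hypothetical helper names, the `sys.ps1` check is one common heuristic for detecting an interactive session, and the non-interactive multi-device default of "ddp_spawn" is assumed from the error message in the report.

```python
import sys
from typing import Optional


def is_interactive() -> bool:
    # Heuristic (assumption): REPLs and IPython kernels define sys.ps1,
    # and `python -i` sets sys.flags.interactive.
    return hasattr(sys, "ps1") or bool(sys.flags.interactive)


def pick_default_strategy(num_devices: int, interactive: bool) -> Optional[str]:
    """Hypothetical helper: choose a default strategy name.

    Returns None for a single device, meaning "no distributed strategy".
    """
    if num_devices <= 1:
        return None
    if interactive:
        # Per the comment above, "dp" is the only multi-GPU strategy
        # that works in notebooks at this point.
        return "dp"
    # Assumed script default, taken from the error message in the report.
    return "ddp_spawn"


# Example: two GPUs, strategy chosen for the current session type.
strategy = pick_default_strategy(num_devices=2, interactive=is_interactive())
```

Passing the interactive flag as an argument, rather than reading it inside the function, keeps the selection logic easy to test outside a notebook.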

nicocheh · 2022-04-29T13:33:19Z

Hi @awaelchli, I used DDPSpawnStrategy(find_unused_parameters=False) with accelerator="gpu", and a few weeks ago it worked in a Jupyter notebook using multiple GPUs. Did something change so that it no longer works?

awaelchli · 2022-07-27T09:54:58Z

This is fixed now with #13405. accelerator="auto", devices="auto" will select the right strategy in a Jupyter notebook.
@nicocheh For setting find_unused_parameters=False, we added the option strategy="ddp_notebook_find_unused_parameters_false".
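As a rough illustration of the two options mentioned above, the Trainer arguments for a notebook could be assembled like this. `notebook_trainer_kwargs` is a hypothetical helper, not part of Lightning, and the commented-out Trainer call assumes a pytorch_lightning version that includes the fix from #13405.

```python
from typing import Dict


def notebook_trainer_kwargs(find_unused_parameters: bool = True) -> Dict[str, str]:
    """Hypothetical helper: build Trainer keyword arguments for a notebook."""
    kwargs = {"accelerator": "auto", "devices": "auto"}
    if not find_unused_parameters:
        # Strategy name taken from the comment above.
        kwargs["strategy"] = "ddp_notebook_find_unused_parameters_false"
    # Otherwise no strategy is passed, so the Trainer selects one itself.
    return kwargs


kwargs = notebook_trainer_kwargs(find_unused_parameters=False)

# Assumed usage, requires pytorch_lightning with the fix installed:
# import pytorch_lightning as pl
# trainer = pl.Trainer(**kwargs)
```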

Borda added the needs triage label Apr 15, 2022

Borda assigned awaelchli Apr 15, 2022

awaelchli added trainer: connector accelerator strategy bug and removed needs triage labels Apr 15, 2022

awaelchli added this to the 1.6.x milestone Apr 15, 2022

awaelchli closed this as completed Jul 27, 2022
