Add balanced strategies for device_map in from_pretrained #18349

sgugger · 2022-07-28T19:33:27Z

What does this PR do?

This PR brings to Transformers the functionality introduced in huggingface/accelerate#534 .
Basically device_map can now take several options:

"sequential" which corresponds to the current auto: fill each GPU sequentially (and if the user has lots of GPU spaces, some are not used at all)
"balanced" which will split the model evenly across GPUs
"balanced_low_0" which will split the model evenly across GPUs while leaving the most available memory on GPU 0, since that GPU might have more tensors on it when the outputs are used for some form of post-processing (generate and use_cache for instance)
"auto" which now defaults to "balanced".

When the user does not have enough GPU memory to accommodate the model, all the options are equivalent.

HuggingFaceDocBuilderDev · 2022-07-28T19:49:39Z

The documentation is not available anymore as the PR was closed or merged.

LysandreJik

Very clean implementation! Thanks for porting the work to transformers.

LysandreJik · 2022-08-01T09:14:32Z

src/transformers/modeling_utils.py

+    if version.parse(accelerate_version) > version.parse("0.11.0"):
+        from accelerate.utils import get_balanced_memory
+    else:
+        get_balanced_memory = None


src/transformers/modeling_utils.py

Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

…e#18349) * Add balanced strategies for device_map in from_pretrained * Add safeguards for Accelerate version * Update src/transformers/modeling_utils.py Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> * Style Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

Add balanced strategies for device_map in from_pretrained

49948cf

sgugger requested a review from LysandreJik July 28, 2022 19:33

Add safeguards for Accelerate version

9927c92

LysandreJik approved these changes Aug 1, 2022

View reviewed changes

sgugger and others added 2 commits August 1, 2022 07:59

Update src/transformers/modeling_utils.py

774843b

Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>

Style

c27316d

sgugger merged commit e0bc4c7 into main Aug 1, 2022

sgugger deleted the balanced_device_map branch August 1, 2022 14:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add balanced strategies for device_map in from_pretrained #18349

Add balanced strategies for device_map in from_pretrained #18349

sgugger commented Jul 28, 2022

HuggingFaceDocBuilderDev commented Jul 28, 2022 •

edited

Loading

LysandreJik left a comment

LysandreJik Aug 1, 2022

Add balanced strategies for device_map in from_pretrained #18349

Add balanced strategies for device_map in from_pretrained #18349

Conversation

sgugger commented Jul 28, 2022

What does this PR do?

HuggingFaceDocBuilderDev commented Jul 28, 2022 • edited Loading

LysandreJik left a comment

Choose a reason for hiding this comment

LysandreJik Aug 1, 2022

Choose a reason for hiding this comment

HuggingFaceDocBuilderDev commented Jul 28, 2022 •

edited

Loading