[peft] Fix DP issues #221
Conversation
The documentation is not available anymore as the PR was closed or merged.

Looks good as a temporary fix but we should really change the API a bit to make this easier. :)

I see, what you are suggesting is to simplify the model loading API a bit right? And do it at once directly from
Exactly, otherwise our API becomes more and more dark magic :D I think for NPP, PEFT, Int8 it should all become: `model = AutoModelForCausalLMWithValueHead.from_pretrained(ckpt, method_specific_kwargs)`. Internally we can then check if the kwargs are consistent and work in that combination, and also have useful defaults for some of the approaches.
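For illustration, a rough sketch of what such a unified call could look like. The kwargs shown here (`load_in_8bit`, `peft_config`) and their names are assumptions for this sketch, not the confirmed trl API at the time of this PR:

```python
from trl import AutoModelForCausalLMWithValueHead

# Hypothetical unified loading call: method-specific kwargs (int8, peft, ...) are
# passed straight to from_pretrained, which would validate that the combination
# makes sense and fall back to sensible defaults otherwise.
model = AutoModelForCausalLMWithValueHead.from_pretrained(
    "gpt2",              # ckpt
    load_in_8bit=True,   # int8 loading (assumed to be forwarded to transformers)
    peft_config=None,    # peft adapter config (assumed kwarg for this sketch)
)
```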
Final training run for gpt2-peft in DP: https://wandb.ai/distill-bloom/trl/runs/anb919vh?workspace=user-younesbelkada

Looking good :)
trl/trainer/ppo_trainer.py (outdated)

@@ -461,15 +461,21 @@ def step(
        model_inputs["decoder_attention_mask"] = self.accelerator.pad_across_processes(
            model_inputs["decoder_attention_mask"], dim=1, pad_index=0, pad_first=pad_first
        )
    else:
        model_inputs['labels'] = self.accelerator.pad_across_processes(
current_device = Accelerator().process_index

pretrained_model = AutoModelForCausalLM.from_pretrained(
    config.model_name, load_in_8bit=True, device_map={"": current_device}
)
Using an empty key in `device_map` seems a bit like magic to me - could we have a one-liner to explain why?
Sure yes, I will add it
Added in 44f3181
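For context: the empty string key in a `device_map` refers to the root module, i.e. the whole model, so mapping it to the current process index keeps each data-parallel replica entirely on its own GPU. A minimal sketch of the pattern with an explanatory comment (the checkpoint name is a placeholder; the example script passes `config.model_name`):

```python
from accelerate import Accelerator
from transformers import AutoModelForCausalLM

current_device = Accelerator().process_index  # index of this data-parallel process

# The "" key in device_map targets the root module, i.e. the entire model, so the
# whole 8-bit model is placed on this process's GPU instead of being sharded
# across devices by accelerate.
pretrained_model = AutoModelForCausalLM.from_pretrained(
    "gpt2",  # placeholder; the PR's script uses config.model_name
    load_in_8bit=True,
    device_map={"": current_device},
)
```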
trl/trainer/ppo_trainer.py (outdated)

@@ -461,15 +461,21 @@ def step(
        model_inputs["decoder_attention_mask"] = self.accelerator.pad_across_processes(
            model_inputs["decoder_attention_mask"], dim=1, pad_index=0, pad_first=pad_first
        )
    else:
        model_inputs['labels'] = self.accelerator.pad_across_processes(
We actually don't need labels, right? In `prepare_model_inputs` we pop them for encoder-decoder; I think we should do the same for encoders there.
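A minimal sketch of that suggestion (hypothetical helper name; in trl this logic would live inside `prepare_model_inputs` rather than a standalone function):

```python
def drop_unneeded_labels(model_inputs: dict) -> dict:
    """Mirror the encoder-decoder path: labels are not needed for the PPO
    forward pass, so remove them instead of padding them across processes."""
    model_inputs.pop("labels", None)
    return model_inputs
```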
What does this PR do?

This PR fixes issues related to DP with peft. It also adds instructions on how to properly run DP + peft.

cc @lvwerra