
Add support for FSDP+QLoRA and DeepSpeed ZeRO3+QLoRA #1416

Merged · 14 commits · Mar 13, 2024

Conversation

pacman100 (Contributor):

What does this PR do?

  1. `prepare_model_for_kbit_training` and `peft_module_casting_to_bf16` should be disabled when using FSDP+QLoRA or DeepSpeed ZeRO3+QLoRA.

This PR should be merged after Transformers PR huggingface/transformers#29587
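The guard described in the PR summary can be sketched as a small predicate. This is a minimal illustration with hypothetical names (`should_prepare_for_kbit` and its flags are not the actual TRL code), showing only the decision the PR adds: skip k-bit preparation and bf16 casting when the quantized model's parameters are sharded.

```python
def should_prepare_for_kbit(quantized: bool, using_fsdp: bool, using_zero3: bool) -> bool:
    """Decide whether `prepare_model_for_kbit_training` (and the bf16
    module casting) should run for a model.

    Hypothetical sketch: when FSDP or DeepSpeed ZeRO-3 shards the
    parameters, the sharding wrapper manages dtype and placement itself,
    so both preparation steps are skipped.
    """
    if not quantized:
        return False  # nothing to prepare for a non-quantized model
    # Skip preparation under FSDP+QLoRA or DeepSpeed ZeRO3+QLoRA
    return not (using_fsdp or using_zero3)
```

For example, `should_prepare_for_kbit(True, True, False)` evaluates to `False`, matching the FSDP+QLoRA case this PR addresses.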

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@pacman100 pacman100 marked this pull request as ready for review March 12, 2024 14:05
@pacman100 (Contributor, Author):

cc @younesbelkada

@younesbelkada (Contributor) left a comment:

Thanks so much!

trl/trainer/sft_trainer.py (review thread resolved)
pacman100 and others added 3 commits March 13, 2024 13:02
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
@younesbelkada (Contributor):

I can't reproduce the CI failure locally or on main; will fix it in a follow-up PR!

@younesbelkada younesbelkada merged commit 58c0888 into huggingface:main Mar 13, 2024
2 of 9 checks passed
kashif pushed a commit that referenced this pull request Mar 14, 2024
* don't do mp casting

* don't use `prepare_for_kbit` when using fsdp+qlora or dsz3+qlora

* changes to enable fsdp+qlora and dsz3+qlora

* revert

* Update sft_trainer.py

* quality

* fix deprecation using changes from PR #1415

* fixes

* quality

* Update trl/trainer/sft_trainer.py

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* quality

* relaunch tests

---------

Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
kashif pushed a commit to fe1ixxu/trl that referenced this pull request Mar 15, 2024
lapp0 pushed a commit to lapp0/trl that referenced this pull request May 10, 2024

3 participants