Fix: modelloader handling of model_kwargs load_in*bit #1999

NanoCode012 · 2024-10-28T09:45:47Z

Description

I missed one important case where model_kwargs["load_in_8bit"] and model_kwargs["load_in_4bit"] are deleted at the end of self.set_quantization_config() which would break all further dependency on the above kwarg. This PR fixes that and refactors the handling.

Motivation and Context

How has this been tested?

Screenshots (if appropriate)

Types of changes

Social Handles (Optional)

MengqingCao · 2024-10-29T07:49:41Z

This looks much better than checking if the key-value pair exists before each check. :-)

NanoCode012 · 2024-10-29T08:17:01Z

To validate the new e2e test, I ran it on current main which threw an error.

main

this branch

* fix: load_in_*bit not properly read * fix: load_*bit check * fix: typo * refactor: load * bit handling * feat: add test dpo lora multi-gpu * fix: turn off sample packing for dpo * fix: missing warmup_steps * fix: test to load in 8bit for lora * skip 8bit lora on h100, add 4bit lora on h100 to multi gpu tests * chore: reduce max_steps --------- Co-authored-by: Wing Lian <wing.lian@gmail.com>

NanoCode012 changed the title ~~Fix/model loader cast issues~~ Fix: modelloader handling of model_kwargs load_in*bit Oct 28, 2024

NanoCode012 force-pushed the fix/model-loader-cast-issues branch from 0b34c27 to 485eab8 Compare October 28, 2024 12:38

NanoCode012 requested a review from winglian October 29, 2024 05:37

NanoCode012 and others added 10 commits October 30, 2024 12:28

fix: load_in_*bit not properly read

10925d6

fix: load_*bit check

163e02d

fix: typo

0d91e54

refactor: load * bit handling

2de1df4

feat: add test dpo lora multi-gpu

d5f4641

fix: turn off sample packing for dpo

f761ff2

fix: missing warmup_steps

b80aaf5

fix: test to load in 8bit for lora

3eaec39

skip 8bit lora on h100, add 4bit lora on h100 to multi gpu tests

bd23847

chore: reduce max_steps

fc0cb6a

winglian force-pushed the fix/model-loader-cast-issues branch from 2d7a2fe to fc0cb6a Compare October 30, 2024 16:28

winglian approved these changes Oct 30, 2024

View reviewed changes

winglian merged commit 5c7e891 into main Oct 30, 2024
14 of 15 checks passed

winglian deleted the fix/model-loader-cast-issues branch October 30, 2024 18:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix: modelloader handling of model_kwargs load_in*bit #1999

Fix: modelloader handling of model_kwargs load_in*bit #1999

NanoCode012 commented Oct 28, 2024

MengqingCao commented Oct 29, 2024

NanoCode012 commented Oct 29, 2024

Fix: modelloader handling of model_kwargs load_in*bit #1999

Fix: modelloader handling of model_kwargs load_in*bit #1999

Conversation

NanoCode012 commented Oct 28, 2024

Description

Motivation and Context

How has this been tested?

Screenshots (if appropriate)

Types of changes

Social Handles (Optional)

MengqingCao commented Oct 29, 2024

NanoCode012 commented Oct 29, 2024