
Update get_unpad_data patching for multipack #2013

Merged: 4 commits into axolotl-ai-cloud:main on Nov 16, 2024

Conversation

@chiragjn (Contributor) commented Nov 4, 2024

Fixes #1991 (at least attempts to)

The code for checking whether a model has remote code is flawed: it assumes that if someone passes trust_remote_code, the model must have remote code. That causes the get_unpad_data patching to be skipped when someone passes trust_remote_code: True, breaking multipack sample packing with flash attention.

This PR changes that check to rely on the auto_map defined in the model's config.json.

It also removes the patching code for transformers versions older than 4.43.
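
A minimal sketch of what such a check could look like (hypothetical helper, not the exact code in this PR), using the auto_map attribute that transformers exposes on configs loaded from repos that ship custom modeling code:

from transformers import AutoConfig

def model_has_remote_code(model_name_or_path: str, trust_remote_code: bool = False) -> bool:
    """Detect remote code from the model's config.json instead of trusting the user flag."""
    config = AutoConfig.from_pretrained(
        model_name_or_path, trust_remote_code=trust_remote_code
    )
    # auto_map is only populated when the repo ships its own modeling code, so
    # passing trust_remote_code: True for a stock architecture no longer counts as remote.
    auto_map = getattr(config, "auto_map", None) or {}
    return "AutoModelForCausalLM" in auto_map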

How has this been tested?

So far, only manually, but it would be nice to add tests. First, I would like to get some review feedback.

Requesting review from @winglian and @NanoCode012

modeling_arch = importlib.import_module(module_name)
if hasattr(modeling_arch, "_get_unpad_data"):
    modeling_arch._get_unpad_data = get_unpad_data  # pylint: disable=protected-access
Collaborator

Is any handling needed if "_get_unpad_data" is not available? For example, throw an error to say packing is not available?

Contributor Author

I wanted to get an opinion on this; I am not sure myself. Maybe it can be an error for some known model types where we expect it, and a warning for others?

Collaborator

Hm, I think it's good to have a check in case upstream changes a model type we support without our knowledge.

else:
    if model_type in list_of_known_working_types:
        raise ValueError("multipack sample packing is not available for this model")
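
A rough sketch of how that handling could look (hypothetical names: known_working_types stands in for whatever list of known-good architectures axolotl keeps, and the import of get_unpad_data is an assumed location for axolotl's replacement implementation):

import importlib
import logging

from axolotl.monkeypatch.utils import get_unpad_data  # assumed location of the replacement

LOG = logging.getLogger(__name__)

def patch_unpad_data(module_name, model_type, known_working_types):
    # Patch _get_unpad_data when the module still defines it; otherwise fail loudly
    # for model types we expect to support, and only warn for everything else.
    modeling_arch = importlib.import_module(module_name)
    if hasattr(modeling_arch, "_get_unpad_data"):
        modeling_arch._get_unpad_data = get_unpad_data  # pylint: disable=protected-access
    elif model_type in known_working_types:
        raise RuntimeError(
            f"{module_name} no longer defines _get_unpad_data; "
            "multipack sample packing cannot be patched for this model type"
        )
    else:
        LOG.warning(
            "Could not patch _get_unpad_data for %s; sample packing may not work", model_type
        )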

Additional review threads (resolved): src/axolotl/utils/models.py (outdated), src/axolotl/monkeypatch/multipack.py
@winglian (Collaborator)

I think once we address the last 2 suggestions, this should be good to go. We should also make sure we manually run the multigpu tests before merging this PR.

@NanoCode012 (Collaborator) commented Nov 15, 2024

I think we should add a small e2e test for this. Perhaps similar to tests/e2e/test_lora_llama.py, but for full fine-tuning with trust_remote_code on. I'll run a few e2e tests first.
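
For reference, a rough sketch of the config values such a test might exercise (the model and dataset names here are placeholders, not necessarily what the final test uses): a full fine-tune with trust_remote_code, flash attention, and sample packing enabled, so the patching path from this PR is actually hit.

cfg = {
    "base_model": "Qwen/Qwen2.5-0.5B",  # placeholder small model; issue #1991 involved Qwen2.5
    "trust_remote_code": True,          # the flag that previously caused the patch to be skipped
    "flash_attention": True,
    "sample_packing": True,
    "sequence_len": 1024,
    "micro_batch_size": 1,
    "num_epochs": 1,
    "learning_rate": 1e-5,
    "datasets": [{"path": "mhenrichsen/alpaca_2k_test", "type": "alpaca"}],  # placeholder dataset
}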

@NanoCode012 (Collaborator) commented Nov 15, 2024

Manually Tested CI:

  • e2e multi-gpu as-is
  • e2e rebased to main
  • e2e multi-gpu rebased to main

Edit: Having some issues verifying the last two runs, re-running.

Edit2: Double-checked those tests are working.

@winglian merged commit 0c8b1d8 into axolotl-ai-cloud:main on Nov 16, 2024
13 checks passed
bursteratom pushed a commit that referenced this pull request Nov 18, 2024
* Update `get_unpad_data` patching for multipack

* Update src/axolotl/utils/models.py

* Update src/axolotl/utils/models.py

* Add test case

---------

Co-authored-by: Wing Lian <wing.lian@gmail.com>
Co-authored-by: Wing Lian <wing@axolotl.ai>

Successfully merging this pull request may close these issues.

RuntimeError: CUDA error: an illegal memory access was encountered [When Running SFT on Qwen2.5]