
fix: unpackaging error in Custom Mixture of Experts model when aux_loss_enabled is set to True. #2039

Merged: 4 commits into huggingface:main on Sep 9, 2024

Conversation

@Jonathanjordan21 (Contributor) commented Sep 9, 2024

What does this PR do?

This PR fixes #2038.

Fixes an unpacking error caused by the additional aux_loss value returned by the concatenated_forward function when aux_loss_enabled=True.
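
(For context, a minimal illustrative sketch of the failure mode; fake_concatenated_forward below is a hypothetical stand-in, not the actual TRL code.)

def fake_concatenated_forward(aux_loss_enabled):
    # Five regular outputs, plus an extra aux_loss element when aux_loss_enabled=True
    # (e.g. an MoE model configured with output_router_logits=True).
    outputs = ("chosen_logps", "rejected_logps", "chosen_logits", "rejected_logits", "nll_loss")
    return outputs + ("aux_loss",) if aux_loss_enabled else outputs

try:
    # A call site that unpacks a fixed number of names breaks once the extra element appears.
    a, b, c, d, e = fake_concatenated_forward(aux_loss_enabled=True)
except ValueError as err:
    print(err)  # too many values to unpack (expected 5)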

Before submitting

  • Did you read the contributor guideline, Pull Request section?
  • Was this discussed/approved via a GitHub issue? Please add a link to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the documentation guidelines.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@kashif (Collaborator) commented Sep 9, 2024

thanks @Jonathanjordan21

perhaps it's more elegant to do:

reference_chosen_logps, reference_rejected_logps = self.concatenated_forward(self.ref_model, padded_batch)[:2]

what do you think?
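
(Standalone illustration, plain Python rather than TRL code: slicing the returned tuple keeps the unpacking valid whether or not the optional trailing aux_loss is present.)

forward_output = ("chosen_logps", "rejected_logps", "chosen_logits", "rejected_logits", "nll_loss", "aux_loss")
reference_chosen_logps, reference_rejected_logps = forward_output[:2]  # ignores everything after the first two elements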

@Jonathanjordan21 (Contributor, Author)

@kashif seems good. I actually just followed the earlier code that calculates the policy losses in the get_batch_loss_metrics function:

forward_output = self.concatenated_forward(model, batch)
(
    policy_chosen_logps,
    policy_rejected_logps,
    policy_chosen_logits,
    policy_rejected_logits,
    policy_nll_loss,
) = forward_output[:5]
if self.aux_loss_enabled:
    aux_loss = forward_output[5]

@kashif (Collaborator) commented Sep 9, 2024

Yeah... I should have just done the above, but happy if you do it!

@kashif (Collaborator) commented Sep 9, 2024

You might need to run pre-commit run --all-files in the root of the TRL folder to fix any formatting issues.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@kashif kashif merged commit 72f19c3 into huggingface:main Sep 9, 2024
9 checks passed

Successfully merging this pull request may close these issues.

DPOTrainer failed on training Custom Mixture of Experts model with config output_router_logits=True