Fix Preference Loss and Refactor for Readability #484

austin362667 · 2024-12-17T03:10:42Z

Summary

Thanks to @winglian and @shivam15s noticed and fixed this #481.

This PR suggests negating the preference loss terms to align with the formulas in the docstrings, while maintaining the base preference structure as nll_loss + preference_loss. This would make our loss computations more consistent since both terms would represent losses to be minimized.

This PR also tightened the tolerance in case of encountering a similar issue.

Testing Done

Hardware Type:
run make test to ensure correctness
run make checkstyle to ensure code style
run make test-convergence to ensure convergence

Signed-off-by: Austin Liu <austin362667@gmail.com>

winglian and others added 6 commits December 16, 2024 10:51

preference loss sign is inverted and leads to negative loss

e208924

fix test sign too

1c3a631

Fix readability

d5bd014

Signed-off-by: Austin Liu <austin362667@gmail.com>

Tighten dpo tol

f951da3

Signed-off-by: Austin Liu <austin362667@gmail.com>

Format

65bcc2c

Signed-off-by: Austin Liu <austin362667@gmail.com>

Merge branch 'main' into preference-loss-sign

22cca3c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Preference Loss and Refactor for Readability #484

Fix Preference Loss and Refactor for Readability #484

austin362667 commented Dec 17, 2024 •

edited

Loading

Fix Preference Loss and Refactor for Readability #484

Are you sure you want to change the base?

Fix Preference Loss and Refactor for Readability #484

Conversation

austin362667 commented Dec 17, 2024 • edited Loading

Summary

Testing Done

austin362667 commented Dec 17, 2024 •

edited

Loading