
Add model merging callback #2241

Closed
lewtun opened this issue Oct 16, 2024 · 6 comments · Fixed by #2282
Labels
✨ enhancement New feature or request 🧒 good second issue Good for contributors with basic project familiarity

Comments

@lewtun
Member

lewtun commented Oct 16, 2024

Feature request

Add a MergeModelCallback that merges the reference model with the current policy and optionally pushes the merged checkpoint to the Hub. This could happen at the end of each step/epoch and/or at the end of training. Implementation-wise, we could use Arcee's mergekit lib and include it as an optional dependency: https://github.com/arcee-ai/mergekit
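As a rough sketch of the idea (not TRL's eventual implementation): the callback hooks the trainer's checkpoint-save event and writes out a merged copy of the weights. The `on_save` hook shape mirrors a transformers-style `TrainerCallback`, but the merge itself is shown as a plain linear interpolation over state dicts for illustration; a real implementation would delegate the merge to mergekit, and the `push_to_hub` handling is omitted here.

```python
# Illustrative sketch only. The linear interpolation stands in for a
# mergekit-backed merge; state dicts are plain {name: value} mappings.

def linear_merge(policy_state, ref_state, alpha=0.5):
    """Weighted average of two state dicts with identical keys."""
    return {
        name: alpha * policy_state[name] + (1 - alpha) * ref_state[name]
        for name in policy_state
    }

class MergeModelCallback:
    """Callback-shaped sketch: merge the policy with a reference model
    whenever the trainer saves a checkpoint."""

    def __init__(self, ref_state, alpha=0.5, push_to_hub=False):
        self.ref_state = ref_state
        self.alpha = alpha
        self.push_to_hub = push_to_hub  # Hub upload omitted in this sketch

    def on_save(self, policy_state):
        # Invoked by the trainer at each checkpoint save.
        return linear_merge(policy_state, self.ref_state, self.alpha)
```

In practice the merged weights would be saved next to the checkpoint (and optionally pushed to the Hub) rather than returned.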

Motivation

Various papers show that model merging can non-trivially improve performance, especially when the models share the same architecture.

Your contribution

Open to the community!

@lewtun lewtun added the ✨ enhancement New feature or request label Oct 16, 2024
@qgallouedec qgallouedec added the 🧒 good second issue Good for contributors with basic project familiarity label Oct 16, 2024
@coding-famer

I'm interested in working on this!

@qgallouedec
Member

Nice! Thanks @coding-famer. Feel free to open a PR, and ask for help if needed.

@August-murr
Collaborator

@lewtun
After reading the paper, I noticed that the DPO checkpoints were merged with a different model rather than with the reference model used in DPO training. So I added an option in my PR to set an external model for merging instead of the reference model.

@coding-famer

Hi @August-murr, glad to see you've already worked it out! However, I noticed that your implementation only merges models already saved to disk after training, which a user could do with mergekit directly once training is done. I think the point here is to merge the model during training, at step/epoch boundaries?

@August-murr
Collaborator

August-murr commented Oct 25, 2024

@coding-famer The callback has an optional parameter called merge_at_every_checkpoint, which merges the saved checkpoint at either every step or at the end of each epoch during training.
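The toggle described above could be sketched as follows. The driver loop and `merge_fn` are hypothetical stand-ins for illustration; only the `merge_at_every_checkpoint` parameter name comes from the discussion.

```python
# Hypothetical driver illustrating the merge_at_every_checkpoint toggle:
# either merge at every checkpoint save, or only once on the final one.

def run_training(num_checkpoints, merge_at_every_checkpoint, merge_fn):
    merges = []
    for step in range(1, num_checkpoints + 1):
        # ... trainer saves checkpoint `step` here ...
        if merge_at_every_checkpoint:
            merges.append(merge_fn(step))
    if not merge_at_every_checkpoint:
        # merge only the final checkpoint at the end of training
        merges.append(merge_fn(num_checkpoints))
    return merges
```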

@coding-famer

> @coding-famer The callback has an optional parameter called merge_at_every_checkpoint, which merges the saved checkpoint at either every step or at the end of each epoch during training.

Sounds great!
