KTO - support loading the adapter twice #1430

claralp · 2024-03-14T15:23:51Z

For DPOTrainer there exists the option to load the Adapter from SFT training twice, as in Reference model considerations with PEFT - load-the-adapter-twice:

# Initialize the trainer, without a ref_model param.
dpo_trainer = DPOTrainer(
    model,
    ...
    model_adapter_name="train",
    ref_adapter_name="reference",
)

In DPOTrainer this allows me to use a larger batch size and speed up training.
Would be cool to have this option in KTOTrainer as well. Anyone working on this already?

PhilipMay · 2024-03-18T10:34:18Z

Hi @lewtun , we had a discussion about KTO. Do you already work on this or should we come up with a PR?

We would try and use the code from DPO and apply it to KTO to implement this.

claralp · 2024-04-11T12:18:52Z

@younesbelkada @kashif, is this a desired feature for you as well?
If yes, is someone already implementing it or should we come up with a PR?

kashif · 2024-04-11T14:09:03Z

I believe would be good to have yes, and no one is working on it

PhilipMay · 2024-04-11T14:52:23Z

That feature would be super useful @claralp . Thanks.

younesbelkada added ✨ enhancement New feature or request 🏋 KTO Related to KTO labels Apr 8, 2024

claralp mentioned this issue Apr 16, 2024

[KTO] support to load the adapter twice #1542

Merged

kashif closed this as completed in #1542 Apr 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KTO - support loading the adapter twice #1430

KTO - support loading the adapter twice #1430

claralp commented Mar 14, 2024

PhilipMay commented Mar 18, 2024 •

edited

Loading

claralp commented Apr 11, 2024

kashif commented Apr 11, 2024 •

edited

Loading

PhilipMay commented Apr 11, 2024

KTO - support loading the adapter twice #1430

KTO - support loading the adapter twice #1430

Comments

claralp commented Mar 14, 2024

PhilipMay commented Mar 18, 2024 • edited Loading

claralp commented Apr 11, 2024

kashif commented Apr 11, 2024 • edited Loading

PhilipMay commented Apr 11, 2024

PhilipMay commented Mar 18, 2024 •

edited

Loading

kashif commented Apr 11, 2024 •

edited

Loading