You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
# Initialize the trainer, without a ref_model param.dpo_trainer=DPOTrainer(
model,
...
model_adapter_name="train",
ref_adapter_name="reference",
)
In DPOTrainer this allows me to use a larger batch size and speed up training.
Would be cool to have this option in KTOTrainer as well. Anyone working on this already?
The text was updated successfully, but these errors were encountered:
For DPOTrainer there exists the option to load the Adapter from SFT training twice, as in Reference model considerations with PEFT - load-the-adapter-twice:
In DPOTrainer this allows me to use a larger batch size and speed up training.
Would be cool to have this option in KTOTrainer as well. Anyone working on this already?
The text was updated successfully, but these errors were encountered: