
Feat: Add support for APO-zero in KTOTrainer #1952

Merged (17 commits) on Sep 4, 2024

Conversation

@KarelDO (Contributor) commented on Aug 21, 2024

Feat: Add support for APO-zero in KTOTrainer

Now ready to merge

This PR adds support for the unpaired variant of APO-zero in the KTOTrainer. See the APO paper.

To achieve this, I:

  • Added a loss_type variable to KTOConfig (similar to DPOConfig)
  • Added the APO loss to KTOTrainer (similar to DPOTrainer)
  • Updated KTOTrainer to only calculate the KL term when the loss requires it (the KL calculation is expensive, so APO-zero trains faster than KTO); see the loss sketch after this list.
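
To make the difference between the two losses concrete, here is a minimal sketch of the idea (simplified variable names, not the exact trainer code): KTO anchors both terms on a reference KL estimate, while APO-zero-unpaired drops that term entirely.

import torch

def unpaired_losses(chosen_logratios, rejected_logratios, kl, beta, loss_type):
    # chosen_logratios / rejected_logratios: policy-vs-reference log-prob ratios
    # kl: batch-level KL estimate between policy and reference (only used by "kto")
    if loss_type == "kto":
        # KTO anchors both terms on the KL estimate
        chosen_losses = 1 - torch.sigmoid(beta * (chosen_logratios - kl))
        rejected_losses = 1 - torch.sigmoid(beta * (kl - rejected_logratios))
    elif loss_type == "apo_zero_unpaired":
        # APO-zero pushes chosen log-ratios up and rejected log-ratios down
        # with no KL term, so the expensive KL pass can be skipped entirely
        chosen_losses = 1 - torch.sigmoid(beta * chosen_logratios)
        rejected_losses = torch.sigmoid(beta * rejected_logratios)
    return torch.cat((chosen_losses, rejected_losses), dim=0)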

Additionally, I updated the kto.py script to be interoperable with any DPO-formatted dataset:

  • I added a util (in data_util.py) which checks the format of the dataset and turns a DPO-formatted dataset into a KTO-formatted dataset; a sketch of the idea follows below.
  • I've updated the kto script to also work with ChatML-formatted datasets (similar to what happens in dpo.py).
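
The conversion itself is simple. Roughly (illustrative code, not the exact util in the PR): each (prompt, chosen, rejected) pair becomes two unpaired rows with a boolean label.

from datasets import Dataset

def dpo_pairs_to_kto_rows(batch):
    # Each DPO-style pair yields two KTO-style rows:
    # the chosen completion with label True, the rejected one with label False.
    new_rows = {"prompt": [], "completion": [], "label": []}
    for prompt, chosen, rejected in zip(batch["prompt"], batch["chosen"], batch["rejected"]):
        new_rows["prompt"] += [prompt, prompt]
        new_rows["completion"] += [chosen, rejected]
        new_rows["label"] += [True, False]
    return new_rows

dpo_ds = Dataset.from_dict(
    {"prompt": ["Q1"], "chosen": ["good answer"], "rejected": ["bad answer"]}
)
kto_ds = dpo_ds.map(dpo_pairs_to_kto_rows, batched=True, remove_columns=dpo_ds.column_names)
# kto_ds now has columns ["prompt", "completion", "label"], with two rows per original pair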

@KarelDO (Contributor, Author) commented on Aug 23, 2024

I've confirmed that the training dynamics of APO-zero-unpaired are as intended.

Compared to a KTO run, APO-zero trains about 40% faster since no KL values need to be calculated.

I will run more downstream evaluations later to understand the differences between KTO and APO-zero for unpaired alignment.

This PR can now be merged

@qgallouedec (Member) commented

Thank you @KarelDO, it's a feature we're very happy to see coming to TRL. We'll be reviewing your PR soon for sure. Can you share the elements you have to confirm that everything is working as expected? Maybe curves, trained models, etc?

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@@ -60,6 +65,10 @@ class KTOConfig(TrainingArguments):
Number of processes to use for processing the datasets.
"""

loss_type: Literal[
"kto",
"apo_zero_unpaired",
Member:

is "unpaired" really necessary? As far as I understand, there is no such thing as "paired" version for kto, right?

Contributor (Author):

APO-zero does have a paired and unpaired variant, and you could definitely construct a paired variant of KTO.

We can remove "_unpaired" here since the KTOTrainer also implies it, but I thought it would be good for people to actively think about the distinction when selecting a loss.

Member:

Yes, given we also have an apo_zero loss in the DPOTrainer, it's good to retain the _unpaired distinction IMO.

Would you mind adding this loss term to the integration tests here:

@parameterized.expand(

You might want to look at the DPO trainer for inspiration:

@parameterized.expand(
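
Roughly something along these lines (a sketch only; class and test names are illustrative, and the real test should build a small KTOTrainer and run a training step per loss type):

import unittest
from parameterized import parameterized

class KTOTrainerLossTypeTester(unittest.TestCase):
    @parameterized.expand(
        [
            ("kto",),
            ("apo_zero_unpaired",),
        ]
    )
    def test_kto_trainer_loss_types(self, loss_type):
        # Placeholder check; the real test would build a KTOConfig with this
        # loss_type, instantiate a small KTOTrainer, and call trainer.train().
        self.assertIn(loss_type, ["kto", "apo_zero_unpaired"])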

@KarelDO (Contributor, Author) commented on Aug 24, 2024

Hey @qgallouedec, thanks!

Here's a WandB report PDF with some training curves. I can't share the full WandB project yet, but a printout of the report is attached. I've summarized the main takeaways below:

  • APO-zero trains considerably faster
  • On an RLAIF preference dataset, the training dynamics of KTO and APO-zero seem identical
  • On a CLAIR preference dataset, the training dynamics of KTO and APO-zero differ. This is due to a higher KL on this dataset. APO-zero does display the intended training dynamic of smoothly increasing desirable rewards and decreasing undesirable rewards, without calculating a KL.

TL;DR: these are different loss functions; they sometimes behave similarly depending on the underlying preference dataset, and APO-zero trains faster.

@lewtun (Member) left a comment:

Thanks a lot for this nice contribution @KarelDO! Overall LGTM once we have some unit / integration tests added.

return new_rows


def maybe_reformat_dpo_to_kto(dataset: DatasetDict, num_proc: int = None):
Member:

For public methods, would you mind adding a docstring and a unit test please?
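
For example, a quick check along these lines (a rough sketch only; the import path and the exact output columns are assumptions based on the description above, not the test actually added in the PR):

import unittest
from datasets import Dataset, DatasetDict

class MaybeReformatDpoToKtoTester(unittest.TestCase):
    def test_dpo_dataset_is_reformatted(self):
        # NOTE: import path is an assumption; the PR adds the util in data_util.py
        from trl.data_util import maybe_reformat_dpo_to_kto

        dpo_ds = DatasetDict({"train": Dataset.from_dict(
            {"prompt": ["Q"], "chosen": ["good"], "rejected": ["bad"]}
        )})
        kto_ds = maybe_reformat_dpo_to_kto(dpo_ds, num_proc=1)
        # One DPO pair should become two unpaired KTO rows
        self.assertEqual(set(kto_ds["train"].column_names), {"prompt", "completion", "label"})
        self.assertEqual(kto_ds["train"].num_rows, 2)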

Contributor (Author):

Done!

@lewtun (Member) commented on Aug 29, 2024

Thanks for iterating! Would you mind fixing the code quality test? Then we can merge!

@KarelDO (Contributor, Author) commented on Aug 29, 2024

Thanks @lewtun, should be fixed now!

@lewtun (Member) commented on Sep 2, 2024

Ah, it seems some of the KTO tests are now failing after rebasing on main - would you mind fixing those? 🙏

@karel-contextual (Contributor) commented

@lewtun we should be good now!

@lewtun (Member) commented on Sep 4, 2024

Thanks for iterating!

@lewtun merged commit 7acb9c2 into huggingface:main on Sep 4, 2024. 9 checks passed.