👈 Add `tokenizer` arg back and add deprecation guidelines #2348

qgallouedec · 2024-11-11T21:16:17Z

What does this PR do?

#2162 introduced a breaking change on all trainers except DPO and SFT (tokenizer arg replaced by processing_class). We've had feedback that this change was too abrupt, so we're reintroducing this argument with an extended timeline for its removal and also clarifying our removal strategy.

This PR will be part of a patch release for v0.12

Here is the proposed schedule depending of the usage for each trainer:

Trainer	Num models on the Hub	Argument removed in version
SFT	15,097	0.16
DPO	3,053	0.16
PPO	441	0.15
ORPO	320	0.15
Reward	270	0.15
KTO	80	0.14
CPO	26	0.14
Online DPO	14	0.14
RLOO	4	0.14
XPO	3	0.13
Nash	1	0.13
GKD	1	0.13
Iterative SFT	1	0.13
BCO	0	0.13

cc @muellerzr

Related: #2290 #2226 #2207 #2218

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a GitHub issue? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

HuggingFaceDocBuilderDev · 2024-11-11T21:19:57Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

lewtun

Thanks a lot for enabling backwards compatibility @qgallouedec! The PR LGTM and the new docs are really well written 🔥

lewtun · 2024-11-11T22:26:45Z

CONTRIBUTING.md

@@ -256,3 +256,30 @@ That's how `make test` is implemented (without the `pip install` line)!

 You can specify a smaller set of tests to test only the feature
 you're working on.
+
+### Deprecation and Backward Compatibility


Super nice documentation!

CONTRIBUTING.md

muellerzr · 2024-11-11T22:33:16Z

Beautiful! 🔥

* Add deprecation and backward compatibility guidelines * Update tokenizer argument in trainer classes * Add warning message for TRL Judges API

Add deprecation and backward compatibility guidelines

767dd8a

Update tokenizer argument in trainer classes

c5b71db

qgallouedec marked this pull request as ready for review November 11, 2024 21:21

qgallouedec requested review from lewtun, kashif and abhishekkrthakur November 11, 2024 21:21

qgallouedec changed the title ~~👈 Add tokenizer arg back and deprecation and backward compatibility guidelines~~ 👈 Add tokenizer arg back and add deprecation guidelines Nov 11, 2024

kashif approved these changes Nov 11, 2024

View reviewed changes

lewtun approved these changes Nov 11, 2024

View reviewed changes

Add warning message for TRL Judges API

3ce399c

qgallouedec merged commit 015321e into main Nov 11, 2024
14 checks passed

qgallouedec deleted the tokenizer-back branch November 11, 2024 23:06

qgallouedec added a commit that referenced this pull request Nov 11, 2024

👈 Add tokenizer arg back and add deprecation guidelines (#2348)

f662824

* Add deprecation and backward compatibility guidelines * Update tokenizer argument in trainer classes * Add warning message for TRL Judges API

qgallouedec mentioned this pull request Nov 11, 2024

👋 Remove deprecated tokenizer argument in BCO, GKD, Iterative SFT, Nash MD and XPO #2349

Merged

5 tasks

dame-cell mentioned this pull request Nov 13, 2024

unexpected keyword argument tokenizer [FIXED] unslothai/unsloth#1285

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

👈 Add `tokenizer` arg back and add deprecation guidelines #2348

👈 Add `tokenizer` arg back and add deprecation guidelines #2348

qgallouedec commented Nov 11, 2024 •

edited

Loading

HuggingFaceDocBuilderDev commented Nov 11, 2024

lewtun left a comment

lewtun Nov 11, 2024

muellerzr commented Nov 11, 2024

👈 Add tokenizer arg back and add deprecation guidelines #2348

👈 Add tokenizer arg back and add deprecation guidelines #2348

Conversation

qgallouedec commented Nov 11, 2024 • edited Loading

What does this PR do?

Before submitting

Who can review?

HuggingFaceDocBuilderDev commented Nov 11, 2024

lewtun left a comment

Choose a reason for hiding this comment

lewtun Nov 11, 2024

Choose a reason for hiding this comment

muellerzr commented Nov 11, 2024

👈 Add `tokenizer` arg back and add deprecation guidelines #2348

👈 Add `tokenizer` arg back and add deprecation guidelines #2348

qgallouedec commented Nov 11, 2024 •

edited

Loading