Preference dataset docs #1636

SalmanMohammadi · 2024-09-20T12:21:48Z

Context

What is the purpose of this PR? Is it to

add a new feature
fix a bug
[ x update tests and/or documentation
other (please add here)

Please link to any issues this PR addresses.
#1529

Details on expected preference dataset format, where you can use it, and how to use custom preference datasets.
I haven't covered using different preference message transforms - maybe that can go in the message docs?

pytorch-bot · 2024-09-20T12:21:51Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchtune/1636

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 02ca414 with merge base 9a863c8 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

docs/source/basics/preference_datasets.rst

RdoubleA · 2024-09-20T22:06:30Z

docs/source/basics/preference_datasets.rst

+The ground-truth in preference datasets is usually the outcome of a binary comparison between two completions for the same prompt,
+and where a human annotator has indicated that one completion is more preferable than the other, according to some pre-set criterion.
+These prompt-completion pairs could be instruct style (single-turn, optionally with a single prompt), chat style (multi-turn), or
+some other set of interactions between a user and model (e.g. free-form text completion).


is this true, or do we only support preference chat? I guess as long as you make the transform it should work for all three, but your example below implies only chat

Since we support an optional system prompt it should work for all three right? That was the whole point of the preference dataset refactor to support arbitrary interactions

docs/source/basics/preference_datasets.rst

…e into preference_docs

SalmanMohammadi added 3 commits September 15, 2024 22:25

init

1fa2e41

Merge branch 'main' into preference_docs

d3cba9d

adding docs

2875c91

SalmanMohammadi requested a review from RdoubleA September 20, 2024 12:21

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 20, 2024

SalmanMohammadi requested review from felipemello1 and ebsmothers September 20, 2024 13:56

Update preference_datasets.rst

853fd9f

RdoubleA approved these changes Sep 20, 2024

View reviewed changes

SalmanMohammadi added 3 commits September 21, 2024 12:00

Merge branch 'main' into preference_docs

12469ea

comments

5ea2cfd

Merge branch 'preference_docs' of github.com:SalmanMohammadi/torchtun…

02ca414

…e into preference_docs

RdoubleA merged commit 908e6a2 into pytorch:main Sep 21, 2024
17 checks passed

SalmanMohammadi deleted the preference_docs branch September 27, 2024 13:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Preference dataset docs #1636

Preference dataset docs #1636

SalmanMohammadi commented Sep 20, 2024

pytorch-bot bot commented Sep 20, 2024 •

edited

Loading

RdoubleA Sep 20, 2024

SalmanMohammadi Sep 21, 2024

Preference dataset docs #1636

Preference dataset docs #1636

Conversation

SalmanMohammadi commented Sep 20, 2024

Context

pytorch-bot bot commented Sep 20, 2024 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchtune/1636

✅ No Failures

RdoubleA Sep 20, 2024

Choose a reason for hiding this comment

SalmanMohammadi Sep 21, 2024

Choose a reason for hiding this comment

pytorch-bot bot commented Sep 20, 2024 •

edited

Loading