-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Clean up DPO example #2043
Clean up DPO example #2043
Changes from all commits
d5f2cb2
7ed8db8
3aba854
7986fbf
d1d2cfc
ddf30cb
d1fffe8
29087d8
aef1726
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -111,7 +111,6 @@ class DPOScriptArguments: | |
dataset_name: str = field(default=None, metadata={"help": "the dataset name"}) | ||
dataset_train_split: str = field(default="train", metadata={"help": "The dataset split to use for training"}) | ||
dataset_test_split: str = field(default="test", metadata={"help": "The dataset split to use for evaluation"}) | ||
sanity_check: bool = field(default=False, metadata={"help": "only train on 1000 samples"}) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This type of debugging arg shouldn't live in the lib IMO There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Let's remove them all There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Good idea! Done in ddf30cb |
||
ignore_bias_buffers: bool = field( | ||
default=False, | ||
metadata={ | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
FYI @kashif @qgallouedec @edbeeching we should not add this logic into the example scripts IMO - it's best solved by adding support for something like the dataset mixer we have in the handbook or H4 repo
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ah yes my bad!