-
Notifications
You must be signed in to change notification settings - Fork 399
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bug in DatasetForTextClassification._prepare_for_training_with_transformers
when multi_label=True
#2606
Milestone
Comments
Thank you for reporting this! See the Stack trace here
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Describe the bug
When calling
.prepare_for_training()
with the default arguments on aDatasetForTextClassification
dataset withmulti_label=True
records, the_prepare_for_training_with_transformers
function fails. This is due to the recent addition of thecontext
key when building the 🤗Dataset
.Then, IMO the following line should either be removed or made optional just in case the
context
key is available.https://github.com/argilla-io/argilla/blame/ba4ae63dc43ae1d36aa0efcf03896bcf5d9206ee/src/argilla/client/datasets.py#L764.
So on, the bug was introduced in 1.5.0, as in 1.4.0 it works fine and the CI seems to be passing.
To Reproduce
Run the following script to reproduce:
Expected behavior
The
context
key shouldn't be mandatory, so as to ensure consistency with the previous datasets. So on, the code above should work and the CI should pass.Environment (please complete the following information):
The text was updated successfully, but these errors were encountered: