Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve Hugging Face SFT Script #539

Open
wants to merge 6 commits into
base: main
Choose a base branch
from

Conversation

ParagEkbote
Copy link
Contributor

Fixes #487

I've chosen to remove the deprecated parameter as previously mentioned in the issue. The sequence length for the training dataset can be specified using the HF datasets library as mentioned here.

Please let me know if any further rectification is required and I will make the necessary changes.

cc: @Tcc0403

@Tcc0403
Copy link
Collaborator

Tcc0403 commented Jan 25, 2025

Let's check whether training_multimodal.py has deprecated args in SFTTrainer as well

@ParagEkbote
Copy link
Contributor Author

Let's check whether training_multimodal.py has deprecated args in SFTTrainer as well

There seems to be deprecated args in data preparation and will require a re-write.

@Tcc0403
Copy link
Collaborator

Tcc0403 commented Jan 26, 2025

Can you help with that?

@ParagEkbote
Copy link
Contributor Author

Can you help with that?

Yes. Can you please help find the PRs/diff for the SFT Trainer like you have previously done here. That could be quite helpful:)

@Tcc0403
Copy link
Collaborator

Tcc0403 commented Jan 26, 2025

You could use git blame and commit history on those module files to find relevant changes.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

error when run sh run_qwen.sh
2 participants