Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Various fixes for TP #260

Merged
merged 3 commits into from
Oct 23, 2023
Merged

Various fixes for TP #260

merged 3 commits into from
Oct 23, 2023

Conversation

michaelbenayoun
Copy link
Member

@michaelbenayoun michaelbenayoun commented Oct 13, 2023

  • Evaluation can now happens in a data parallel setting when it makes sense.
  • Loss is averaged properly under TP setting

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

@michaelbenayoun michaelbenayoun changed the title Fix loss log when using TP Various fix for TP Oct 20, 2023
@michaelbenayoun michaelbenayoun changed the title Various fix for TP Various fixes for TP Oct 20, 2023
@michaelbenayoun michaelbenayoun merged commit 4b37209 into main Oct 23, 2023
14 of 16 checks passed
@michaelbenayoun michaelbenayoun deleted the fix_trainer_for_tp branch October 23, 2023 09:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants