CTC example: updated trainer parameters to save tokenizer #23243

Merged
merged 1 commit into from
May 10, 2023

MKhalusova
Contributor

The current example passes only feature_extractor to Trainer, so the tokenizer is not saved and won't be pushed to the Hub. This PR fixes that by passing the processor to Trainer instead. The example could probably be refactored further to get the tokenizer and feature_extractor from the instantiated processor, but as far as behavior goes, this small fix seems to address the problem.
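
To illustrate why the change matters, here is a minimal sketch that does not use the real library. It assumes that the trainer, when saving, calls save_pretrained on whatever object it received as its tokenizer argument, and that a processor's save_pretrained saves both of its components. All classes below (ToyFeatureExtractor, ToyTokenizer, ToyProcessor, ToyTrainer) and the helper saved_files are hypothetical stand-ins, not the transformers classes, and the file names are chosen only for illustration.

```python
import json
import os
import tempfile


class ToyFeatureExtractor:
    """Hypothetical stand-in for an audio feature extractor."""

    def save_pretrained(self, save_directory):
        path = os.path.join(save_directory, "preprocessor_config.json")
        with open(path, "w") as f:
            json.dump({"sampling_rate": 16000}, f)


class ToyTokenizer:
    """Hypothetical stand-in for a CTC tokenizer."""

    def save_pretrained(self, save_directory):
        path = os.path.join(save_directory, "tokenizer_config.json")
        with open(path, "w") as f:
            json.dump({"pad_token": "[PAD]"}, f)


class ToyProcessor:
    """Hypothetical stand-in that bundles a feature extractor and a tokenizer."""

    def __init__(self, feature_extractor, tokenizer):
        self.feature_extractor = feature_extractor
        self.tokenizer = tokenizer

    def save_pretrained(self, save_directory):
        # Saving the processor saves both components.
        self.feature_extractor.save_pretrained(save_directory)
        self.tokenizer.save_pretrained(save_directory)


class ToyTrainer:
    """Hypothetical trainer: saves only the object passed as `tokenizer`."""

    def __init__(self, tokenizer=None):
        self.tokenizer = tokenizer

    def save_model(self, output_dir):
        os.makedirs(output_dir, exist_ok=True)
        if self.tokenizer is not None:
            self.tokenizer.save_pretrained(output_dir)


def saved_files(preprocessor):
    """Return the sorted file names written when the trainer saves."""
    with tempfile.TemporaryDirectory() as tmp:
        ToyTrainer(tokenizer=preprocessor).save_model(tmp)
        return sorted(os.listdir(tmp))


if __name__ == "__main__":
    fe, tok = ToyFeatureExtractor(), ToyTokenizer()
    print("feature extractor only:", saved_files(fe))
    print("processor:", saved_files(ToyProcessor(fe, tok)))
```

In this sketch, passing only the feature extractor leaves the tokenizer files out of the output directory, while passing the processor writes both sets of files, which mirrors the behavior described above.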

Collaborator

@amyeroberts amyeroberts left a comment

Thanks for fixing!

@MKhalusova MKhalusova merged commit 91f4c84 into huggingface:main May 10, 2023
@sanchit-gandhi
Contributor

Thanks @MKhalusova!

gojiteji pushed a commit to gojiteji/transformers that referenced this pull request Jun 5, 2023
…e#23243)

trainer parameters changed to save tokenizer in addition to feature_extractor
novice03 pushed a commit to novice03/transformers that referenced this pull request Jun 23, 2023
…e#23243)

trainer parameters changed to save tokenizer in addition to feature_extractor
4 participants