Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Only apply activation if transformer block is used in DIETClassifier #5626

Merged
merged 4 commits into from
Apr 15, 2020

Conversation

tabergma
Copy link
Contributor

@tabergma tabergma commented Apr 14, 2020

Proposed changes:
Only apply activation if transformer block is used in DIETClassifier

Status (please check what you already did):

  • added some tests for the functionality
  • updated the documentation
  • updated the changelog (please check changelog for instructions)
  • reformat files using black (please check Readme for instructions)

@dakshvar22
Copy link
Contributor

I think @Ghostvv wanted to place the call to the activation inside DIET conditioned on usage of transformer and not place it inside transformer?

@Ghostvv
Copy link
Contributor

Ghostvv commented Apr 14, 2020

yes, let's keep transformer a "classic" transformer

Copy link
Contributor

@dakshvar22 dakshvar22 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@tabergma tabergma merged commit dc58581 into 1.9.x Apr 15, 2020
@tabergma tabergma deleted the bug-activation branch April 15, 2020 08:13
@Ghostvv
Copy link
Contributor

Ghostvv commented Apr 16, 2020

@tabergma this update requires retraining the model

@tabergma
Copy link
Contributor Author

True. I guess it is already released 🙈

@Ghostvv
Copy link
Contributor

Ghostvv commented Apr 16, 2020

that is a bit dangerous, since prediction might go haywire, because technically models can be loaded

@tabergma
Copy link
Contributor Author

any suggestion on how to solve that?

@Ghostvv
Copy link
Contributor

Ghostvv commented Apr 16, 2020

can we update docs and rerelease?

@erohmensing erohmensing changed the title Move activation to transformer block make DIET activation call contitioned on transformer usage Apr 24, 2020
@tabergma tabergma changed the title make DIET activation call contitioned on transformer usage Only apply activation if transformer block is used in DIETClassifier Apr 24, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants