-
Notifications
You must be signed in to change notification settings - Fork 37
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Improve T5 encoder tests with more prompts and static context length
The set of prompts is not big enough for statistically sound testing of the T5 encoder. This is true for other text encoders. With the expansion of the prompt set the bf16 numerical difference between eager and IREE vanished. IREE is even more accurate. In tests the tokenizer padding has been change to produce always max length token sequence. This is in line how T5 is used int the Flux pipeline. The T5 encoder export has been expanded with an option to export with a static token sequence length. The tests were refactored to share tolerance values for f32 and bf16.
- Loading branch information
Showing
3 changed files
with
96 additions
and
98 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.