Load model in the target export precision by default in PTQ #10267

Merged

oyilmaz-nvidia merged 2 commits into main from jlasek/ptq_model_precision

Aug 27, 2024

Commits on Aug 27, 2024

Load model in the target export precision by default
```
Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
```
janekl committed Aug 27, 2024
75e19a9

Enable megatron_amp_O2=true to actually use half-precision
```
Signed-off-by: Jan Lasek <jlasek@nvidia.com>
```
janekl committed Aug 27, 2024
6df0950
