New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Fp8 matmul support on AMD MI300 #9531

Closed

wenchenvincent wants to merge 1 commit into openxla:main from ROCm:ci_fp8_gemm_support

Commits on Mar 13, 2024

[ROCM] Initial support of fp8 Matmul via hipBlasLt.

[ROCM] Code refactoring of initial hipBlasLt fp8 Matmul support

 - Clean up unnecessary code, particularly regarding output types of fp8

 - Override methods in ParameterizedFp8GemmRewriteTest to replace the
   patterns for CUDA and ROCm respectively for HLO checks.

 - Explicitly set c_scale and d_scale to nullptr as hipblasLt currently
   does not support them.

wenchenvincent committed Mar 13, 2024