Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fp8 matmul support on AMD MI300 #9531

Closed
wants to merge 1 commit into from

Commits on Mar 13, 2024

  1. [ROCM] Initial support of fp8 Matmul via hipBlasLt.

    [ROCM] Code refactoring of initial hipBlasLt fp8 Matmul support
    
     - Clean up unnecessary code, particularly regarding output types of fp8
    
     - Override methods in ParameterizedFp8GemmRewriteTest to replace the
       patterns for CUDA and ROCm respectively for HLO checks.
    
     - Explicitly set c_scale and d_scale to nullptr as hipblasLt currently
       does not support them.
    wenchenvincent committed Mar 13, 2024
    Configuration menu
    Copy the full SHA
    a4423f9 View commit details
    Browse the repository at this point in the history