Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unify WMMA and FPU operator typevars [NFC] #122

Merged
merged 1 commit into from
Jun 30, 2023
Merged

Unify WMMA and FPU operator typevars [NFC] #122

merged 1 commit into from
Jun 30, 2023

Commits on Jun 30, 2023

  1. Unify WMMA and FPU operator typevars [NFC]

    The WMMA operator only had a T typevar for the accumulator type,
    while the FPU operator had DT for the destination type and CT for
    the compute type. Unify that by adding both compute type (CT) and
    accumulator type (AT) typevars that indicate the type that should
    be used for the register-level storage and operations.
    
    Note that the WMMA operator's typevars are actually not useful, and
    should match the eltype of the shared memory (as we use WMMA intrinsics
    to load/store shared memory, so cannot convert between shared memory and
    registers). However, we need the accumulator typevar as it cannot be
    inferred from arguments at some points, so I decided to add the compute
    typevar too for alignment with the FPU operator.
    maleadt committed Jun 30, 2023
    Configuration menu
    Copy the full SHA
    ff15b5b View commit details
    Browse the repository at this point in the history