We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
877bdca
CUTLASS 1.3 adds efficient GEMM kernels targeting Volta Tensor Cores via mma.sync instruction added in CUDA 10.1.