fix the bug that for block_k=16 mma, the compilation crash on Ampere. #15
Commits on Aug 26, 2024
- Commit b535b55
- Commit 7a88766
- [BACKEND] Update LLVM version to llvm/llvm-project@dd7d81e (triton-lang#4374)
  Update LLVM version to llvm/llvm-project@dd7d81e
  Commit abdaaff
- [BACKEND] Update LLVM version to llvm/llvm-project@99bb9a7 (triton-lang#4410)
  Included the use of the non-deprecated version of createMCObjectStreamer (needed after llvm/llvm-project@f1422a8).
  Commit 3dd3657
- Commit 958e9a5
- Commit 87538e5
- [BACKEND] Update gcc debian package to point to a version 14.1.0-2 which exists in gcc-defaults. (triton-lang#4548)
  The llvm build check is trying to get http://ftp.de.debian.org/debian/pool/main/g/gcc-defaults/gcc-aarch64-linux-gnu_13.2.0-7_amd64.deb, which does not exist and therefore fails. Updating the version to an existing one (14.1.0-2).
  [x] I am not making a trivial change, such as fixing a typo in a comment.
  [x] I have written a PR description following these [rules](https://cbea.ms/git-commit/#why-not-how).
  [x] I have run `pre-commit run --from-ref origin/main --to-ref HEAD`.
  [x] This PR does not need a test because it is not a functional change, should fix git checks builds.
  [x] I have not added any `lit` tests.
  Commit 32fc9c5
- Commit 494f55c
- Commit b2de88f
Commits on Aug 28, 2024
- Commit 7a5940c
Commits on Sep 6, 2024
- fix the bug that for block_k=16 mma, the compilation crash on Ampere.
  The original issue is reported in triton-lang#3435.
  The crash happens during compilation, when arith.sitofp (from i8 to fp16) operates on a tensor operand that has a dot_op layout with opIdx = 1 and a first dimension of 16. For example:
    %104 = arith.sitofp %103 : tensor<16x64xi8, #triton_gpu.dot_op<{opIdx = 1, parent = #mma, kWidth = 4}>> to tensor<16x64xf16, #triton_gpu.dot_op<{opIdx = 1, parent = #mma, kWidth = 4}>>
  Investigation shows that the bug is in the TritonGPUToLLVM pass. In this corner case (block_k = 16 and opIdx = 1), extra elements are unpacked in include/triton/Conversion/TritonGPUToLLVM/ElementwiseOpToLLVM.h, lines 186-194. The code unpacks the extra elements because of an implicit assumption in lib/Dialect/TritonGPU/IR/Dialect.h, at line 2000, that at least 4 reps will be loaded.
  This patch therefore drops the extra loaded elements in the corner case.
  Commit daed93f
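The idea of dropping surplus unpacked elements, as described in the commit message above, can be illustrated with a small Python sketch. This is not the code from the patch: the function name `drop_extra_elements`, its parameters, and the assumption that the surplus values sit at the end of the unpacked list are all hypothetical and chosen only for illustration.

```python
def drop_extra_elements(unpacked, expected_count):
    """Return only the first `expected_count` unpacked values.

    Hypothetical helper, not part of Triton. It assumes that any surplus
    values produced by over-unpacking are trailing ones.
    """
    if expected_count < 0:
        raise ValueError("expected_count must be non-negative")
    if len(unpacked) < expected_count:
        # Too few values would be a different bug, so fail loudly.
        raise ValueError("fewer unpacked values than expected")
    return list(unpacked[:expected_count])


# Made-up sizes: 16 values were unpacked, but only 8 are expected.
unpacked_values = list(range(16))
kept_values = drop_extra_elements(unpacked_values, 8)
```

When the counts already match, the helper returns the input unchanged, so only the corner case is affected.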