Skip to content

Commit

Permalink
Update instructions
Browse files Browse the repository at this point in the history
  • Loading branch information
nicolasvasilache committed Apr 21, 2023
1 parent e8b0966 commit 00d25eb
Showing 1 changed file with 6 additions and 6 deletions.
12 changes: 6 additions & 6 deletions transform_dialect/examples/cuda/Makefile
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
# Instructions
# ============
# ```
# make clean && make -j 8 some_tests
# make clean && make -j 8 unaligned_matmuls
# ```
# This will generate the following files:
# - logs/*.log the runtimes for various matmul sizes
Expand All @@ -10,24 +10,24 @@
# To produce PTX:
# ```
# # IREE without transform dialect.
# make gen/iree_cuda_a100_fill_matmul_sizes_200_300_400.vmfb
# make gen/cuda_a100_fill_matmul_sizes_200_300_400.vmfb
# # IREE with transform dialect.
# make gen/iree_cuda_a100_fill_matmul_sizes_200_300_400.td.vmfb
# make gen/cuda_a100_fill_matmul_sizes_200_300_400.td.vmfb
# ```
#
# To produce the TD script:
# ```
# make gen/fill_matmul_sizes_2052_2052_2052.embedded.td.mlir
# make gen/cuda_a100_fill_matmul_sizes_2052_2052_2052.embedded.td.mlir
# ```
#
# Then further process with:
# ```
# iree-opt --pass-pipeline='builtin.module(hal.executable(hal.executable.variant(iree-llvmgpu-lower-executable-target)))' gen/fill_matmul_sizes_2052_2052_2052.embedded.td.mlir
# iree-opt --pass-pipeline='builtin.module(hal.executable(hal.executable.variant(iree-llvmgpu-lower-executable-target)))' gen/cuda_a100_fill_matmul_sizes_2052_2052_2052.embedded.td.mlir
# ```
#
# Or:
# ```
# iree-compile --iree-hal-target-backends=cuda --iree-hal-cuda-llvm-target-arch=sm_80 gen/fill_matmul_sizes_2052_2052_2052.embedded.td.mlir
# iree-compile --iree-hal-target-backends=cuda --iree-hal-cuda-llvm-target-arch=sm_80 gen/cuda_a100_fill_matmul_sizes_2052_2052_2052.embedded.td.mlir
# ```

LLVM_SRC=$(HOME)/github/llvm-project
Expand Down

0 comments on commit 00d25eb

Please sign in to comment.