Add support for AMD GPUs #619

jakurzak · 2023-08-18T18:50:35Z

Overview

This pull request adds support for AMD Instinct GPUs.

Changes

`CMakeLists.txt`

The newly added section of the CMake script introduces logic to detect the presence of the hipcc compiler, which is part of AMD's ROCm software platform. If the hipcc compiler is detected, CMake will configure the build system to compile the HIP source files using hipcc.

`Makefile`

Add HIPCC=hipcc and HIPCCFLAGS = -O3 variables, similarly to the existing NVCC=nvcc and NVCCFLAGS = -O3 variables.
Add .PHONY: qsim-hip, .PHONY: hip-tests, and .PHONY: run-hip-tests sections, similarly to the existing .PHONY: qsim-cuda, .PHONY: cuda-tests, and .PHONY: run-cuda-tests sections.

`apps/Makefile`

Generate the list of HIP_TARGETS with the .hip.x suffix from all files with the .cuda.cu suffix, similarly to how the list of CUDA_TARGETS is generated.
Add the .PHONY: qsim-hip and the %hip.x: %cuda.cu section, similarly to the existing .PHONY: qsim-cuda and %cuda.x: %cuda.cu sections.
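
The pattern rule `%hip.x: %cuda.cu` amounts to a name mapping from each CUDA source file to a HIP target. A minimal Python sketch of that mapping follows. The helper name `hip_target` and the sample file names are assumptions made for illustration and are not taken from the Makefile.

```python
def hip_target(source: str) -> str:
    """Mirror the Makefile pattern rule `%hip.x: %cuda.cu` for one file name."""
    suffix = "cuda.cu"
    if not source.endswith(suffix):
        raise ValueError(f"not a CUDA source file: {source!r}")
    # Keep the stem (the `%` part of the pattern) and swap the suffix.
    return source[: -len(suffix)] + "hip.x"


# Hypothetical file names, used only for illustration.
sources = ["example_cuda.cu", "other_cuda.cu"]
hip_targets = [hip_target(s) for s in sources]
print(hip_targets)
```

The same CUDA source therefore yields two binaries, one per toolchain, without duplicating any source files.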

`apps/make.sh`

The added section compiles CUDA source files with the hipcc compiler instead of nvcc when hipcc is available.

`docs/_book.yaml`

Add "AMD GPU support" article to the "Other tutorials" section.

`docs/tutorials/amd_gpu.md`

Add basic documentation for the AMD GPU support.

`lib/cuda2hip.h`

Define a series of macros that map CUDA function calls, types, constants, and utilities to their corresponding HIP counterparts.
Provide a template function __shfl_down_sync(), as a wrapper around HIP's __shfl_down() function, since the former is not currently available in HIP.
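
To picture what the macro mapping does, the sketch below applies a few CUDA-to-HIP renames as plain token substitution. This is only an illustration: the header itself relies on C preprocessor macros, the table lists a small subset of names (all of them real HIP identifiers, but not copied from `lib/cuda2hip.h`), and the helper `to_hip` is hypothetical.

```python
import re

# A small, illustrative subset of CUDA-to-HIP renames. The actual list in
# lib/cuda2hip.h is not reproduced here.
CUDA_TO_HIP = {
    "cudaMalloc": "hipMalloc",
    "cudaFree": "hipFree",
    "cudaMemcpy": "hipMemcpy",
    "cudaMemset": "hipMemset",
    "cudaMemcpyHostToDevice": "hipMemcpyHostToDevice",
    "cudaDeviceSynchronize": "hipDeviceSynchronize",
    "cudaError_t": "hipError_t",
    "cudaSuccess": "hipSuccess",
}

# Match whole identifiers only, so cudaMemcpy does not fire inside
# cudaMemcpyHostToDevice.
_PATTERN = re.compile(r"\b(" + "|".join(map(re.escape, CUDA_TO_HIP)) + r")\b")


def to_hip(line: str) -> str:
    """Replace every known CUDA identifier in `line` with its HIP name."""
    return _PATTERN.sub(lambda m: CUDA_TO_HIP[m.group(1)], line)


print(to_hip("cudaMemcpy(dest, src, n, cudaMemcpyHostToDevice);"))
```

Because the renaming happens at the identifier level, the existing CUDA sources can stay unchanged and compile under either toolchain.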

`lib/fuser_mqubit.h`

Remove unused variable to eliminate Clang warning.

`lib/simulator_cuda_kernels.h`

Add a section for including HIP headers if the HIP compiler is detected.

`lib/statespace_cuda.h`

Add a section for including HIP headers if the HIP compiler is detected.
Add a missing error check for cudaMemset().

`lib/statespace_cuda_kernels.h`

Add a section for including HIP headers if the HIP compiler is detected.

`lib/util_cuda.h`

Add a section for including HIP headers if the HIP compiler is detected.

`lib/vectorspace_cuda.h`

Add a section for including HIP headers if the HIP compiler is detected.
Add missing error checks for cudaFree(), cudaMemcpy(), and cudaDeviceSynchronize().

`pybind_interface/Makefile`

Add QSIMLIB_HIP, similar to the existing QSIMLIB_CUDA.
Rename pybind-gpu and decide-gpu to pybind-cuda and decide-cuda.
Add pybind-hip and decide-hip.
Add removal of HIP files in the clean section.

`pybind_interface/decide/CMakeLists.txt`

To add HIP support to the CMake file, checks were introduced to detect whether hipcc is available. If hipcc is found, the project is configured to use HIP by setting HIP as one of the project languages. The CMake file then adjusts the build process to use HIP-specific commands (hip_add_library) and properties for compiling the relevant source files. It also includes the necessary directories and sets the module properties for creating a Python extension module with HIP support, allowing the same build system to work across both CUDA and HIP platforms depending on the available compiler.

`pybind_interface/decide/decide.cpp`

Add HIP = 2 to enum GPUCapabilities.
Set GPUCapabilities gpu = HIP if HIP compiler is detected.

`pybind_interface/hip/CMakeLists.txt`

Set up HIP compilation for a Python extension module written in HIP and C++. Configure compiler optimizations and OpenMP settings. Integrate Pybind11 for C++/Python interfacing, and locate necessary Python libraries.

`pybind_interface/hip/pybind_main_hip.cpp`

Integrate with Pybind11 for creating Python bindings by creating a factory pattern for object creation and setting up the necessary environment.

`pybind_interface/hip/pybind_main_hip.h`

Introduce Python bindings for a module called qsim_hip using Pybind11.

`pybind_interface/pybind_main.cpp`

Remove unused lambda captures: circuit, ncircuit, and state_space.

`qsimcirq/__init__.py`

Add detection of HIP. If qsim_decide.detect_gpu() returns 2, import the HIP-compatible module qsimcirq.qsim_hip.
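
A rough sketch of this dispatch is shown below. Only the rule "code 2 selects `qsimcirq.qsim_hip`" comes from the description above. The function name `load_gpu_module`, the injectable `importer` parameter, and the handling of other codes are assumptions made so that the example runs without qsimcirq installed.

```python
import importlib

HIP_CODE = 2  # per the PR description: detect_gpu() returns 2 for HIP


def load_gpu_module(detect_code, importer=importlib.import_module):
    """Import the HIP extension when the detection code signals HIP.

    `importer` is injectable so the sketch runs without qsimcirq installed.
    """
    if detect_code == HIP_CODE:
        return importer("qsimcirq.qsim_hip")
    # Other codes are left to the pre-existing logic, which is not shown here.
    return None


# Demonstration with a stub importer that only echoes the requested name.
print(load_gpu_module(2, importer=lambda name: f"<module {name}>"))
print(load_gpu_module(-1, importer=lambda name: f"<module {name}>"))
```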

`setup.py`

Add import shutil.
If hipcc is found, append arguments to the cmake_args list to specify hipcc as the C and C++ compiler.
Specify that the qsim_hip directory contains a CMake project to be built as a Python extension with support for HIP.
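
The hipcc detection can be sketched with `shutil.which`, as below. `CMAKE_C_COMPILER` and `CMAKE_CXX_COMPILER` are standard CMake variables, but the exact arguments used in `setup.py`, the helper name `hip_cmake_args`, and the starting contents of `cmake_args` are assumptions for illustration.

```python
import shutil


def hip_cmake_args(hipcc_path):
    """Return extra CMake arguments when a hipcc path is given, else []."""
    if hipcc_path is None:
        return []
    return [
        f"-DCMAKE_C_COMPILER={hipcc_path}",
        f"-DCMAKE_CXX_COMPILER={hipcc_path}",
    ]


# Placeholder starting list; the real setup.py builds its own arguments.
cmake_args = ["-DCMAKE_BUILD_TYPE=Release"]
# shutil.which returns None when hipcc is not on PATH, so nothing is appended.
cmake_args += hip_cmake_args(shutil.which("hipcc"))
print(cmake_args)
```

On a machine without ROCm the list is left untouched, so the default compiler is used.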

`tests/Makefile`

Add rules for building HIP tests, similar to the rules for building CUDA tests.

`tests/make.sh`

Add rules for building HIP tests, similar to the rules for building CUDA tests.

Building

make -j qsim      # to build CPU qsim
make -j qsim-hip  # to build HIP qsim
make -j pybind    # to build Python bindings
make -j cxx-tests # to build CPU tests
make -j hip-tests # to build HIP tests
pip install .

Testing

Simulator

make run-cxx-tests # to run CPU tests
make run-hip-tests # to run HIP tests

or

cd tests
for file in *.x; do ./"$file"; done          # to run all tests
for file in *_hip_test.x; do ./"$file"; done # to run HIP tests only

Python bindings

make run-py-tests

or

cd qsimcirq_tests
python3 -m pytest -v qsimcirq_test.py

google-cla · 2023-08-18T18:50:38Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

tanujkhattar · 2023-09-27T17:12:10Z

@jakurzak Can you please add a PR description?

@sergeisakov Can you take a look at this PR?

sergeisakov · 2023-10-02T16:23:09Z

> @sergeisakov Can you take a look at this PR?

Yes, I'll take a look at this PR.

sergeisakov

Thank you for adding support for AMD GPUs. I can't really test if it works or not. Nevertheless the PR looks good to me. Could you fix a few formatting issues? Could you also add some doc?

lib/statespace_cuda.h

pybind_interface/Makefile

lib/vectorspace_cuda.h

sergeisakov · 2023-10-05T10:55:43Z

lib/vectorspace_cuda.h

-               sizeof(fp_type) * Impl::MinSize(dest.num_qubits()),
-               cudaMemcpyHostToDevice);
+    ErrorCheck(
+      cudaMemcpy(dest.get(), src,


There should be a four-space indent if the line breaks after `(`.

lib/vectorspace_cuda.h

jakurzak · 2023-10-09T20:20:32Z

Thank you for taking the time to look at the PR!
I am out of office this week and traveling overseas.
I'll do my best to address the comments ASAP when I'm back in office.

jakurzak · 2023-10-16T18:49:13Z

I fixed the style issues and commented on the use of HIPCC for CPU code when compiling Python bindings.
I'll be soliciting help with updating the documentation.

jakurzak · 2023-11-07T19:25:19Z

I added basic documentation for AMD GPU support.
I added a detailed description to the PR.
Please let me know if anything else needs to be done before the PR can be accepted.

sergeisakov · 2023-11-08T16:48:33Z

> I added basic documentation for AMD GPU support. I added detailed description to the PR. Please let me know if anything else needs to be done before the PR can be accepted.

Thank you. Could you replace Qsim with qsim in amd_gpu.md? Could you also add something about how to install the build tools? Is it ROCm?

jakurzak · 2023-11-08T17:29:11Z

> Thank you. Could you replace Qsim with qsim in amd_gpu.md? Could you also add something about how to install the build tools? Is it ROCm?

done

sergeisakov

LGTM

jakurzak · 2023-11-09T14:37:04Z

Looks like it's getting hung up on the kokoro test.
Any clue why?

95-martin-orion · 2023-11-09T15:00:58Z

The cuQuantum test failed during bringup:

Logs

#21 ERROR: process "/bin/sh -c ${PIP_CMD} install --upgrade --force-reinstall ${tensorflow_pip_spec} ${extra_pip_specs}" did not complete successfully: exit code: 2
------
 > [17/26] RUN python3.9 -m pip install --upgrade --force-reinstall tf-nightly :
78.73     for chunk in response.raw.stream(
78.73   File "/usr/local/lib/python3.9/dist-packages/pip/_vendor/urllib3/response.py", line 622, in stream
78.73     data = self.read(amt=amt, decode_content=decode_content)
78.73   File "/usr/local/lib/python3.9/dist-packages/pip/_vendor/urllib3/response.py", line 587, in read
78.73     raise IncompleteRead(self._fp_bytes_read, self.length_remaining)
78.73   File "/usr/lib/python3.9/contextlib.py", line 137, in __exit__
78.73     self.gen.throw(typ, value, traceback)
78.73   File "/usr/local/lib/python3.9/dist-packages/pip/_vendor/urllib3/response.py", line 443, in _error_catcher
78.73     raise ReadTimeoutError(self._pool, None, "Read timed out.")
78.73 pip._vendor.urllib3.exceptions.ReadTimeoutError: HTTPSConnectionPool(host='files.pythonhosted.org', port=443): Read timed out.
------
Dockerfile_ubuntu_2004_tf_cuda_11:91
--------------------
  89 |     RUN ${PIP_CMD} install wheel
  90 |     RUN ${PIP_CMD} install absl-py
  91 | >>> RUN ${PIP_CMD} install --upgrade --force-reinstall ${tensorflow_pip_spec} ${extra_pip_specs}
  92 |     
  93 |     RUN ${PIP_CMD} install tfds-nightly
--------------------
ERROR: failed to solve: process "/bin/sh -c ${PIP_CMD} install --upgrade --force-reinstall ${tensorflow_pip_spec} ${extra_pip_specs}" did not complete successfully: exit code: 2

From the logs, it's possible this was a network error. I'll re-trigger the test, but in the meantime check the logs to see if this PR could have caused the issue.

95-martin-orion · 2023-11-09T15:09:10Z

Hmm. Doesn't look like I can re-trigger the kokoro test specifically. Could you push a commit so GitHub picks up on it?

jakurzak · 2023-11-09T15:21:12Z

No cuQuantum support in the AMD port, BTW, as there's no AMD equivalent.
The HIP port should not affect anything in the CUDA implementation, though.
Pushed a commit to re-trigger.

jakurzak · 2023-11-09T20:56:12Z

@sergeisakov, @95-martin-orion, can you help move this forward?
Thanks in advance!

jakurzak added 11 commits August 9, 2023 13:45

Remove unused lambda captures

aa76b30

Remove unused variable

036654d

Add missing CUDA error checks

61c9d09

Add CUDA to HIP translation

caa5463

Add HIP to pybind11

6dd69e2

Add HIP to make and cmake files

5edafd6

Add HIP cleanup in pybind_interface

c80f91d

Remove APPLE from pybind for HIP

09611cf

Use HIPCC for CPU and GPU pybind files

d732ea8

Add CMAKE_MODULE_PATH for pybind with HIP

31fff3b

Add hipcc detection in setup.py

9e12b68

tanujkhattar requested a review from sergeisakov September 27, 2023 17:11

sergeisakov requested changes Oct 5, 2023

Fix style

d254543

Add documentation for AMD GPU support

a1ab2e2

jakurzak added 2 commits November 8, 2023 12:13

Replace Qsim with qsim in amd_gpu.md

56c27c6

Add pointers to AMD ROCm Platform to amd_gpu.md

62d64c4

sergeisakov added the kokoro:run Trigger Kokoro builds for this PR. label Nov 9, 2023

qsim-qsimh-bot removed the kokoro:run Trigger Kokoro builds for this PR. label Nov 9, 2023

sergeisakov added the kokoro:run Trigger Kokoro builds for this PR. label Nov 9, 2023

sergeisakov approved these changes Nov 9, 2023

95-martin-orion added kokoro:run Trigger Kokoro builds for this PR. and removed kokoro:run Trigger Kokoro builds for this PR. labels Nov 9, 2023

Merge branch 'master' into master

e09e6b7

jakurzak force-pushed the master branch from 9aa50c5 to e09e6b7 Compare November 9, 2023 15:21

qsim-qsimh-bot removed the kokoro:run Trigger Kokoro builds for this PR. label Nov 9, 2023

Merge branch 'master' into master

33c49f2

95-martin-orion added the kokoro:run Trigger Kokoro builds for this PR. label Nov 9, 2023

qsim-qsimh-bot removed the kokoro:run Trigger Kokoro builds for this PR. label Nov 9, 2023

95-martin-orion merged commit a5e7a82 into quantumlib:master Nov 9, 2023
16 checks passed

This was referenced Nov 9, 2023

Update version to 0.17.1 #636

Closed

Update version to 0.18.0 #637

Merged
