Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Refiled] IFU-main-2023-07-31 #1469

Closed
wants to merge 56 commits into from

Conversation

jithunnair-amd
Copy link
Collaborator

@jithunnair-amd jithunnair-amd commented Aug 7, 2023

PLEASE IGNORE THIS PR: USED THE WRONG TARGET REPO

KyleCZH and others added 30 commits March 22, 2022 19:07
* add dependencies for rocm5.1

* install miopen dependencies using cmake

* change lib path in build_rocm.sh for rocm5.1

* change amdgpu_version for rocm5.1

* remove 4.5.2 and add 5.1
[ROCm] add dependencies for rocm5.2
* change the MAYBE_LIB64 path for just for 5.2
* Changes to support ROCm 5.3

* Updated as per comments
- In ROCm 5.3 libtorch build are failing during magma build due to
  to missing python binary so added install statement
* Updating the condition for noRCCL build

* Updated changes as per comments
…rocm_fork

Disable MLIR backend when building MIOpen
jithunnair-amd and others added 26 commits October 5, 2022 02:07
…taging_branch

Use staging branch of MIOpen for ROCm5.3
* Update to so patching for ROCm

Wildcard used in grep to grab the actual numbered so file referenced
in patchelf. This allows the removal of specifying the so number in
DEPS_LIST & DEPS_SONAME

This commit also adds the functionality for trimming so names to
build_libtorch.sh from build_common.sh

* Refactor to remove switch statement in build_rocm.sh

This commit refactors build_rocm.sh and brings in a few major updates:
 - No longer required to specify the full .so name (with number) for ROCm libraries
       - The .so versions are copied and the patching code will fix the links to point to this version
 - No longer required to specify paths for ROCm libraries allowing the removal of the large switch
       - Paths are acquired programmatically with find
 - No longer required to specify both the path and filename for the OS specific libraries
       - Programatically extract file name from the path
 - Automatically extract Tensile/Kernels files for the architectures specified in PYTORCH_ROCM_ARCH
   and any non-arch specific files e.g. TensileLibrary.dat
* Remove miopen custom build step

* Bundle MIOpen db files in wheel

* Correct path
* Use libtinfo.so.6 for Ubuntu 2004

* Fix to origname grep

* Condition on ROCM_VERSION for libtinfo6
We require the same fix that was made on upstream pytorch
pytorch/pytorch#91371
ROCm/pytorch@b72ec7c

Without this change install_conda.sh stage fails
```
#21 6.254 CondaFileIOError: '/opt/conda/pkgs/envs/*/env.txt'. [Errno 2] No such file or directory: '/opt/conda/pkgs/envs/*/env.txt'
#21 6.254 
#21 ERROR: executor failed running [/bin/sh -c bash ./install_conda.sh && rm install_conda.sh]: exit code: 1
------
 > [conda 2/3] RUN bash ./install_conda.sh && rm install_conda.sh:
------
executor failed running [/bin/sh -c bash ./install_conda.sh && rm install_conda.sh]: exit code: 1
```

Locally tested with the `/builder/libtorch/build_docker.sh`
Co-authored-by: Jack Taylor <jack.taylor@amd.com>
…tes (#31)

* Update magma commits for various branches to include header path updates

* Add PYTORCH_BRANCH as dockerfile ARG for magma build
* Bundle msccl-algorithms into wheel

* Use correct src path for msccl-algorithms
Extract both devel and non-devel rpms and copy headers as well (from 3e27cef)
…pdate msccl path for ROCm5.7

(cherry picked from commit 36c10cc)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants