Cache OpenBLAS to docker image for SBSA builds #1842

tinglvv · 2024-05-30T10:59:29Z

follow up on #1833
applying it for cpu aarch64 as build is failing with

 -- Could not find OpenBLAS include. Turning OpenBLAS_FOUND off
-- Could not find OpenBLAS lib. Turning OpenBLAS_FOUND off

to reduce build time
Caching OpenBLAS to docker image.

Combining fixes for cpu-aarch64 and cuda-aarch64.

relevant issue: pytorch/pytorch#126980

cc @atalman @ptrblck @nWEIdia @Aidyn-A

atalman · 2024-05-31T17:09:28Z

hi @tinglvv looks good, but please open test PY in pytorch/pytorch to test this change with aarch64 cpu builds, to make sure we are not failing

atalman

see the comment, test pr is required

* Remove triton constraint for py312 (#1846) * Cache OpenBLAS to docker image for SBSA builds (#1842) * apply openblas cache for cpu-aarch64 * reapply for cuda-aarch64 * [MacOS] Don't build wheel while building libtorch Not sure why this was ever done twice * Allow validate doker images to be called from different workflow (#1850) * Allow validate doker images to be called from different workflow * Revert "[MacOS] Don't build wheel while building libtorch" This reverts commit d88495a. * [MacOS] Don't build libtorch twice (take 2) By not invoking `tools/build_libtorch.py` as as it's not done on Linux * [MacOs][LibTorch] Copy libomp.dylib into libtorch package * Update cudnn from v8 to v9 across CUDA versions and x86/arm (#1847) * Update cudnn to v9.1.0.70 for cuda11.8, cuda12.1, and cuda12.4 * Add CUDNN_VERSION variable * Remove 2 spaces for install_cu124 * trivial fix * Fix DEPS_LIST and DEPS_SONAME for x86 Update cudnn to v9 for arm cuda binary as well * libcudnn_adv_infer/libcudnn_adv_train becomes libcudnn_adv * Change DEPS due to cudnn v9 libraries name changes (and additions) * Fix lint * Add missing changes to cu121/cu124 * Change OpenSSL URL (#1854) * Change OpenSSL URL * Change to use openssl URL (but no longer ftp!) * Update build-manywheel-images.yml - Add a note about manylinux_2_28 state * Revert "Update cudnn from v8 to v9 across CUDA versions and x86/arm" (#1855) This reverts commit 5783bcc. * Don't run torch.compile on runtime images in docker validations (#1858) * Don't run torch.compile on runtime images * test * Don't run torch.compile on runtime images in docker validations * Update cudnn from v8 to v9 across CUDA versions and x86/arm (#1857) * Update cudnn to v9.1.0.70 for cuda11.8, cuda12.1, and cuda12.4 * Add CUDNN_VERSION variable * Remove 2 spaces for install_cu124 * trivial fix * Fix DEPS_LIST and DEPS_SONAME for x86 Update cudnn to v9 for arm cuda binary as well * libcudnn_adv_infer/libcudnn_adv_train becomes libcudnn_adv * Change DEPS due to cudnn v9 libraries name changes (and additions) * Fix lint * Add missing changes to cu121/cu124 * Fix aarch64 cuda typos * Update validate-docker-images.yml - disable runtime error check for now * Update validate-docker-images.yml - use validation_runner rather then hardcoded one * Update validate-docker-images.yml - fix MATRIX_GPU_ARCH_TYPE setting for cpu only workflows * [aarch64 cuda cudnn] Add RUNPATH to libcudnn_graph.so.9 (#1859) * Add executorch to pypi prep, promotion and validation scripts (#1860) * Add AOTriton install step for ROCm manylinux images (#1862) * Add AOTriton install step for ROCm * No common_utils.sh needed * temporary disable runtime error check * Add python 3.13 builder (#1845) --------- Co-authored-by: Ting Lu <92425201+tinglvv@users.noreply.github.com> Co-authored-by: Nikita Shulga <nikita.shulga@gmail.com> Co-authored-by: Wei Wang <143543872+nWEIdia@users.noreply.github.com> Co-authored-by: Jithun Nair <37884920+jithunnair-amd@users.noreply.github.com>

facebook-github-bot added the cla signed label May 30, 2024

tinglvv changed the title ~~apply openblas cache for cpu-aarch64~~ Apply openblas cache for cpu-aarch64 May 30, 2024

tinglvv changed the title ~~Apply openblas cache for cpu-aarch64~~ Cache OpenBLAS to docker image for SBSA builds May 30, 2024

apply openblas cache for cpu-aarch64

ad063df

tinglvv force-pushed the cpu_arm_openblas branch from 8f4287c to ad063df Compare May 30, 2024 15:50

reapply for cuda-aarch64

dedb51d

atalman requested changes May 31, 2024

View reviewed changes

atalman approved these changes May 31, 2024

View reviewed changes

atalman merged commit 3d4cd59 into pytorch:main May 31, 2024
26 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cache OpenBLAS to docker image for SBSA builds #1842

Cache OpenBLAS to docker image for SBSA builds #1842

tinglvv commented May 30, 2024 •

edited

Loading

atalman commented May 31, 2024

atalman left a comment

Cache OpenBLAS to docker image for SBSA builds #1842

Cache OpenBLAS to docker image for SBSA builds #1842

Conversation

tinglvv commented May 30, 2024 • edited Loading

atalman commented May 31, 2024

atalman left a comment

Choose a reason for hiding this comment

tinglvv commented May 30, 2024 •

edited

Loading