ci: bump TF to 2.18, PT to 2.5 #4228

njzjz · 2024-10-17T20:57:07Z

This is prepared for the upcoming TF 2.18, which needs CUDNN 9. In the future, I may move all pinnings into pyproject.toml...

Summary by CodeRabbit

New Features
- Enhanced dependency management for CUDA and Python workflows.
- Introduced new jobs for better organization of test duration handling.
Bug Fixes
- Updated TensorFlow and Torch versions for improved compatibility and performance.
- Refined version requirements for TensorFlow based on detected CUDA versions.
Documentation
- Adjusted testing commands and linting configurations for clarity and compliance.
Chores
- Streamlined caching mechanisms to optimize test duration tracking.

Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>

coderabbitai · 2024-10-17T21:01:13Z

📝 Walkthrough

Walkthrough

The pull request updates the workflow configurations for testing CUDA and Python in the .github/workflows/test_cuda.yml and .github/workflows/test_python.yml files, respectively. The CUDA workflow changes the Docker image to nvidia/cuda:12.6.0-devel-ubuntu24.04, modifies TensorFlow and Torch versions, updates the libtorch download link, and skips a CUDA installation command. The Python workflow enhances dependency management, introduces new jobs, and simplifies installation commands. Additionally, the pyproject.toml file updates dependency versions, testing commands, and linting configurations.

Changes

File	Change Summary
`.github/workflows/test_cuda.yml`	Updated Docker image to `nvidia/cuda:12.6.0-devel-ubuntu24.04`, modified TensorFlow and Torch versions, updated libtorch download link, and skipped CUDA 12.3 installation command.
`.github/workflows/test_python.yml`	Enhanced installation process, ignored branches matching "gh-readonly-queue/**", added concurrency settings, simplified torch installation, introduced new jobs for test duration management.
`pyproject.toml`	Removed `PYTORCH_VERSION`, updated `nvidia-cudnn-cu12` dependency, specified test commands for Linux and Windows, and adjusted linting configurations.
`backend/find_pytorch.py`	Enhanced `get_pt_requirement` function to determine PyTorch version based on `CUDA_VERSION`.
`backend/find_tensorflow.py`	Updated TensorFlow version requirements based on detected CUDA version and refined conditional logic.

Possibly related PRs

fix: unpin h5py on aarch64 #4176: The changes in backend/find_tensorflow.py update the TensorFlow version requirements based on the detected CUDA version, which is directly related to the updates made in the main PR regarding TensorFlow version changes in the workflow configuration.

Suggested reviewers

wanghan-iapcm

📜 Recent review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Files that changed from the base of the PR and between 363ace6 and 8a8969a.

📒 Files selected for processing (1)

backend/find_pytorch.py (3 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

backend/find_pytorch.py

coderabbitai

Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (3)

.github/workflows/test_cuda.yml (3)
50-50: LGTM: TensorFlow and PyTorch versions updated

The update to TensorFlow 2.18.0rc2 and PyTorch 2.5.0 aligns with the PR objectives. The use of ~= for version specification is a good practice.

Consider using more specific version ranges to avoid potential issues with future releases:
-    - run: source/install/uv_with_retry.sh pip install --system "tensorflow~=2.18.0rc2" "torch~=2.5.0"
+    - run: source/install/uv_with_retry.sh pip install --system "tensorflow~=2.18.0,<2.19.0" "torch~=2.5.0,<2.6.0"
This allows patch releases while preventing automatic updates to the next minor version.
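For reference, PEP 440 defines `~=` as shorthand for a lower bound plus a prefix match, so `~=2.18.0rc2` already excludes 2.19. A minimal sketch of that expansion in plain Python follows; `expand_compatible` is a hypothetical helper written for illustration, not part of pip, uv, or this repository, and it only handles simple dotted versions with an optional suffix such as `rc2`.

```python
import re


def expand_compatible(version: str) -> str:
    """Expand the PEP 440 clause ``~=version`` into its two-clause equivalent.

    Hypothetical helper for illustration; supports plain dotted release
    numbers with an optional pre/post/dev suffix and no epoch.
    """
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        raise ValueError(f"not a version: {version!r}")
    release = match.group(0).split(".")
    if len(release) < 2:
        # PEP 440 does not allow ~= with a single-segment version such as "~=2".
        raise ValueError("~= needs at least two release segments")
    # Drop the last release segment and match everything under that prefix.
    prefix = ".".join(release[:-1])
    return f">={version}, =={prefix}.*"


print(expand_compatible("2.18.0rc2"))  # >=2.18.0rc2, ==2.18.*
print(expand_compatible("2.5.0"))      # >=2.5.0, ==2.5.*
```

Read this way, an explicit `<2.19.0` or `<2.6.0` upper bound next to `~=` is redundant rather than harmful.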

Line range hint 41-47: Consider removing the disabled CUDA installation step

The CUDA installation step has been correctly disabled as it's no longer needed with the NVIDIA Docker image. This is a good change.

For code cleanliness, consider removing this entire block instead of keeping it disabled with `if: false`. If you want to preserve this information for future reference, consider moving it to a separate documentation file or adding it as a comment in the workflow file's header.

You can remove these lines:
-    - run: |
-         wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64/cuda-keyring_1.0-1_all.deb \
-         && sudo dpkg -i cuda-keyring_1.0-1_all.deb \
-         && sudo apt-get update \
-         && sudo apt-get -y install cuda-12-3 libcudnn8=8.9.5.*-1+cuda12.3
-      if: false  # skip as we use nvidia image
🧰 Tools

🪛 actionlint

51-51: shellcheck reported issue in this script: SC2155:warning:1:8: Declare and assign separately to avoid masking return values

(shellcheck)

51-51: shellcheck reported issue in this script: SC2155:warning:2:8: Declare and assign separately to avoid masking return values

(shellcheck)

51-51: shellcheck reported issue in this script: SC2102:info:3:61: Ranges can only match single chars (mentioned due to duplicates)

(shellcheck)

Line range hint 1-105: Summary of CUDA workflow update

The changes in this PR successfully update the CUDA testing workflow to use CUDA 12.6, TensorFlow 2.18, and PyTorch 2.5, aligning with the PR objectives. Here's a summary of the main points:

The Docker image has been updated to use CUDA 12.6 and Ubuntu 24.04.

TensorFlow and PyTorch versions have been updated as intended.

The libtorch download link has been updated, but there's a potential CUDA version mismatch to verify.

The redundant CUDA installation step has been disabled.

Please address the following points:

Verify compatibility with Ubuntu 24.04 for all dependencies and scripts.

Consider using more specific version ranges for TensorFlow and PyTorch.

Check and update the libtorch download link to ensure CUDA 12.6 compatibility.

Remove the disabled CUDA installation step for code cleanliness.

Once these points are addressed, the PR will be ready for merge.

🧰 Tools

🪛 actionlint

19-19: label "nvidia" is unknown. available labels are "windows-latest", "windows-latest-8-cores", "windows-2022", "windows-2019", "ubuntu-latest", "ubuntu-latest-4-cores", "ubuntu-latest-8-cores", "ubuntu-latest-16-cores", "ubuntu-24.04", "ubuntu-22.04", "ubuntu-20.04", "macos-latest", "macos-latest-xl", "macos-latest-xlarge", "macos-latest-large", "macos-14-xl", "macos-14-xlarge", "macos-14-large", "macos-14", "macos-14.0", "macos-13-xl", "macos-13-xlarge", "macos-13-large", "macos-13", "macos-13.0", "macos-12-xl", "macos-12-xlarge", "macos-12-large", "macos-12", "macos-12.0", "macos-11", "macos-11.0", "self-hosted", "x64", "arm", "arm64", "linux", "macos", "windows". if it is a custom label for self-hosted runner, set list of labels in actionlint.yaml config file

(runner-label)

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Files that changed from the base of the PR and between cc4b23d and f1e974e.

📒 Files selected for processing (1)

.github/workflows/test_cuda.yml (3 hunks)

🧰 Additional context used

🔇 Additional comments (2)

.github/workflows/test_cuda.yml (2)

22-22: LGTM: Docker image updated to CUDA 12.6

The update to nvidia/cuda:12.6-devel-ubuntu24.04 aligns with the PR objective of bumping CUDA to 12.6. This change also includes an upgrade to Ubuntu 24.04.

Please ensure that all dependencies and scripts are compatible with Ubuntu 24.04.

66-66: Verify CUDA compatibility for libtorch

The libtorch download link has been updated to version 2.5.0, which is consistent with the PyTorch version installed earlier. However, there's a potential version mismatch:

The Docker image uses CUDA 12.6

The libtorch download is for CUDA 12.4

Please verify if libtorch 2.5.0 with CUDA 12.4 is compatible with CUDA 12.6, or if there's a more appropriate version available. You can check the PyTorch website for the latest compatible versions.

If a CUDA 12.6 compatible version is available, consider updating the download link.

coderabbitai

Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (2)

.github/workflows/test_python.yml (1)
29-29: Simplified torch installation looks good, but consider version pinning.

The change to simplify the torch installation and ensure the CPU version is installed is a good improvement. It aligns with the PR objective of updating dependencies.

However, consider the following suggestions:

To ensure reproducibility across different environments and times, it might be beneficial to pin the torch version. This prevents potential issues from unexpected changes in newer versions.

Verify that this version of torch is compatible with the updated TensorFlow version (2.18 as per PR objectives) to avoid any integration issues.

Consider modifying the line to include a version pin:
-        source/install/uv_with_retry.sh pip install --system torch -i https://download.pytorch.org/whl/cpu
+        source/install/uv_with_retry.sh pip install --system torch==2.5.* -i https://download.pytorch.org/whl/cpu
This ensures you're using the latest patch version of PyTorch 2.5, which aligns with the PR objective of bumping to PT 2.5.
pyproject.toml (1)

279-280: Overall assessment of version updates

The changes to TENSORFLOW_VERSION and PYTORCH_VERSION align with the PR objectives to bump CUDA, TensorFlow, and PyTorch versions. These updates are significant and may have far-reaching effects on the project.

To ensure a smooth transition to these new versions:

Conduct thorough testing across the entire codebase, paying special attention to TensorFlow and PyTorch specific functionalities.

Update any documentation or README files to reflect the new version requirements.

Consider creating a migration guide for users of the project, highlighting any breaking changes or new features that may affect their usage.

Monitor the CI/CD pipeline closely after these changes to catch any unforeseen issues early.

Plan for a potential rollback strategy in case critical issues are discovered post-merge.

These version bumps represent a significant change. While they bring new features and improvements, they also introduce the risk of compatibility issues. Careful testing and monitoring will be crucial for a successful integration.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Files that changed from the base of the PR and between f1e974e and 6aca89a.

📒 Files selected for processing (2)

.github/workflows/test_python.yml (1 hunks)
pyproject.toml (1 hunks)

🧰 Additional context used

🔇 Additional comments (3)

.github/workflows/test_python.yml (1)

Line range hint 1-95: Overall workflow improvements look good.

The changes to this workflow file align well with the PR objectives and introduce several improvements:

The torch installation has been simplified and ensures the CPU version is used.

The workflow has been updated to ignore certain branches and include merge group events, which can help with CI efficiency.

Concurrency settings have been refined to better manage parallel runs.

New jobs for updating test durations have been added, which can help with better test distribution and performance tracking.

These changes should lead to a more efficient and maintainable CI process. Good job on the improvements!

🧰 Tools

🪛 actionlint

27-27: shellcheck reported issue in this script: SC2155:warning:3:8: Declare and assign separately to avoid masking return values

(shellcheck)

27-27: shellcheck reported issue in this script: SC2102:info:4:80: Ranges can only match single chars (mentioned due to duplicates)

(shellcheck)

27-27: shellcheck reported issue in this script: SC2102:info:4:102: Ranges can only match single chars (mentioned due to duplicates)

(shellcheck)

pyproject.toml (2)

279-279: Verify compatibility with TensorFlow 2.18.0rc2

The addition of TENSORFLOW_VERSION = "2.18.0rc2" aligns with the PR objective to bump TensorFlow to version 2.18. However, using a release candidate version in a CI/CD pipeline could potentially lead to instability or unexpected behavior.

Please ensure that:

The codebase is compatible with TensorFlow 2.18.0rc2.

Any breaking changes introduced in this version are addressed.

The CI pipeline successfully runs with this version.

Consider adding a TODO comment to update to the stable 2.18.0 release when it becomes available.
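One way to make that reminder enforceable is a small check that flags pre-release pins. The following is a minimal sketch, assuming version strings use the normalized PEP 440 spellings (`a`, `b`, `rc`, `.dev`); `is_prerelease` is a hypothetical helper, not something this PR adds.

```python
import re

# Release number followed directly by a pre-release or dev marker,
# e.g. "2.18.0rc2" or "2.18.0.dev1". Post-releases are not matched.
_PRERELEASE = re.compile(r"^\d+(\.\d+)*((a|b|rc)\d+|\.dev\d+)")


def is_prerelease(version: str) -> bool:
    """Return True if the version string looks like a pre-release."""
    return _PRERELEASE.match(version) is not None


print(is_prerelease("2.18.0rc2"))  # True
print(is_prerelease("2.5.0"))      # False
```

A CI step could call such a check on the pinned versions and emit a warning until the stable release is available.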

280-280: Verify compatibility with PyTorch 2.5.0

The update of PYTORCH_VERSION from "2.3.1" to "2.5.0" is a significant version bump, which aligns with the PR objective. This update may introduce new features and potentially breaking changes.

Please ensure that:

The codebase is compatible with PyTorch 2.5.0.

Any breaking changes or deprecations between versions 2.3.1 and 2.5.0 are addressed.

The CI pipeline successfully runs with this version.

Performance improvements or new features in PyTorch 2.5.0 are leveraged where applicable.

codecov · 2024-10-17T21:47:50Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 84.20%. Comparing base (0f817e1) to head (8a8969a).
Report is 173 commits behind head on devel.

Additional details and impacted files

@@            Coverage Diff             @@
##            devel    #4228      +/-   ##
==========================================
- Coverage   84.22%   84.20%   -0.02%     
==========================================
  Files         547      547              
  Lines       51334    51334              
  Branches     3051     3051              
==========================================
- Hits        43235    43226       -9     
  Misses       7147     7147              
- Partials      952      961       +9
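The percentages in the table can be reproduced from the raw counts. The sketch below assumes coverage is hits divided by all tracked lines (hits + misses + partials) and that the result is truncated, not rounded, to two decimals, which is what the figures shown here imply; `coverage_percent` is a hypothetical helper.

```python
def coverage_percent(hits: int, misses: int, partials: int) -> float:
    """Coverage in percent, truncated to two decimal places."""
    total = hits + misses + partials
    # Integer arithmetic: hundredths of a percent, rounded down.
    return (hits * 10_000 // total) / 100


base = coverage_percent(43235, 7147, 952)
head = coverage_percent(43226, 7147, 961)
print(base, head)  # 84.22 84.2
```

The nine lines that moved from Hits to Partials account for the whole 0.02-point drop; with ordinary rounding the head value would have been reported as 84.21%.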

Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>

coderabbitai

Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (1)

pyproject.toml (1)

Line range hint 1-580: Summary of changes and potential impact

The main changes in this PR involve updating the versions of key dependencies:

TensorFlow updated to 2.18.0rc2 (release candidate)

PyTorch updated to 2.5.0

CUDNN constraint removed, allowing for newer versions

These updates align with the PR objectives and may bring performance improvements and new features. However, they also introduce potential risks:

Using a TensorFlow release candidate in production

Possible breaking changes with the significant PyTorch version jump

Potential compatibility issues with unconstrained CUDNN versions

To mitigate these risks and ensure a smooth transition:

Implement comprehensive integration tests covering critical functionality with these new versions.

Consider a phased rollout or maintaining a fallback option to quickly revert if issues arise.

Monitor performance metrics closely after deployment to identify any regressions or improvements.

Update documentation to reflect any changes in API or behavior resulting from these version updates.

These steps will help maintain the stability and reliability of the project while benefiting from the latest improvements in the underlying libraries.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Files that changed from the base of the PR and between d3fe74a and 8b895ea.

📒 Files selected for processing (1)

pyproject.toml (2 hunks)

🧰 Additional context used

🔇 Additional comments (2)

pyproject.toml (2)
135-135: Consider the implications of removing CUDNN version constraint

The removal of the version constraint for nvidia-cudnn-cu12 (from <9 to no constraint) allows for the use of newer CUDNN versions. This change aligns with the CUDA update mentioned in the PR title. However, consider the following:

Benefit: This change enables the use of the latest CUDNN optimizations and features.

Risk: Removing the upper bound may lead to compatibility issues with future, untested CUDNN versions.

To ensure compatibility, run the following verification script:
#!/bin/bash
# Verify CUDNN version and compatibility
python3 << END
import torch
print(f"CUDNN version: {torch.backends.cudnn.version()}")
# Add any CUDNN-specific functionality tests here
END
This script will help confirm that the CUDNN version is correctly detected and basic functionality is maintained.

279-280: Verify compatibility with updated TensorFlow and PyTorch versions

The changes update the TensorFlow version to a release candidate (2.18.0rc2) and PyTorch to a newer stable version (2.5.0). While these updates align with the PR objectives, consider the following:

Using a TensorFlow release candidate (2.18.0rc2) in a production environment may introduce instability. Ensure thorough testing is performed, especially for critical functionality.

The PyTorch update to 2.5.0 is a significant version jump. While it likely brings performance improvements and new features, it may also introduce breaking changes.

To ensure compatibility, verify that the new versions are correctly installed and that basic functionality is maintained.

Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>

coderabbitai

Actionable comments posted: 2

🧹 Outside diff range and nitpick comments (4)

backend/find_pytorch.py (2)
111-111: Add validation for CUDA_VERSION format

The CUDA version is retrieved without format validation. Consider adding version format validation to prevent issues with malformed version strings.
-        cuda_version = os.environ.get("CUDA_VERSION", "12.2")
+        cuda_version = os.environ.get("CUDA_VERSION", "12.2")
+        # Validate CUDA version format
+        try:
+            Version(cuda_version)
+        except ValueError:
+            raise RuntimeError(f"Invalid CUDA version format: {cuda_version}")
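The same guard can be sketched without third-party imports. This is a minimal illustration, assuming `CUDA_VERSION` is expected to look like `12.2` or `12.2.0`; `parse_cuda_version` is a hypothetical name, and the regular expression is deliberately stricter than `packaging.version.Version`, which accepts any PEP 440 string.

```python
import re
from typing import Mapping, Tuple


def parse_cuda_version(env: Mapping[str, str], default: str = "12.2") -> Tuple[int, ...]:
    """Read CUDA_VERSION from a mapping and return it as a tuple of ints."""
    raw = env.get("CUDA_VERSION", default)
    # Accept only MAJOR.MINOR or MAJOR.MINOR.PATCH made of digits.
    if not re.fullmatch(r"\d+(\.\d+){1,2}", raw):
        raise RuntimeError(f"Invalid CUDA version format: {raw}")
    return tuple(int(part) for part in raw.split("."))


print(parse_cuda_version({}))                        # (12, 2)
print(parse_cuda_version({"CUDA_VERSION": "11.8"}))  # (11, 8)
```

In real use the mapping would be `os.environ`; passing it in as an argument keeps the function easy to test.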
115-117: Document CUDA-PyTorch compatibility matrix

The version mapping between CUDA and PyTorch versions should be documented to make maintenance easier.

Add a docstring section explaining the compatibility matrix:
def get_pt_requirement(pt_version: str = "") -> dict:
    """Get PyTorch requirement when PT is not installed.
    
    ...

    CUDA-PyTorch Compatibility Matrix:
    - CUDA 12.x (up to 12.6) -> PyTorch 2.5.0
    - CUDA 11.x -> PyTorch 2.3.1
    """
backend/find_tensorflow.py (1)
Line range hint 95-101: LGTM! Consider documenting version compatibility.

The version constraints for CUDA 11.8 are well-defined. The explicit version 2.14.1 provides a stable fallback.

Consider adding a comment explaining why TensorFlow 2.14.1 was chosen as the specific version for CUDA 11.8 compatibility.
                requires.extend(
                    [
                        "tensorflow-cpu>=2.5.0rc0,<2.15; platform_machine=='x86_64' and platform_system == 'Linux'",
                    ]
                )
+               # TF 2.14.1 is the last known stable version for CUDA 11.8
                tf_version = "2.14.1"
pyproject.toml (1)

135-135: LGTM! Consider documenting tested versions.

The removal of the version constraint for nvidia-cudnn-cu12 aligns with the PR objective to bump CUDA support. While allowing any version provides flexibility, consider documenting the tested version range in the project documentation to help users troubleshoot compatibility issues.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Files that changed from the base of the PR and between f116ebb and 363ace6.

📒 Files selected for processing (3)

backend/find_pytorch.py (2 hunks)
backend/find_tensorflow.py (1 hunks)
pyproject.toml (1 hunks)

🔇 Additional comments (3)

backend/find_pytorch.py (2)
25-27: LGTM: Clean import addition

The addition of SpecifierSet import is appropriate for version range comparisons.

110-119: Verify CUDA version compatibility

Let's verify that these CUDA version mappings are consistent with PyTorch's official compatibility matrix.

✅ Verification successful

Let me try to search for any configuration or build files that might contain PyTorch version and CUDA compatibility information.

Let me check the specific file where the CUDA version mapping is defined to understand its context better.

After analyzing the code and context, I can now provide a final response:

CUDA version mappings are correctly aligned with PyTorch releases

The CUDA version mappings in the code are accurate according to PyTorch's official releases:

PyTorch 2.5.0 is the correct version for CUDA 12.x (specifically 12.2)

PyTorch 2.3.1 is the correct version for CUDA 11.x (specifically 11.8)

This is a build-time configuration used only during wheel building (CIBUILDWHEEL=1) to ensure correct PyTorch versions are matched with CUDA versions. The code properly handles the version selection and includes appropriate error handling for unsupported CUDA versions.
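The mapping described above can be sketched as a small lookup. This is an illustration under stated assumptions, not the body of `get_pt_requirement`: `pick_torch_version` is a hypothetical helper, the two version pairs are the ones quoted in this review, and any other CUDA major version is treated as unsupported.

```python
def pick_torch_version(cuda_version: str) -> str:
    """Map a CUDA version string to the PyTorch version quoted in this review."""
    # Only the major version decides the mapping in this sketch.
    major = cuda_version.split(".")[0]
    mapping = {"12": "2.5.0", "11": "2.3.1"}
    try:
        return mapping[major]
    except KeyError:
        raise RuntimeError(f"Unsupported CUDA version: {cuda_version}") from None


print(pick_torch_version("12.2"))  # 2.5.0
print(pick_torch_version("11.8"))  # 2.3.1
```

Keeping the pairs in one dictionary means a later bump touches a single line.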
🏁 Scripts executed

The following scripts were executed for the analysis:

Script:
#!/bin/bash
# Search for any PyTorch compatibility documentation in the repository
rg -i "cuda.*compatibility|pytorch.*cuda" README.md docs/
Length of output: 105

Script:
#!/bin/bash
# Search for PyTorch and CUDA version references in configuration and build files
fd -e toml -e txt -e ini -e cfg -e yaml -e yml | xargs rg -i "pytorch|cuda"

# Also search in setup files
fd "setup\.py|requirements\.txt" | xargs rg -i "pytorch|cuda"
Length of output: 8715

Script:
#!/bin/bash
# Look at the entire find_pytorch.py file to understand the context
cat backend/find_pytorch.py

# Also search for any test files related to PyTorch version detection
fd -e py | xargs rg -l "test.*find_pytorch|test.*cuda.*version"
Length of output: 4683
backend/find_tensorflow.py (1)

88-91: LGTM! Verify TensorFlow 2.18.0rc0 compatibility with CUDA 12.2.

The version bump aligns with the PR objectives. However, let's verify that TensorFlow 2.18.0rc0 is officially supported with CUDA 12.2.

backend/find_pytorch.py

Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>

ci(cuda): bump CUDA to 12.6, TF to 2.18, PT to 2.5

f1e974e

Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>

njzjz added the "Test CUDA" label (Trigger test CUDA workflow) Oct 17, 2024

github-actions bot removed the "Test CUDA" label (Trigger test CUDA workflow) Oct 17, 2024

bump more versions

6aca89a

Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>

njzjz changed the title ~~ci(cuda): bump CUDA to 12.6, TF to 2.18, PT to 2.5~~ ci: bump CUDA to 12.6, TF to 2.18, PT to 2.5 Oct 17, 2024

coderabbitai bot reviewed Oct 17, 2024

njzjz added the "Test CUDA" label (Trigger test CUDA workflow) Oct 17, 2024

github-actions bot removed the "Test CUDA" label (Trigger test CUDA workflow) Oct 17, 2024

12.6.0

d3fe74a

Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>

njzjz added the "Test CUDA" label (Trigger test CUDA workflow) Oct 17, 2024

github-actions bot removed the "Test CUDA" label (Trigger test CUDA workflow) Oct 17, 2024

cudnn

8b895ea

Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>

njzjz added the "Test CUDA" label (Trigger test CUDA workflow) Oct 18, 2024

coderabbitai bot reviewed Oct 18, 2024

github-actions bot removed the "Test CUDA" label (Trigger test CUDA workflow) Oct 18, 2024

revert changes to the CUDA version

f116ebb

Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>

njzjz changed the title ~~ci: bump CUDA to 12.6, TF to 2.18, PT to 2.5~~ ci: bump TF to 2.18, PT to 2.5 Oct 18, 2024

njzjz added 2 commits October 24, 2024 03:02

Merge branch 'devel' into ci-test

106cb22

we need to have different PT versions for cuda 11 and 12

363ace6

Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>

coderabbitai bot reviewed Oct 24, 2024

only apply linux_x86_64

8a8969a

Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>

njzjz marked this pull request as ready for review October 24, 2024 07:37

njzjz added the "Test CUDA" label (Trigger test CUDA workflow) Oct 24, 2024

github-actions bot removed the "Test CUDA" label (Trigger test CUDA workflow) Oct 24, 2024

njzjz requested a review from wanghan-iapcm October 24, 2024 08:05

wanghan-iapcm approved these changes Oct 24, 2024

wanghan-iapcm enabled auto-merge October 24, 2024 08:45

wanghan-iapcm added this pull request to the merge queue Oct 24, 2024

Merged via the queue into deepmodeling:devel with commit 02580c2 Oct 24, 2024
63 checks passed

coderabbitai bot mentioned this pull request Oct 25, 2024

ci: bump tensorflow/build image to 2.18 #4252

Merged

This was referenced Nov 15, 2024

[r2] cherry-pick to fix workflows #4367

Merged

breaking(pt): drop PyTorch 2.0 support #4383

Merged

coderabbitai bot mentioned this pull request Nov 28, 2024

pd: skip certain UT and fix paddle ver in in test_cuda.yml #4439

Merged

coderabbitai bot mentioned this pull request Jan 31, 2025

chore: bump pytorch to 2.6.0 #4575

Merged
