-
Notifications
You must be signed in to change notification settings - Fork 519
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ci: skip test_data_equal
on the GPU machine
#4260
Conversation
This test crashes on the machine iZ0xih0eykcp6eddga4w5iZ with exit code 1: https://github.com/deepmodeling/deepmd-kit/actions/runs/11533273426/job/32106001782 Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>
📝 WalkthroughWalkthroughThe pull request modifies the Changes
Possibly related PRs
Suggested labels
Suggested reviewers
Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media? 🪧 TipsChatThere are 3 ways to chat with CodeRabbit:
Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments. CodeRabbit Commands (Invoked using PR comments)
Other keywords and placeholders
CodeRabbit Configuration File (
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Actionable comments posted: 1
🧹 Outside diff range and nitpick comments (1)
source/tests/consistent/io/test_io.py (1)
74-74
: Consider documenting and investigating the GPU limitationWhile skipping the test is a valid short-term solution, consider:
- Adding a comment in the test explaining why it's CPU-only
- Creating a tracking issue to investigate the root cause of GPU failures
- Adding this limitation to the testing documentation
Example comment to add above the decorator:
# This test is currently limited to CPU due to crashes on GPU machines in CI # See issue #<number> for tracking the investigation of GPU compatibilityWould you like me to help create a tracking issue for investigating the GPU compatibility of this test?
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
📒 Files selected for processing (1)
- source/tests/consistent/io/test_io.py (2 hunks)
🔇 Additional comments (1)
source/tests/consistent/io/test_io.py (1)
24-27
: LGTM: Clean import additionsThe new imports for
CI
andTEST_DEVICE
are well-organized and necessary for implementing the conditional test skipping mechanism.
It works. |
But I am still wondering what is the reason for the crash. Is it due to out-of-memory? @caic99 Are there detailed logs on that machine? |
Codecov ReportAll modified and coverable lines are covered by tests ✅
Additional details and impacted files@@ Coverage Diff @@
## devel #4260 +/- ##
=======================================
Coverage 84.23% 84.23%
=======================================
Files 549 549
Lines 51462 51463 +1
Branches 3051 3051
=======================================
+ Hits 43348 43349 +1
- Misses 7153 7154 +1
+ Partials 961 960 -1 ☔ View full report in Codecov by Sentry. |
This test crashes on the machine iZ0xih0eykcp6eddga4w5iZ with exit code 1: https://github.com/deepmodeling/deepmd-kit/actions/runs/11533273426/job/32106001782
Summary by CodeRabbit
Bug Fixes
Tests
test_data_equal
method to conditionally skip tests based on the testing device and CI status.tearDown
method to ensure proper test environment management.