-
Notifications
You must be signed in to change notification settings - Fork 224
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CI] Update CI docker and suppress warnings #2333
Merged
Changes from all commits
Commits
Show all changes
23 commits
Select commit
Hold shift + click to select a range
b600234
Update CI docker to ROCm 5.7 RC2
junliume bcf4fcb
fix issue with repo
junliume c6be2b2
update docker build arch list
junliume 0f9c9e3
Update CK commit hash
junliume 3d061fe
update FIN to its latest develop
junliume f779a67
revert CK commit hash
junliume a9f8223
revert CK commit
junliume 70c50d8
suppress warning from hipRTC
junliume 0da7bc3
suppress UNSAN
junliume 83a19ed
fix the alignment issue
junliume 109d623
fix alignment issue
junliume dd72b0d
revert an alignment change to check gfx906 CI
junliume 4d1b82a
Fix potential UB in KernelArgs detected by sanitizer.
atamazov d0b7c14
Update dependencies CK and Half
junliume 6477e68
WORKAROUND_FAULTY_HIPMEMGETINFO_VEGA_NAVI2X
atamazov 2120a23
[CI] Fix: Applid WORKAROUND_ISSUE_1148 to a 'Fp16 Hip All gfx1030' st…
atamazov 93d36ee
Merge branch 'ci_rocm57' into ci_rocm57_ata1
atamazov b59403d
Avoid log clogs and find half include header
junliume 4643c44
Merge branch 'develop' into ci_rocm57_ata1
atamazov b268f40
Merge branch 'ci_rocm57' into ci_rocm57_ata1
atamazov 47d34a4
Update ROCm to 5.7 RC3
junliume dc3c013
Merge branch 'develop' into ci_rocm57
junliume 68e106e
Merge branch 'ci_rocm57_ata1' into ci_rocm57
junliume File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Submodule fin
updated
from ebf9b3 to b2f3f4
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,9 +1,9 @@ | ||
sqlite3@3.17 -DCMAKE_POSITION_INDEPENDENT_CODE=On | ||
boost@1.79 -DCMAKE_POSITION_INDEPENDENT_CODE=On --build -DCMAKE_CXX_FLAGS=" -std=c++14 -Wno-enum-constexpr-conversion " | ||
ROCmSoftwarePlatform/half@4f19ce3e56f3d3a17cf69f9db4ff3722f7445b0d --build | ||
boost@1.79 -DCMAKE_POSITION_INDEPENDENT_CODE=On --build -DCMAKE_CXX_FLAGS=" -std=c++14 -Wno-enum-constexpr-conversion -Wno-deprecated-builtins -Wno-deprecated-declarations " | ||
ROCmSoftwarePlatform/half@10abd99e7815f0ca5d892f58dd7d15a23b7cf92c --build | ||
ROCmSoftwarePlatform/rocMLIR@rocm-5.5.0 -H sha256:a5f62769d28a73e60bc8d61022820f050e97c977c8f6f6275488db31512e1f42 -DBUILD_FAT_LIBROCKCOMPILER=1 -DCMAKE_IGNORE_PATH=/opt/conda/envs/py_3.9 -DCMAKE_IGNORE_PREFIX_PATH=/opt/conda | ||
nlohmann/json@v3.9.1 -DJSON_MultipleHeaders=ON -DJSON_BuildTests=Off | ||
ROCmSoftwarePlatform/FunctionalPlus@v0.2.18-p0 | ||
ROCmSoftwarePlatform/eigen@3.4.0 | ||
ROCmSoftwarePlatform/frugally-deep@9683d557eb672ee2304f80f6682c51242d748a50 | ||
ROCmSoftwarePlatform/composable_kernel@f0831350d15c3d368d7ae321dd08441d6569086e -DDTYPES="fp16;fp32;bf16" -DCMAKE_BUILD_TYPE=Release -DINSTANCES_ONLY=ON | ||
ROCmSoftwarePlatform/composable_kernel@0629870d27397ab10a90fde6d7050f3e3d65fc2f -DDTYPES="fp16;fp32;bf16" -DCMAKE_BUILD_TYPE=Release -DINSTANCES_ONLY=ON |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This may change the layout of kernel arguments in memory. Without alignas(U), the required
buffer
alignment is 1 (i.e., no alignment is required).Therefore, the alignment of KernelArgsPair instances is also 1, and several such instances reside in memory without any gaps.With alignas(U), the padding required for KernelArgsPair is alginof(U)
, which may lead to gaps between instances of KernelArgsPair.If you see kernel failures, then please revert this change.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@atamazov I do see lots of failures on
gfx906
withHowever, it looks more likely a runtime issue since it cannot be reproduced on other newer ASICs. But I will revert this change and try again.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@junliume This is not related and I see the same on navi21.
Reverting this change won't resolve the issue with getting available memory.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@junliume It seems like this change is indeed correct and should be kept.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@atamazov can you verify it happens on Navi21?
For Vega nodes some have problems but some other nodes do not.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@junliume Of course, this happens on Navi21, as I've reported a while ago at #2307 (comment) where you can find the dirty hacks for this. I am working on more or less regular W/A which should be suitable for merging into develop.