3D forward convolution solver with non-packed input tensors #2418
Merged
Changes from 49 commits
51 commits
6e37c43 Squash commits together (amberhassaan)
0db63b7 redo nonpack solver (iq136boy)
f9d542a add gtest for nonpacked tensor solver (iq136boy)
94f5fd3 Squash commits together (amberhassaan)
5ab82f3 fix formatting. disable strides for fp8 kernel for now (amberhassaan)
be93522 fix the lengths of weight tensor (amberhassaan)
3762672 add new kernel to test non-packed tensor (iq136boy)
603fa64 solve conflict (iq136boy)
d85785b use 64-bit integers for stride value (amberhassaan)
ee6abb3 Squash commits together (amberhassaan)
4fbcd77 fix test for non-packed strides (amberhassaan)
82e0ccf fix format (amberhassaan)
7e8a258 Fix assertion check. (amberhassaan)
cadfb95 suppress cppcheck warning to test CI (junliume)
cde6e22 fix build and remove a check that prevents non-strided inputs (amberhassaan)
9c371eb merge pr2334 (iq136boy)
f7b606b Merge remote-tracking branch 'origin/develop' into amber/non-packed-c… (amberhassaan)
0ad674b Merge branch 'amber/non-packed-conv-ref-kern' into amber/tests-non-pa… (amberhassaan)
01b26cd all gtest passed locally (iq136boy)
f633030 minor fix gtest (iq136boy)
33e251d clean debug info (iq136boy)
4db6cf8 Merge remote-tracking branch 'origin/develop' into amber/non-packed-c… (amberhassaan)
8af6d47 Merge branch 'amber/non-packed-conv-ref-kern' into amber/tests-non-pa… (amberhassaan)
b95806f resolve conflict (iq136boy)
85f415c Merge branch 'develop' into dfeng/ck_nonpack_conv3d_fwd (iq136boy)
ec56121 bug fix after merge develop (iq136boy)
e06c523 addressed comments. Moved common code into an include file (amberhassaan)
23d0066 Merge remote-tracking branch 'origin/develop' into amber/non-packed-c… (amberhassaan)
35c9072 Merge branch 'amber/non-packed-conv-ref-kern' into amber/tests-non-pa… (amberhassaan)
67d9a77 address comments (amberhassaan)
0f16c62 address review comments (amberhassaan)
c66da71 Merge remote-tracking branch 'origin/develop' into amber/non-packed-c… (amberhassaan)
2f9867d combine 3d fwd packed and non-packed solvers (iq136boy)
8bb3a7f Merge branch 'amber/non-packed-conv-ref-kern' into amber/tests-non-pa… (amberhassaan)
dec88a7 add more checks for strides (amberhassaan)
1c45049 disable 3d wrw solver on gfx900 (iq136boy)
700b623 Merge remote-tracking branch 'origin/develop' into amber/non-packed-c… (amberhassaan)
1253aed Merge branch 'amber/non-packed-conv-ref-kern' into amber/tests-non-pa… (amberhassaan)
bdfb4d2 Merge branch 'amber/tests-non-packed-conv' into dfeng/ck_nonpack_conv… (amberhassaan)
85e8a62 fix test now that strides are supported (amberhassaan)
e2d7f1d Merge branch 'amber/tests-non-packed-conv' into dfeng/ck_nonpack_conv… (amberhassaan)
39eee97 use C++17 to compile HIP Kernels (amberhassaan)
f60f182 Merge remote-tracking branch 'origin/develop' into amber/non-packed-c… (amberhassaan)
e7c9d03 Merge branch 'amber/non-packed-conv-ref-kern' into amber/tests-non-pa… (amberhassaan)
abac92d Merge branch 'amber/tests-non-packed-conv' into dfeng/ck_nonpack_conv… (amberhassaan)
b20ae12 Merge branch 'develop' into dfeng/ck_nonpack_conv3d_fwd (junliume)
d0f57da bug fix after merge develop (iq136boy)
936282c merge develop to resolve conflict (iq136boy)
8091d1c address comments (iq136boy)
fbbaebc Merge branch 'develop' into dfeng/ck_nonpack_conv3d_fwd (junliume)
8c49fc2 resolve merge issue (junliume)
This is just a recommendation: place `StartsWith(device_name, "gfx900") || StartsWith(device_name, "gfx906")` in a separate bool function. That way we don't need to pass `is_wrw`.
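A rough sketch of what that could look like, in case it helps. The function name `IsGfx900OrGfx906` is made up for illustration, and `StartsWith` here is a local stand-in, since the real helper's signature isn't shown in this thread:

```cpp
#include <string>

// Local stand-in for the StartsWith helper referenced above (assumption:
// it returns true when `str` begins with `prefix`).
inline bool StartsWith(const std::string& str, const std::string& prefix)
{
    return str.compare(0, prefix.size(), prefix) == 0;
}

// Hypothetical helper name: true for the two device families named in
// the recommendation, so callers can query it directly.
inline bool IsGfx900OrGfx906(const std::string& device_name)
{
    return StartsWith(device_name, "gfx900") || StartsWith(device_name, "gfx906");
}
```

A caller that already knows it is on the wrw path could then check `IsGfx900OrGfx906(device_name)` itself, rather than threading `is_wrw` through a shared function.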