Skip to content

Some question about GTC2020 cutlass's talk about conflict free load shared memory #1130

Answered by hwu36
MARD1NO asked this question in Q&A
Discussion options

You must be logged in to vote

thread0 need thread T0,8,16,24's shared memory data to load in registers

correct

cutlass has a special shared memory store layout.

correct.

slides 45-48 just deep dive into an example to show why the special layout can avoid bank conflicts when loading from the shared memory to the registers.

Replies: 13 comments 11 replies

Comment options

You must be logged in to vote
3 replies
@MARD1NO
Comment options

@MARD1NO
Comment options

@hwu36
Comment options

Answer selected by MARD1NO
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
1 reply
@hwu36
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
7 replies
@Ther-LF
Comment options

@hwu36
Comment options

hwu36 Dec 4, 2024
Maintainer

@linuxlonelyeagle
Comment options

@hwu36
Comment options

hwu36 Dec 6, 2024
Maintainer

@linuxlonelyeagle
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
7 participants