[ROCm] include gfx908 as supported #2792

jamestwhedbee · 2024-02-06T21:03:28Z

ROCm/flash-attention supports the gfx908 architecture.

Without this change, vLLM appears to build successfully for me, but serving an LLM on an MI100 results in gibberish output.

With this change, everything works as expected.

jamestwhedbee · 2024-02-12T18:47:42Z

@zhuohan123 would you have time to review this?

jamestwhedbee · 2024-02-20T01:56:31Z

@WoosukKwon is there anything I should be doing differently to get a review here?

zhuohan123

LGTM! Thanks for the fix!

linchen111 · 2024-06-28T15:42:17Z

ROCm/flash-attention supports the gfx908 architecture.

Without this change, vLLM appears to build successfully for me, but serving an LLM on an MI100 results in gibberish output.

With this change, everything works as expected.

get this error in MI100:
RuntimeError: HIP error: invalid argument
Compile with TORCH_USE_HIP_DSA to enable device-side assertions.

include gfx908 as supported

929409c

jamestwhedbee mentioned this pull request Feb 6, 2024

[ROCm] Fixup arch checks for ROCM #2627

Merged

jamestwhedbee changed the title ~~include gfx908 as supported~~ [ROCm] include gfx908 as supported Feb 6, 2024

tjtanaa mentioned this pull request Feb 7, 2024

Installing with ROCM #621

Closed

jamestwhedbee added 2 commits February 12, 2024 09:25

Merge branch 'main' into main

f464318

remove unnecessary whitespace

dfbc232

WoosukKwon added the rocm label Feb 14, 2024

zhuohan123 approved these changes Feb 20, 2024

View reviewed changes

zhuohan123 merged commit 264017a into vllm-project:main Feb 20, 2024
17 checks passed

xjpang pushed a commit to xjpang/vllm that referenced this pull request Feb 22, 2024

[ROCm] include gfx908 as supported (vllm-project#2792)

c1179ac

andy-neuma mentioned this pull request Feb 23, 2024

andy/bump main to v0.3.2 neuralmagic/nm-vllm#49

Closed

xjpang pushed a commit to xjpang/vllm that referenced this pull request Mar 4, 2024

[ROCm] include gfx908 as supported (vllm-project#2792)

4cb28a1

brettkoonce mentioned this pull request Mar 19, 2024

rocm 5.7.1 + 7900 xtx + jax:latest docker image not working jax-ml/jax#18747

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ROCm] include gfx908 as supported #2792

[ROCm] include gfx908 as supported #2792

jamestwhedbee commented Feb 6, 2024

jamestwhedbee commented Feb 12, 2024

jamestwhedbee commented Feb 20, 2024

zhuohan123 left a comment

linchen111 commented Jun 28, 2024

[ROCm] include gfx908 as supported #2792

[ROCm] include gfx908 as supported #2792

Conversation

jamestwhedbee commented Feb 6, 2024

jamestwhedbee commented Feb 12, 2024

jamestwhedbee commented Feb 20, 2024

zhuohan123 left a comment

Choose a reason for hiding this comment

linchen111 commented Jun 28, 2024