[3rdparty, document] Updated documentation for triton fused_moe kernel tuning for AMD Instinct GPUs #2191
Thanks for contributing this great tuning script! Could we use this script to tune on other devices? Any general steps for adaptation?
Yes, it can be used on other devices, but some parameters are ROCm-only and need to be removed when running on other platforms. These two parameters only exist on ROCm
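As a rough illustration of the adaptation step described in the reply above, the sketch below drops ROCm-only entries from a tuning config before it is used on another platform. The helper `strip_rocm_only` and the constant `ROCM_ONLY_KEYS` are hypothetical and not part of the tuning script. The two key names, `waves_per_eu` and `matrix_instr_nonkdim`, are AMD-backend Triton options, but treating them as the two parameters meant in the reply is an assumption, since the reply does not name them here.

```python
# Hypothetical helper, not taken from the tuning script in this PR.
# ASSUMPTION: these two keys stand in for the ROCm-only parameters
# mentioned in the reply; the reply itself does not list them.
ROCM_ONLY_KEYS = frozenset({"waves_per_eu", "matrix_instr_nonkdim"})


def strip_rocm_only(config: dict, is_rocm: bool) -> dict:
    """Return a copy of `config`, without ROCm-only keys when not on ROCm."""
    if is_rocm:
        return dict(config)
    return {k: v for k, v in config.items() if k not in ROCM_ONLY_KEYS}


# Illustrative config values only; these are not tuned results.
example = {
    "BLOCK_SIZE_M": 64,
    "BLOCK_SIZE_N": 128,
    "BLOCK_SIZE_K": 64,
    "GROUP_SIZE_M": 8,
    "num_warps": 4,
    "num_stages": 2,
    "waves_per_eu": 2,
    "matrix_instr_nonkdim": 16,
}

portable = strip_rocm_only(example, is_rocm=False)
print(sorted(portable))
```

In a PyTorch environment, one way to derive `is_rocm` would be `torch.version.hip is not None`; whether the tuning script detects the platform this way is not stated in this thread.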
@ispobock Later we may come up with a generic script for more devices, once we have collected the device-specific Triton kwargs extensions.
from tqdm import tqdm
from transformers import AutoConfig

from sglang.srt.layers.fused_moe_grok.fused_moe import fused_moe, get_config_file_name
Hold on, fused_moe_grok will be removed soon.
ref #2223
ref #2225
Thanks!
Thanks @kkHuang-amd @HaiShaw May you help update the doc from
@zhyncs certainly, also notified @merrymercy for #2223
Motivation
Updated Documentation for triton fused_moe kernel tuning for AMD Instinct GPUs.
Modifications
Checklist