minor: support flashinfer nightly #2295
ref #2179
ref https://github.com/sgl-project/sglang/actions/runs/12104687316
"""
Install the dependency in CI.
"""
# Install the dependency in CI.
Comments in Bash start with `#`; a `"""` docstring is not valid here.
./killall_sglang.sh
# Use repo from environment variable, passed from GitHub Actions
FLASHINFER_REPO="${FLASHINFER_REPO:-https://flashinfer.ai/whl/cu121/torch2.4}"
The index URL does not need to include the `flashinfer` directory.
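The fallback on the line under review relies on Bash's `${VAR:-default}` expansion. Below is a minimal sketch of that behavior, using the default URL from the diff. The helper name `resolve_repo`, the override URL, and the printed `pip` command line are assumptions for illustration, not taken from the PR.

```shell
#!/usr/bin/env bash
# Sketch only: resolve_repo is a hypothetical helper, not part of the PR.
resolve_repo() {
    # Use FLASHINFER_REPO if it is set and non-empty, otherwise the release index.
    echo "${FLASHINFER_REPO:-https://flashinfer.ai/whl/cu121/torch2.4}"
}

# Case 1: variable not provided, so the default applies.
unset FLASHINFER_REPO
default_repo="$(resolve_repo)"

# Case 2: a hypothetical override, standing in for a value passed from GitHub Actions.
override_repo="$(FLASHINFER_REPO="https://example.com/nightly"; resolve_repo)"

echo "default:  ${default_repo}"
echo "override: ${override_repo}"

# Assumed usage: the URL is the index root; pip appends the package directory itself.
echo "would run: pip install flashinfer -i ${default_repo}"
```

Because `:-` also treats an empty string as unset, a workflow that passes an empty `FLASHINFER_REPO` would still fall back to the release index.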
SCRIPT_DIR="$( cd "$( dirname "${BASH_SOURCE[0]}" )" && pwd )"
This resolves the issue of the execution script not being found.
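A minimal sketch of why this helps: a relative path such as `./killall_sglang.sh` is resolved against the caller's working directory, whereas a path built from `SCRIPT_DIR` is resolved against the script's own location. The sketch assumes `killall_sglang.sh` sits next to the install script; it only builds and checks the path and does not run anything.

```shell
#!/usr/bin/env bash
# Directory containing this script, independent of the caller's working directory.
SCRIPT_DIR="$( cd "$( dirname "${BASH_SOURCE[0]}" )" && pwd )"

# Assumption: killall_sglang.sh lives in the same directory as this script.
KILL_SCRIPT="${SCRIPT_DIR}/killall_sglang.sh"

echo "script dir:  ${SCRIPT_DIR}"
echo "kill script: ${KILL_SCRIPT}"

# Check for the sibling script before using it, rather than failing mid-run.
if [ -x "${KILL_SCRIPT}" ]; then
    echo "found ${KILL_SCRIPT}"
else
    echo "not found or not executable: ${KILL_SCRIPT}"
fi
```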
required: true
type: choice
default: 'release'
options:
Use the `choice` input type.
There is an issue with the nightly version. cc @yzh119
This PR currently provides the option to manually enable nightly flashinfer, with the default still being the release version. There is an issue with dtype inconsistency in nightly flashinfer that needs to be fixed by flashinfer. This PR is safe to merge. cc @merrymercy @yzh119
For bf16 models, the data type needs to be specified explicitly in plan functions.
Motivation
Currently, the nightly flashinfer run can only be triggered manually.
Modifications
Checklist