-
-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
v0.4.1 Release Tracker #4181
Comments
@simon-mo I think we should add this fix into release tracker, because it fixed a major bug in the prefix prefill kernel that would cause many model services to crash when using prefix caching, such as bloom, phi-2 series etc. By the way, when will 0.4.1 be released ? Because we want to use a stable release that fixes the prefix prefill issue. |
@DefTruth, if it is merged (which seems to be the case), it will be in the release |
Oh I missed the second question. |
Hi Simon, can we also have #3993 included? This can fix the CPU api server issue thanks |
Because the CPU distribution is currently built from source and Docker directly, and we do not release prebuilt version, we don't need to mark it release blocking. We will release every two weeks. |
@esmeetu good catch. Merged and I'll retag. |
ETA Monday April 22
The text was updated successfully, but these errors were encountered: