[v0.3.3] Release Tracker #3097

WoosukKwon · 2024-02-28T22:36:29Z

ETA: Feb 29th - Mar 1st

Major changes

simon-mo · 2024-02-28T22:49:52Z

#2819
#3087
Starcoder2

njhill · 2024-02-29T03:51:32Z

#3099 is a fix for the #3087 regression

hanzhi713 · 2024-02-29T06:13:18Z

#2760 Fixes for custom all reduce on some platforms

robertgshaw2-redhat · 2024-02-29T22:42:23Z

#2497 adds support for Marlin INT4 kernels, which are ~3x faster than the current GPTQ kernels

HyperdriveHustle · 2024-03-01T05:33:06Z

#3016
Fix: Output text is always truncated in some models

WoosukKwon added the release label Feb 28, 2024

WoosukKwon mentioned this issue Mar 1, 2024

Bump up to v0.3.3 #3129

Merged

WoosukKwon closed this as completed in #3129 Mar 1, 2024

Xu-Chen mentioned this issue Mar 7, 2024

[v0.4.0] Release Tracker #3155

Closed
