Skip to content

Releases: sgl-project/sglang

Release v0.1.18

04 Jul 06:35
Compare
Choose a tag to compare

Highlight

  • 2x large batch prefill improvement with the new flashinfer kernels #579
  • Multi-node tensor parallelism #550
  • New model support: ChatGLM #516

What's Changed

New Contributors

Full Changelog: v0.1.17...v0.1.18

Release v0.1.17

08 Jun 02:58
e8a2327
Compare
Choose a tag to compare

Highlights

  • Add data parallelim #480
  • Add speculative execution for OpenAI API #250
  • Update vllm to v0.4.3 for new quantization features #511
  • Better error handling (#457, #449, #514)

What's Changed

New Contributors

Full Changelog: v0.1.16...v0.1.17

v0.1.16

14 May 00:36
e0ae5d4
Compare
Choose a tag to compare

Highlight

What's Changed

New Contributors

Full Changelog: v0.1.13...v0.1.16

Release v0.1.13

11 Mar 12:52
4aa5dd2
Compare
Choose a tag to compare

Highlights

What's Changed

New Contributors

Full Changelog: v0.1.12...v0.1.13

Release v0.1.12

11 Feb 14:49
624b21e
Compare
Choose a tag to compare

Highlights

  • Fast JSON Decoding (blog)
  • Output logprobs for decoding tokens
  • Multiple bug fixes

What's Changed

New Contributors

Full Changelog: v0.1.11...v0.1.12

Release v0.1.11

03 Feb 10:57
f6bfe3a
Compare
Choose a tag to compare

Highlights

  • Serve the official release demo of LLaVA v1.6 blog
  • Support Yi-VL example
  • Faster JSON decoding blog
  • Support QWen 2

What's Changed

New Contributors

Full Changelog: v0.1.6...v0.1.11

Release v0.1.6

21 Jan 10:09
cc3ada9
Compare
Choose a tag to compare

Major features

  • Add OpenAI-compatible API server (Completion and ChatCompletion)
  • Fix sgl.select

All PRs

New Contributors

Full Changelog: v0.1.5...v0.1.6

Release v0.1.5

18 Jan 02:40
22ec7bc
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.1.3...v0.1.5