Releases: kserve/kserve

v0.13.1

28 Jul 17:22
e7d9ac8

What's Changed

Full Changelog: v0.13.0...v0.13.1

v0.13.0

05 Jun 13:38
1c51eee

🌈 What's New?

⚠️ What's Changed

🐛 What's Fixed

⬆️ Version Upgrade

🔨 Project SDLC

v0.13.0-rc1

21 May 09:58
Pre-release

What's Changed

New Contributors

Full Changelog: v0.13.0-rc0...v0.13.0-rc1

v0.13.0-rc0

07 May 10:11
bfc2e21
Pre-release

🌈 What's New?

  • add support for async streaming in predict by @alexagriffith in #3475
  • Fix: Support model parallelism in HF transformer by @gavrishp in #3459
  • Support model revision and tokenizer revision in huggingface server by @lizzzcai in #3558
  • OpenAI schema by @tessapham in #3477
  • Support OpenAIModel in ModelRepository by @grandbora in #3590
  • updated xgboost to support json and ubj models by @andyi2it in #3551
  • Add OpenAI API support to Huggingfaceserver by @cmaddalozzo in #3582
  • VLLM support for OpenAI Completions in HF server by @gavrishp in #3589
  • Add a user friendly error message for http exceptions by @grandbora in #3581
  • feat: Provide minimal distribution of CRDs by @terrytangyuan in #3492
  • set default SAFETENSORS_FAST_GPU and HF_HUB_DISABLE_TELEMETRY in HF Server by @lizzzcai in #3594
  • Enabled the multiple domains support on an inference service by @houshengbo in #3615
  • Add base model for proxying request to an OpenAI API enabled model server by @cmaddalozzo in #3621
  • Add headers to predictor exception logging by @grandbora in #3658
  • Enhance controller setup based on available CRDs by @israel-hdez in #3472
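The xgboost entry above mentions support for JSON and UBJ model files. As a rough illustration only, a model loader could pick the file to load by extension. The function name `find_model_file`, the suffix tuple, and the inclusion of the legacy `.bst` suffix are assumptions made for this sketch and are not taken from the KServe code.

```python
from pathlib import Path

# Assumed set of suffixes for this sketch: .json and .ubj come from the
# release note; .bst is an assumption about a legacy binary format.
SUPPORTED_SUFFIXES = (".bst", ".json", ".ubj")


def find_model_file(model_dir):
    """Return the single supported model file found directly in model_dir.

    Hypothetical helper: raises ValueError when zero or several candidate
    files are present, so the caller never loads an ambiguous model.
    """
    candidates = sorted(
        p
        for p in Path(model_dir).iterdir()
        if p.is_file() and p.suffix in SUPPORTED_SUFFIXES
    )
    if len(candidates) != 1:
        raise ValueError(
            f"expected exactly one model file in {model_dir}, "
            f"found {len(candidates)}"
        )
    return candidates[0]
```

The returned path could then be handed to whatever loader the runtime uses; failing on ambiguity is a design choice of the sketch, not a documented behavior.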

⚠️ What's Changed

🐛 What's Fixed

⬆️ Version Upgrade

🔨 Project SDLC

CVE patches

📝 Documentation Update

New Contributors

Full Changelog: v0.12.1...v0.13.0-rc0

v0.12.1

23 Apr 12:20
d94ca25

What's Changed

Full Changelog: v0.12.0...v0.12.1

v0.12.0

25 Feb 17:17

🌈 What's New?

Core Inference & Serving Runtimes

Advanced Inference

KServe Python SDK, Storage

⚠️ What's Changed

  • Change the default value for enableDirectPvcVolumeMount to true by @Jooho in #3371
  • Add model arguments to API and update BERT inference example by @yuzisun in #3332

The arguments --model_name, --predictor_host, --predictor_use_ssl, and --predictor_request_timeout_seconds are added to the KServe model server and no longer need to be defined in a custom predictor or transformer. --protocol is deprecated and superseded by --predictor_protocol. More details can be found in the API reference documentation.
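The flag changes described above can be pictured with a small standalone parser. This is a minimal sketch using only Python's argparse, not the KServe implementation: the flag names come from the note above, while the default values, the `parse_args` helper, and the fallback from --protocol to --predictor_protocol are assumptions made for illustration.

```python
import argparse
import warnings


def build_parser():
    """Build a parser with the flags named in the release note."""
    parser = argparse.ArgumentParser(description="Illustrative model server flags")
    # All default values below are assumptions for this sketch.
    parser.add_argument("--model_name", default="model")
    parser.add_argument("--predictor_host", default=None)
    parser.add_argument("--predictor_use_ssl", action="store_true")
    parser.add_argument("--predictor_request_timeout_seconds", type=int, default=600)
    parser.add_argument("--predictor_protocol", default=None)
    # Deprecated flag, kept so older command lines still parse.
    parser.add_argument("--protocol", default=None, help="Deprecated")
    return parser


def parse_args(argv):
    """Parse argv and map the deprecated --protocol onto --predictor_protocol."""
    args = build_parser().parse_args(argv)
    if args.protocol is not None:
        warnings.warn(
            "--protocol is deprecated; use --predictor_protocol",
            DeprecationWarning,
        )
        # The new flag wins when both are given (an assumed policy).
        if args.predictor_protocol is None:
            args.predictor_protocol = args.protocol
    if args.predictor_protocol is None:
        args.predictor_protocol = "v1"  # assumed default
    return args
```

Because the server owns these flags in this sketch, a custom predictor or transformer would only declare arguments specific to itself.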

🐛 What's Fixed

⬆️ Version Upgrade

🔨 Project SDLC

v0.12.0-rc1

27 Jan 14:10
Pre-release

What's Changed

New Contributors

Full Changelog: v0.12.0-rc0...v0.12.0-rc1

v0.12.0-rc0

24 Dec 19:14
85eca89
Pre-release

What's Changed

New Contributors

Full Changelog: v0.11.1...v0.12.0-rc0

v0.11.2

15 Nov 14:13
f7db2a3

What's Changed

New Contributors

Full Changelog: v0.11.1...v0.11.2

v0.11.1

22 Sep 22:53
52b8804

What's Changed

New Contributors

Full Changelog: v0.11.0...v0.11.1