Skip to content

Releases: AI-Hypercomputer/JetStream

v0.3

18 Dec 19:41
973647d
Compare
Choose a tag to compare

Key Changes

  • Observability improvements in JetStream Server (prometheus metrics)
  • Tensorboard support for remote access
  • Engine API update for TTFT and TPOT measurements
  • Hugginface tokenizer support
  • Copybara G3 support
  • Threading optimizations

What's Changed

New Contributors

Full Changelog: v0.2.2...v0.3

v0.2.2

31 May 18:44
ec26ec2
Compare
Choose a tag to compare

Key Changes

  • Enable observability in JetStream Server (prometheus metrics)
  • Enable JAX profiler support on single-host JetStream Server
  • Support both text and token ids I/O for JetStream Decode API
  • Add health check API
  • Support MLPerf evaluation
  • Enable JetStream Server E2E tests
  • Increase unit test coverage (>=96%)

What's Changed

New Contributors

Full Changelog: v0.2.1...v0.2.2

v0.2.1

03 May 21:23
dabded4
Compare
Choose a tag to compare

Key Changes

  • Support Llama3 tokenizer
  • JetStream Tokenizer refactor
  • Disaggregation preparation work

What's Changed

New Contributors

Full Changelog: v0.2.0...v0.2.1

v0.2.0

05 Apr 20:42
ae2ca8c
Compare
Choose a tag to compare

Major Changes

  • Support JetStream MaxText inference on Cloud TPU VM
  • Support JetStream Pytorch inference on Cloud TPU VM
  • Support Continuous Batching with interleaved mode in JetStream
  • Support online serving benchmarking

What's Changed

New Contributors

Full Changelog: https://github.com/google/JetStream/commits/v0.2.0