Actions: triple-Mu/vllm_official

clang-format

235 workflow runs

[core][distributed] simplify code to support pipeline parallel (#6406)
clang-format #60: Commit 69672f1 pushed by triple-Mu
July 15, 2024 05:09 20s main
Report usage for beam search (#6404)
clang-format #59: Commit 32c9d7f pushed by triple-Mu
July 15, 2024 03:16 15s main
July 15, 2024 02:34 24s
[Kernel] Turn off CUTLASS scaled_mm for Ada Lovelace (#6384)
clang-format #57: Commit 9dad5cc pushed by triple-Mu
July 14, 2024 13:39 19m 42s main
Remove unnecessary trailing period in spec_decode.rst (#6405)
clang-format #56: Commit 6ef3bf9 pushed by triple-Mu
July 14, 2024 13:24 14s main
[Misc][Bugfix] Update transformers for tokenizer issue (#6364)
clang-format #55: Commit f7160d9 pushed by triple-Mu
July 12, 2024 09:16 18s main
[Doc] Guide for adding multi-modal plugins (#6205)
clang-format #54: Commit 8a924d2 pushed by triple-Mu
July 10, 2024 08:27 18s main
[Bugfix][TPU] Add prompt adapter methods to TPUExecutor (#6279)
clang-format #53: Commit 5ed3505 pushed by triple-Mu
July 10, 2024 05:26 18s main
Add FlashInfer to default Dockerfile (#6172)
clang-format #52: Commit 4f0e0ea pushed by triple-Mu
July 9, 2024 02:00 15s main
[Model] Add PaliGemma (#5189)
clang-format #51: Commit 6206dcb pushed by triple-Mu
July 7, 2024 07:17 16s main
[VLM] Cleanup validation and update docs (#6149)
clang-format #50: Commit ea4b570 pushed by triple-Mu
July 5, 2024 08:05 16s main
July 4, 2024 06:28 18s
[Distributed][Core] Support Py39 and Py38 for PP (#6120)
clang-format #48: Commit 0ed646b pushed by triple-Mu
July 4, 2024 01:35 15s main
July 3, 2024 06:57 20s
[Model] Jamba support (#4115)
clang-format #46: Commit 9d6a8da pushed by triple-Mu
July 3, 2024 01:37 15s main
[Speculative Decoding] MLPSpeculator Tensor Parallel support (1/2) (#…
clang-format #45: Commit 15aba08 pushed by triple-Mu
July 2, 2024 15:33 15m 37s main
[VLM] Remove image_input_type from VLM config (#5852)
clang-format #44: Commit 98d6682 pushed by triple-Mu
July 2, 2024 10:14 22s main
[Frontend] Add template related params to request (#5709)
clang-format #43: Commit 2c37540 pushed by triple-Mu
July 2, 2024 06:42 17s main
July 2, 2024 03:18 23s
July 1, 2024 02:01 18s
[VLM][BugFix] Make sure that multi_modal_kwargs can broadcast prope…
clang-format #40: Commit 74d55c0 pushed by triple-Mu
June 28, 2024 07:47 25s main
[Kernel][ROCm][AMD] fused_moe Triton configs v2 for mi300X (#5932)
clang-format #39: Commit c3dde36 pushed by triple-Mu
June 28, 2024 02:13 18s main
[Model] Add base class for LoRA-supported models (#5018)
clang-format #38: Commit 96354d6 pushed by triple-Mu
June 27, 2024 08:22 18s main
[CI/Build] Refactor image test assets (#5821)
clang-format #37: Commit 6984c02 pushed by triple-Mu
June 26, 2024 09:13 15s main
[Bugfix] Add fully sharded layer for QKVParallelLinearWithLora (#5665)
clang-format #36: Commit 67005a0 pushed by triple-Mu
June 21, 2024 05:09 14s main