Actions: triple-Mu/vllm_official

Lint GitHub Actions workflows

13 workflow runs

[misc] benchmark_throughput : Add LoRA (#11267)
Lint GitHub Actions workflows #13: Commit 9835673 pushed by triple-Mu
December 19, 2024 07:56 16s main
[Bugfix] Handle <|tool_call|> token in granite tool parser (#11039)
Lint GitHub Actions workflows #12: Commit beb16b2 pushed by triple-Mu
December 10, 2024 10:43 17s main
[model] Reduce medusa weight (#10454)
Lint GitHub Actions workflows #11: Commit 343041c pushed by triple-Mu
November 20, 2024 06:32 15s main
[Misc][LoRA] Replace hardcoded cuda device with configurable argument…
Lint GitHub Actions workflows #10: Commit 7f5edb5 pushed by triple-Mu
November 12, 2024 03:50 18s main
Fixes a typo about 'max_decode_seq_len' which causes crashes with cud…
Lint GitHub Actions workflows #9: Commit da07a9e pushed by triple-Mu
November 8, 2024 07:11 15s main
Adds method to read the pooling types from model's files (#9506)
Lint GitHub Actions workflows #8: Commit aa9078f pushed by triple-Mu
November 7, 2024 08:49 15s main
[v1] reduce graph capture time for piecewise cudagraph (#10059)
Lint GitHub Actions workflows #7: Commit c4cacba pushed by triple-Mu
November 6, 2024 02:34 3m 38s main
[Bugfix] Fix ray instance detect issue (#9439)
Lint GitHub Actions workflows #6: Commit 2adb440 pushed by triple-Mu
October 28, 2024 14:24 20s main
[V1] Support sliding window attention (#9679)
Lint GitHub Actions workflows #5: Commit 9645b9f pushed by triple-Mu
October 25, 2024 11:59 19s main
[CI/Build] Split up decoder-only LM tests (#9488)
Lint GitHub Actions workflows #4: Commit 696b01a pushed by triple-Mu
October 21, 2024 14:25 1m 9s main
[misc] hide best_of from engine (#9261)
Lint GitHub Actions workflows #3: Commit cbc2ef5 pushed by triple-Mu
October 11, 2024 07:04 17s main
Add classifiers in setup.py (#9171)
Lint GitHub Actions workflows #2: Commit ffc4b27 pushed by triple-Mu
October 9, 2024 03:22 17s main
[Frontend] API support for beam search for MQLLMEngine (#9117)
Lint GitHub Actions workflows #1: Commit 8c74622 pushed by triple-Mu
October 8, 2024 13:37 15s main