This repository has been archived by the owner on Oct 25, 2024. It is now read-only.

[vLLM] QBits Perf Enhence #4969

Triggered via pull request May 30, 2024 10:42
Status Success
Total duration 5m 6s
Artifacts

chatbot-test.yml

on: pull_request
call-inference-llama-2-7b-chat-hf  /  inference test
2m 19s
call-inference-mpt-7b-chat  /  inference test
2m 11s