Issues: vllm-project/vllm

[Roadmap] vLLM Roadmap Q1 2025
#11862 opened Jan 8, 2025 by simon-mo
Open
vLLM's V1 Engine Architecture
#8779 opened Sep 24, 2024 by simon-mo
Open 10

Issues list

[Bug]: The output of Aria model is not correct (label: bug)
#12241 opened Jan 21, 2025 by xffxff
1 task done
[Usage]: how to input messages as multi-message (a batch) instead of just one (label: usage)
#12234 opened Jan 21, 2025 by Hyfred
1 task done
[New Model]: Add support for DeepSeek R1 (label: new model)
#12226 opened Jan 20, 2025 by jorgeantonio21
1 task done
[Usage]: Guided choice not working as expected (label: usage)
#12225 opened Jan 20, 2025 by srsingh24
1 task done
[Usage]: Context window crashes web window when full (label: usage)
#12221 opened Jan 20, 2025 by seabastard
1 task done
[Usage]: BNB quantization not supported for Paligemma2 model (label: usage)
#12216 opened Jan 20, 2025 by ken2190
1 task done
[Usage]: how to generate results and get the embeddings of the result (label: usage)
#12213 opened Jan 20, 2025 by daiwk
1 task done
[Bug]: Inconsistent data received and sent using PyNcclPipe (label: bug)
#12197 opened Jan 20, 2025 by fanfanaaaa
1 task done
[Bug]: CUDA initialization error with vLLM 0.5.4 and PyTorch 2.4.0+cu121 (label: bug)
#12189 opened Jan 19, 2025 by TaoShuchang
1 task done
[Bug]: Fail to use beamsearch with llm.chat (label: bug)
#12183 opened Jan 18, 2025 by gystar
1 task done
[Bug]: Multi-Node Online Inference on TPUs Failing (label: bug)
#12179 opened Jan 17, 2025 by BabyChouSr
1 task done
[Bug]: Slow huggingface weights download. Sequential download (label: bug)
#12177 opened Jan 17, 2025 by NikolaBorisov
1 task done
[New Model]: openbmb/MiniCPM-o-2_6 (label: new model)
#12162 opened Jan 17, 2025 by myoss
1 task done
[Usage]: Terminates without any error 30 seconds after a successful run. (label: usage)
#12160 opened Jan 17, 2025 by hznnnnnn
1 task done