-
Notifications
You must be signed in to change notification settings - Fork 475
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug] Qwen2-VL占用显存过大导致OOM #2565
Comments
https://github.com/InternLM/lmdeploy/blob/main/docs/zh_cn/multi_modal/qwen2_vl.md https://github.com/InternLM/lmdeploy/blob/main/lmdeploy/vl/model/qwen2.py#L85-L87 |
以上文档里面指定 |
request 中添加 |
@Titan-p 试过了,确实可行。就是用起来很不方便,不知道是否得通过扩展客户端聊天UI应用程序来给每个带图片请求私下增加这个像素限制属性。 有没有办法直接在Server端直接配置呢? |
Checklist
Describe the bug
Qwen2-VL 7B按理说80G的显存是能跑下的,但实际部署时推理会OOM
Reproduction
lmdeploy serve api_server ../Qwen2-VL-7B-Instruct --server-port 12345
Environment
Error traceback
The text was updated successfully, but these errors were encountered: