
Which transformer version could be used with vLLM 0.6.2? #282

Open
bash99 opened this issue Sep 26, 2024 · 13 comments

bash99 commented Sep 26, 2024

vLLM 0.6.2 was released just a few hours ago, and it says it now supports multi-image inference with Qwen2-VL.

I tried it, but it requires the newest transformers and installs it automatically.

When I start it with the following script (which worked with vLLM 0.6.1):

VLLM_WORKER_MULTIPROC_METHOD=spawn CUDA_VISIBLE_DEVICES=0,1 python -m vllm.entrypoints.openai.api_server --served-model-name Qwen2-VL-72B-Instruct-GPTQ-Int4 --model Qwen/Qwen2-VL-72B-Instruct-GPTQ-Int4 --port 7869 --dtype half --trust-remote-code --kv-cache-dtype fp8 -q gptq --disable-log-requests --gpu-memory-utilization 0.998 --max-model-len 24576 --max_num_seqs 16 -tp 2

it reports an error like:

  File "/DaTa/.local/home/hai.li/miniforge3/envs/vllm/lib/python3.12/site-packages/vllm/config.py", line 1746, in _get_and_verify_max_len
    assert "factor" in rope_scaling
           ^^^^^^^^^^^^^^^^^^^^^^^^
AssertionError
Unrecognized keys in `rope_scaling` for 'rope_type'='default': {'mrope_section'}

If I revert to the older transformers with

pip install git+https://github.com/huggingface/transformers@21fac7abba2a37fae86106f87fcf9974fd1e3830

it reports an error like:

  File "/DaTa/.local/home/hai.li/miniforge3/envs/vllm/lib/python3.12/site-packages/vllm/transformers_utils/configs/mll
ama.py", line 1, in <module>
    from transformers.models.mllama import configuration_mllama as mllama_hf_config
ModuleNotFoundError: No module named 'transformers.models.mllama'

xiehust commented Sep 26, 2024

same issue +1


mkaskov commented Sep 26, 2024

the same


xiehust commented Sep 26, 2024

It seems the issue has been found: vllm-project/vllm#8829


DarkLight1337 commented Sep 26, 2024

We have just now fixed the issue in vllm-project/vllm#8837. Please install vLLM from source to resolve the config loading problem.


verigle commented Sep 27, 2024

> We have just now fixed the issue in vllm-project/vllm#8837. Please install vLLM from source to resolve the config loading problem.

vLLM still does not support question answering over multiple images or videos. Are there plans to fix this?


DarkLight1337 commented Sep 27, 2024

> We have just now fixed the issue in vllm-project/vllm#8837. Please install vLLM from source to resolve the config loading problem.
>
> vLLM still does not support question answering over multiple images or videos. Are there plans to fix this?

Multi-image input is currently supported in both offline and online inference, while video input is only supported for offline inference at the moment. If you need to pass videos via the OpenAI API, you can instead provide multiple images for now. Please check the example in examples/openai_vision_api_client.py (especially the part labelled "Multi-image input inference").
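
As a rough illustration of that multi-image flow, here is a minimal sketch of an OpenAI-style request against the server command from the first post (it assumes the server is reachable at http://localhost:7869/v1 under the served model name Qwen2-VL-72B-Instruct-GPTQ-Int4; the image URLs are placeholders, not part of the original thread):

# Minimal multi-image chat request to a vLLM OpenAI-compatible server.
# Assumptions: server at http://localhost:7869/v1, served model name
# "Qwen2-VL-72B-Instruct-GPTQ-Int4", and reachable image URLs (placeholders).
from openai import OpenAI

client = OpenAI(base_url="http://localhost:7869/v1", api_key="EMPTY")

response = client.chat.completions.create(
    model="Qwen2-VL-72B-Instruct-GPTQ-Int4",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "What are the differences between these two images?"},
                {"type": "image_url", "image_url": {"url": "https://example.com/image1.jpg"}},
                {"type": "image_url", "image_url": {"url": "https://example.com/image2.jpg"}},
            ],
        }
    ],
    max_tokens=256,
)
print(response.choices[0].message.content)

Depending on the vLLM version, the server may also need to be launched with a flag such as --limit-mm-per-prompt image=2 before it accepts more than one image per request; check the docs for your build.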

@jbohnslav

> We have just now fixed the issue in vllm-project/vllm#8837. Please install vLLM from source to resolve the config loading problem.

Can we get a .post0 release for this? Installing from source is a lot more difficult.

@seetimee

Use
pip install https://vllm-wheels.s3.us-west-2.amazonaws.com/nightly/vllm-1.0.0.dev-cp38-abi3-manylinux1_x86_64.whl
It will install the latest nightly build of vLLM. It worked for me just now.

@DarkLight1337

> Use pip install https://vllm-wheels.s3.us-west-2.amazonaws.com/nightly/vllm-1.0.0.dev-cp38-abi3-manylinux1_x86_64.whl. It will install the latest nightly build of vLLM. It worked for me just now.

I'd recommend against installing .whl files directly unless it's from a source that you trust.

@fyabc fyabc self-assigned this Sep 29, 2024
@leonbadboy

Running vLLM 0.6.2 still reports this error: Unrecognized keys in `rope_scaling` for 'rope_type'='default': {'mrope_section'}

@DarkLight1337

> Running vLLM 0.6.2 still reports this error: Unrecognized keys in `rope_scaling` for 'rope_type'='default': {'mrope_section'}

Please install vLLM from source to fix the issue.


Carkham commented Oct 10, 2024

Hi all, I'm encountering the same error. It seems related to the rope_scaling["type"] settings in Qwen2VLConfig from the transformers library.

You can check the relevant code here:

if self.rope_scaling is not None and "type" in self.rope_scaling:
    if self.rope_scaling["type"] == "mrope":
        # self.rope_scaling["type"] = "default"
        pass  # keep "mrope" instead of rewriting it to "default"
    self.rope_scaling["rope_type"] = self.rope_scaling["type"]
rope_config_validation(self, ignore_keys={"mrope_section"})

After commenting out that line, my program works well with transformers==4.45.2 and vllm==0.6.2.

@Tongjilibo

Changing `type` to `rope_type` in config.json can fix this error in vllm==0.6.2.
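
For illustration only, a small sketch of that edit (the model path is hypothetical; back up config.json before modifying it):

# Rename the "type" key to "rope_type" inside rope_scaling in a local config.json.
# The path below is a placeholder for wherever the Qwen2-VL checkpoint lives.
import json

config_path = "/path/to/Qwen2-VL-72B-Instruct-GPTQ-Int4/config.json"

with open(config_path) as f:
    config = json.load(f)

rope_scaling = config.get("rope_scaling", {})
if "type" in rope_scaling:
    rope_scaling["rope_type"] = rope_scaling.pop("type")
    with open(config_path, "w") as f:
        json.dump(config, f, indent=2)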
