
[Feature] turbomind mode support Llava-Qwen with new ImageEncoder #2497

Closed

deepindeed2022 opened this issue Sep 23, 2024 · 1 comment

@deepindeed2022 (Contributor)
Motivation

We would like to run offline inference of the llava-interleave-qwen-7b-hf model with turbomind. Are there any examples we could refer to? In particular: model parameter configuration, the model conversion/loading process, and points to watch out for when implementing model inference.

Related resources

Additional context

No response

@irexyc (Collaborator) commented Sep 24, 2024

To stay compatible with both the pytorch and turbomind backends, we split the vision model out of the VLM.

Supporting turbomind offline inference mainly involves three parts:

  1. the turbomind model-loading logic (for an LLM architecture that is already supported, usually remapping the weight keys is enough)
  2. the vision model loading and inference logic
  3. the chat template, mainly the question of how IMAGE_TOKEN gets inserted.
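As an illustration of step 1, here is a minimal sketch of what remapping weight keys can look like. The `language_model.` prefix and the function name are illustrative assumptions about the checkpoint layout, not LMDeploy's actual mapping tables.

```python
# Hypothetical sketch: map a llava-interleave-qwen HF checkpoint key to
# the plain Qwen key layout that an already-supported turbomind LLM
# loader would expect. Names are illustrative, not LMDeploy's real code.

def remap_llava_qwen_key(hf_key: str) -> str:
    """Strip the VLM wrapper prefix so existing Qwen loading applies."""
    prefix = "language_model."
    if hf_key.startswith(prefix):
        return hf_key[len(prefix):]
    # Keys outside the language model (e.g. vision tower) pass through
    # unchanged and are handled by the separate vision loading path.
    return hf_key

print(remap_llava_qwen_key(
    "language_model.model.layers.0.self_attn.q_proj.weight"))
# → model.layers.0.self_attn.q_proj.weight
```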

I suggest looking at how this PR does it; it covers essentially all of the parts above. If you run into problems, we can discuss further:
https://github.com/InternLM/lmdeploy/pull/1425/files
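For step 3, a rough sketch of inserting IMAGE_TOKEN into a prompt according to a chat template. The placeholder string and the ChatML-style template below are illustrative assumptions, not the model's real template.

```python
# Hypothetical sketch: build a prompt with one image placeholder per
# image, so the engine can later splice vision embeddings in at those
# positions. Token string and template are assumptions for illustration.

IMAGE_TOKEN = "<IMAGE_TOKEN>"

def build_prompt(user_text: str, num_images: int) -> str:
    # Prepend one placeholder per attached image before the user text.
    placeholders = IMAGE_TOKEN * num_images
    return (f"<|im_start|>user\n{placeholders}{user_text}<|im_end|>\n"
            f"<|im_start|>assistant\n")

print(build_prompt("Describe the image.", 1))
```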

deepindeed2022 added a commit to deepindeed2022/lmdeploy that referenced this issue Oct 25, 2024
- fix init raise exception because tie_word_embeddings config
- max_batch_size option for start
deepindeed2022 added a commit to deepindeed2022/lmdeploy that referenced this issue Oct 26, 2024
- fix init raise exception because tie_word_embeddings config