Support mllama for pytorch engine #2605

AllentDan · 2024-10-15T09:45:11Z

Based on #2591.
~~Tensor parallel failed due to accelerate pp mode disabled during vision model inference. I am not sure if we should hack the code of transformers.~~

Conflicts: lmdeploy/pytorch/backends/attention.py lmdeploy/pytorch/backends/cuda/attention.py lmdeploy/pytorch/model_inputs.py

lmdeploy/turbomind/deploy/source_model/mllama.py

docs/zh_cn/multi_modal/mllama.md

lmdeploy/pytorch/models/module_map.py

docs/zh_cn/multi_modal/mllama.md

RunningLeon

LGTM

lmdeploy/pytorch/models/mllama.py

lmdeploy/vl/model/mllama.py

grimoire and others added 2 commits October 11, 2024 14:59

support cross-cache

aedd65b

Support mllama in pytorch engine

2be0cbb

lvhan028 requested a review from irexyc October 15, 2024 09:49

AllentDan added 3 commits October 16, 2024 11:13

add rewrite to support accelerate pp

cc71897

update cross kv lens accordingly

5b9c354

Merge branch 'main' into mllama

f021d23

Conflicts: lmdeploy/pytorch/backends/attention.py lmdeploy/pytorch/backends/cuda/attention.py lmdeploy/pytorch/model_inputs.py

lvhan028 added the enhancement New feature or request label Oct 16, 2024

fix ut and fill cache index

f2a751a

lvhan028 reviewed Oct 16, 2024

View reviewed changes

lmdeploy/turbomind/deploy/source_model/mllama.py Outdated Show resolved Hide resolved

AllentDan added 4 commits October 18, 2024 14:58

remove mllama.py

1f3dfd9

fix pure text input error

7a2f310

another cat for tp

4cb10a8

fix

069aa46

lvhan028 requested a review from RunningLeon October 21, 2024 02:21

RunningLeon reviewed Oct 21, 2024

View reviewed changes

docs/zh_cn/multi_modal/mllama.md Show resolved Hide resolved

RunningLeon reviewed Oct 21, 2024

View reviewed changes

lmdeploy/pytorch/models/module_map.py Show resolved Hide resolved

AllentDan added 3 commits October 21, 2024 10:58

add no split module for 90B

f570a4a

refine

ce06c6b

update supported_models

1406981

RunningLeon reviewed Oct 21, 2024

View reviewed changes

docs/zh_cn/multi_modal/mllama.md Show resolved Hide resolved

AllentDan added 2 commits October 22, 2024 10:08

strict check

d1f38a2

update image inputs

3c8b97c

RunningLeon approved these changes Oct 22, 2024

View reviewed changes

handle image with shape 1x1

4dbd419

RunningLeon reviewed Oct 22, 2024

View reviewed changes

lmdeploy/pytorch/models/mllama.py Show resolved Hide resolved

irexyc reviewed Oct 23, 2024

View reviewed changes

lmdeploy/vl/model/mllama.py Outdated Show resolved Hide resolved

use config device

6875122

irexyc approved these changes Oct 23, 2024

View reviewed changes

lvhan028 merged commit f4e0343 into InternLM:main Oct 24, 2024
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support mllama for pytorch engine #2605

Support mllama for pytorch engine #2605

AllentDan commented Oct 15, 2024 •

edited

Loading

RunningLeon left a comment

Support mllama for pytorch engine #2605

Support mllama for pytorch engine #2605

Conversation

AllentDan commented Oct 15, 2024 • edited Loading

RunningLeon left a comment

Choose a reason for hiding this comment

AllentDan commented Oct 15, 2024 •

edited

Loading