Support Mono-InternVL with PyTorch backend #2727
Conversation
OMG, we have so many typos...
@zhulinJulia24 Please add Mono-InternVL into the test cases.
One known bug is that Mono-InternVL encounters NaN when using float16 (this issue), due to some numerical instability. We will fix this in the next version of the model, but for the current version float16 is not supported; bfloat16 works fine. Should I add a note about this? I tested KV INT8 and INT4 and they work fine. I suppose W8A8 and W4A16 are not supported, following InternVL2 on PyTorchEngine.
Sure. A note about this situation is appreciated. |
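For reference, below is a minimal sketch of the float16 guard this exchange asks for (the "add assertion for FP16" commits); the function name and message wording are illustrative, not the exact code merged in this PR.

```python
import torch


def _check_mono_internvl_dtype(dtype: torch.dtype) -> None:
    """Reject float16 for Mono-InternVL with an actionable message."""
    # The current Mono-InternVL produces NaN activations under float16
    # due to numerical instability, so fail fast and suggest bfloat16.
    assert dtype != torch.float16, (
        'Mono-InternVL does not support float16 due to numerical '
        'instability in the current model version. Please use bfloat16.')
```

As a usage note, the KV INT8/INT4 setup tested above would typically be enabled through the engine config (e.g. `quant_policy=8` in lmdeploy's `PytorchEngineConfig`); check the release docs for the exact option name.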
LGTM
LGTM
* support Mono-InternVL; fix typos
* update readme
* add assertion for FP16
* add assertion for FP16
* update _SUPPORTED_ARCHS
Support Mono-InternVL with the PyTorch backend by creating `internlm2_ve.py` and adding `is_mono` in `internvl.py`. The MoE structure is implemented with `vision_embedding_indexing` and `text_embedding_indexing`. Also fix some typos.
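To make the indexing scheme concrete, here is a hedged sketch of how `vision_embedding_indexing` and `text_embedding_indexing` can route tokens to modality-specific experts. The class and layer names are illustrative placeholders, not the actual code in `internlm2_ve.py`.

```python
import torch
import torch.nn as nn


class VisualExpertFFN(nn.Module):
    """Two-expert MoE block: one MLP for image tokens, one for text tokens."""

    def __init__(self, hidden_size: int, intermediate_size: int):
        super().__init__()
        # Separate, non-shared weights per modality.
        self.vision_mlp = nn.Sequential(
            nn.Linear(hidden_size, intermediate_size), nn.SiLU(),
            nn.Linear(intermediate_size, hidden_size))
        self.text_mlp = nn.Sequential(
            nn.Linear(hidden_size, intermediate_size), nn.SiLU(),
            nn.Linear(intermediate_size, hidden_size))

    def forward(self, hidden_states: torch.Tensor,
                vision_embedding_indexing: torch.Tensor,
                text_embedding_indexing: torch.Tensor) -> torch.Tensor:
        # hidden_states: (num_tokens, hidden_size); the two boolean masks
        # partition the token dimension into image and text positions.
        out = torch.empty_like(hidden_states)
        out[vision_embedding_indexing] = self.vision_mlp(
            hidden_states[vision_embedding_indexing])
        out[text_embedding_indexing] = self.text_mlp(
            hidden_states[text_embedding_indexing])
        return out
```

Dispatching by precomputed index masks rather than a learned router keeps each token's expert assignment fixed by modality, so image and text tokens always take their own path through the layer.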