
[MODEL] Add Telechat2 (China Telecom) #1106

Merged · 3 commits into ModelCloud:main · Jan 20, 2025
Conversation

@1096125073 (Contributor)

Add support for the TeleChat2 model (vLLM now supports the TeleChat model).

@Qubitium (Collaborator)

@1096125073 Do you have an HF model URL for this model? We need to download the model for CI testing.

@1096125073 (Contributor, Author) commented Jan 20, 2025

> @1096125073 Do you have an HF model URL for this model? We need to download the model for CI testing.

Yes, you can try this one:
https://huggingface.co/Tele-AI/TeleChat2-7B
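
For reference, a minimal GPTQModel quantization sketch against that checkpoint. This assumes GPTQModel's load/quantize/save flow; the calibration texts and output path are placeholders, and details such as the trust_remote_code pass-through may vary by version:

```python
# Minimal sketch (assumes GPTQModel's load/quantize/save flow; details
# such as the trust_remote_code pass-through may vary by version).
from gptqmodel import GPTQModel, QuantizeConfig

# Placeholder calibration texts; real calibration should use a few
# hundred representative samples.
calibration = [
    "gptqmodel is an easy-to-use model quantization library.",
    "TeleChat2 is a chat model series released by China Telecom.",
]

quant_config = QuantizeConfig(bits=4, group_size=128)

model = GPTQModel.load(
    "Tele-AI/TeleChat2-7B",
    quant_config,
    trust_remote_code=True,  # TeleChat2 ships custom modeling code
)
model.quantize(calibration)
model.save("TeleChat2-7B-gptq-4bit")  # placeholder output path
```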

Review threads on gptqmodel/models/auto.py and gptqmodel/models/definitions/telechat2.py (resolved)
@Qubitium (Collaborator)

@1096125073 Thank you for the PR. I will have @LRL-ModelCloud take over the PR changes from here. He will fix this PR so it can run correctly in GPTQModel with CI tests. Right now, this code will not run since GPTQModel uses a different structure for defining models.

@1096125073 (Contributor, Author)

> @1096125073 Thank you for the PR. I will have @LRL-ModelCloud take over the PR changes from here. He will fix this PR so it can run correctly in GPTQModel with CI tests. Right now, this code will not run since GPTQModel uses a different structure for defining models.

Sorry, I just discovered that this library's model-definition properties are different from AutoGPTQ's; the PR has been updated.
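
For context on that structure: GPTQModel model definitions are small declarative classes listing which modules to quantize. A rough sketch of what the telechat2.py definition could look like; the attribute values and TeleChat2 module paths below are assumptions, and the file merged in this PR is authoritative:

```python
# Rough sketch of a GPTQModel-style model definition. Attribute names
# follow GPTQModel's definition pattern; the TeleChat2 module paths are
# assumptions, so defer to gptqmodel/models/definitions/telechat2.py.
from ..base import BaseGPTQModel


class TeleChat2GPTQ(BaseGPTQModel):
    # modules kept in full precision (embeddings, final norm)
    base_modules = ["transformer.word_embeddings", "transformer.ln_f"]

    # where the repeated decoder blocks live, and their class name
    layers_node = "transformer.h"
    layer_type = "TelechatBlock"

    # per-block linear submodules to quantize, grouped by execution order
    layer_modules = [
        ["self_attention.query", "self_attention.key_value"],
        ["self_attention.dense"],
        ["mlp.gate_proj", "mlp.up_proj"],
        ["mlp.down_proj"],
    ]
```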

@Qubitium changed the title from "add support for telechat2" to "[MODEL] Add Telechat2 (China Telecom)" on Jan 20, 2025
@Qubitium (Collaborator)

> > @1096125073 Thank you for the PR. I will have @LRL-ModelCloud take over the PR changes from here. He will fix this PR so it can run correctly in GPTQModel with CI tests. Right now, this code will not run since GPTQModel uses a different structure for defining models.

> Sorry, I just discovered that this library's model-definition properties are different from AutoGPTQ's; the PR has been updated.

No problem. We will fix this. But please stop force-pushing so we can fix this PR.

@1096125073 requested a review from @Qubitium on January 20, 2025 at 02:52
@Qubitium (Collaborator) commented Jan 20, 2025

@1096125073 We are testing.

1. If you are with the TeleChat2 team, please ask them to add torch_dtype to config.json.

2. The assert at https://huggingface.co/Tele-AI/TeleChat2-7B/blob/main/modeling_telechat2.py#L186 is strange: why does it force (assert) CUDA? GPTQModel supports inference on multiple devices and does not require CUDA.
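
To illustrate the concern (paraphrased, not the exact upstream code), a hard CUDA assert in forward versus a device-agnostic alternative:

```python
import torch

def build_mask(hidden_states: torch.Tensor) -> torch.Tensor:
    # Problematic pattern (paraphrased from the linked modeling code):
    # assert hidden_states.is_cuda  # breaks CPU/MPS/XPU inference

    # Device-agnostic alternative: allocate on whatever device the
    # activations already live on and never assume CUDA.
    return torch.ones(
        hidden_states.shape[:2],
        dtype=torch.bool,
        device=hidden_states.device,
    )
```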

@1096125073 (Contributor, Author)

> @1096125073 We are testing.
>
> 1. If you are with the TeleChat2 team, please ask them to add torch_dtype to config.json.
>
> 2. The assert at https://huggingface.co/Tele-AI/TeleChat2-7B/blob/main/modeling_telechat2.py#L186 is strange: why does it force (assert) CUDA? GPTQModel supports inference on multiple devices and does not require CUDA.

Sorry, this is being handled by another team, but we are currently organizing the code and submitting a PR to the transformers library. I believe this will be resolved soon.

@Qubitium (Collaborator)

@1096125073 We are running into dtype assertion errors in forward when we load the original model as bfloat16; float16 runs. Can you confirm whether the model .bin files are natively bfloat16 or float16?

@1096125073 (Contributor, Author)

> @1096125073 We are running into dtype assertion errors in forward when we load the original model as bfloat16; float16 runs. Can you confirm whether the model .bin files are natively bfloat16 or float16?

Yes, the 7B runs as float16.
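
One way to verify that independently is to inspect the dtypes stored in a checkpoint shard (the filename below is assumed; adjust it to an actual shard name in the repo):

```python
import torch

# Load one checkpoint shard on CPU and list the tensor dtypes it holds.
state = torch.load("pytorch_model.bin", map_location="cpu")
print({tensor.dtype for tensor in state.values()})  # e.g. {torch.float16}
```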

@Qubitium merged commit 23603f6 into ModelCloud:main on Jan 20, 2025