Release GPTQModel v1.7.3 · ModelCloud/GPTQModel

What's Changed

⚡ Telechat2 (China Telecom) model support
⚡ PhiMoE model support
🐛 Fix lm_head weights duplicated in post-quantize save() for models with tied-embedding.

Add util.tensor_parameters() by @ZX-ModelCloud in #1107
add require_dtype by @LRL-ModelCloud in #1109
[MODEL] Add Telechat2 (China Telecom) by @1096125073 in #1106
[FIX] Filter weight-sharing tensors when save by @ZX-ModelCloud in #1112
Add telechat test by @LRL-ModelCloud in #1111
[FIX] fix convert_gptq_to_mlx_weights by @LRL-ModelCloud in #1113
add test_parameter_count.py by @ZX-ModelCloud in #1115
Add gpqa eval task by @CL-ModelCloud in #1117
[FIX] Call tied_weights() after load_checkpoint_in_model() by @ZX-ModelCloud in #1119
add phimoe support by @CSY-ModelCloud in #1118

New Contributors

@1096125073 made their first contribution in #1106

Full Changelog: v1.7.2...v1.7.3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GPTQModel v1.7.3

What's Changed

New Contributors

Contributors