GPTQModel v1.7.3
What's Changed
⚡ Telechat2 (China Telecom) model support
⚡ PhiMoE model support
🐛 Fix lm_head weights being duplicated in post-quantize save() for models with tied embeddings.
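
For context on the tied-embedding fix: when lm_head shares its weight with the input embedding, a naive save writes the same buffer under two names. The sketch below is not GPTQModel's code. It is a minimal pure-Python illustration of the general idea, under these assumptions: `filter_shared_entries` is a hypothetical helper, `bytearray` objects stand in for tensors, and `id()` stands in for a storage identity check (with PyTorch tensors one would typically compare `tensor.data_ptr()` instead).

```python
from collections import OrderedDict


def filter_shared_entries(state_dict):
    """Keep only the first name that refers to each underlying buffer.

    Hypothetical helper for illustration; not part of GPTQModel.
    """
    seen = set()
    unique = OrderedDict()
    for name, buf in state_dict.items():
        key = id(buf)  # stand-in for a storage pointer such as data_ptr()
        if key in seen:
            continue  # same buffer already recorded under another name
        seen.add(key)
        unique[name] = buf
    return unique


# A toy "state dict" where lm_head is tied to the embedding buffer.
embed = bytearray(8)
state = OrderedDict([
    ("model.embed_tokens.weight", embed),
    ("model.layers.0.mlp.weight", bytearray(4)),
    ("lm_head.weight", embed),  # tied: same object as the embedding
])

saved = filter_shared_entries(state)
print(list(saved))
```

With this toy input the tied `lm_head.weight` entry is dropped and the embedding is written once. A loader then has to re-tie the weights after loading the checkpoint, which is the step #1119 addresses.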
- Add util.tensor_parameters() by @ZX-ModelCloud in #1107
- add require_dtype by @LRL-ModelCloud in #1109
- [MODEL] Add Telechat2 (China Telecom) by @1096125073 in #1106
- [FIX] Filter weight-sharing tensors when save by @ZX-ModelCloud in #1112
- Add telechat test by @LRL-ModelCloud in #1111
- [FIX] fix convert_gptq_to_mlx_weights by @LRL-ModelCloud in #1113
- add test_parameter_count.py by @ZX-ModelCloud in #1115
- Add gpqa eval task by @CL-ModelCloud in #1117
- [FIX] Call tied_weights() after load_checkpoint_in_model() by @ZX-ModelCloud in #1119
- add phimoe support by @CSY-ModelCloud in #1118
New Contributors
- @1096125073 made their first contribution in #1106
Full Changelog: v1.7.2...v1.7.3