Skip to content

GPTQModel v1.7.0

Compare
Choose a tag to compare
@Qubitium Qubitium released this 17 Jan 01:34
· 114 commits to main since this release
d247fd0

What's Changed

backend.MLX added for runtime-conversion and execution of GPTQ models on Apple's MLX framework on Apple Silicon (M1+). ⚡ Exports of gptq models to mlx also now possible. We have added mlx exported models to huggingface.co/ModelCloud.
⚡ lm_head quantization now fully support by GPTQModel without external pkg dependency.
🐛 Fixed setup.py not correctly detecting incompatible setuptools/wheel pkgs.

Full Changelog: v1.6.1...v1.7.0