Skip to content

GPTQModel v1.1.0

Compare
Choose a tag to compare
@Qubitium Qubitium released this 29 Oct 15:48
· 676 commits to main since this release
6468062

What's Changed

IBM Granite model support. Full auto-buildless wheel install from pypi. Reduce max cpu memory usage by >20% during quantization. 100% CI model/feature coverage. Updated hf-integration support with latest transformers.

Full deprecations: liger-kernel support and exllama v1 quant kernel.

Full Changelog: v1.0.9...v1.1.0