DAQ: Density-Aware Post-Training Weight-Only Quantization For LLMs

A weight-only quantization method for LLMs in FP4 format.

Install

1. Clone this repository and navigate to the DAQ folder:

git clone https://github.com/LuoYingSong/DAQ
cd DAQ

2. Install the package:

conda create -n daq python=3.10 -y
conda activate daq
pip install --upgrade pip  # enable PEP 660 support
pip install -e .
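
Optionally, verify the environment before running anything. This is a generic sanity check rather than a documented setup step; it only assumes PyTorch is installed as a dependency:

python -c "import torch; print(torch.__version__, torch.cuda.is_available())"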

Usage

To run DAQ (point --model_path at a local copy of the Llama-2-7b-hf weights):

python awq/entry.py --sample 1 --model_path /Llama-2-7b-hf --run_daq --tasks wikitext --w_bit 4 --q_group_size -1 --q_backend fake --dump daq_cache/Llama-2-7b-hf.pt
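
The --dump flag writes the quantization results to a .pt file. As a minimal sketch, assuming the file is an ordinary PyTorch checkpoint (as in the upstream AWQ codebase), you can inspect what was saved:

import torch

# Load the dumped results on CPU; the exact contents depend on the entry script.
results = torch.load("daq_cache/Llama-2-7b-hf.pt", map_location="cpu")
print(type(results))
if isinstance(results, dict):
    print(list(results.keys()))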

To run DAQ+AWQ:

python awq/entry.py --model_path /Llama-2-7b-hf --calibration daq --run_awq --tasks wikitext --w_bit 4 --q_group_size -1 --q_backend fake --dump awq_cache/Llama-2-7b-hf.pt --sample 2 --data_type nf4
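
The --q_backend fake option simulates low-bit storage: weights are snapped to a 4-bit value grid and immediately de-quantized, so accuracy can be evaluated without packed low-bit kernels. The sketch below illustrates the general idea of weight-only fake quantization to an FP4 (E2M1) grid. It is a generic illustration, not DAQ's density-aware algorithm; the E2M1 value grid and the per-row absmax scaling are assumptions for the example:

import torch

# One common FP4 (E2M1) magnitude grid: 0, 0.5, 1, 1.5, 2, 3, 4, 6.
FP4_GRID = torch.tensor([0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0])
FP4_LEVELS = torch.cat([-FP4_GRID.flip(0), FP4_GRID])  # 16 signed levels

def fake_quantize_fp4(w: torch.Tensor) -> torch.Tensor:
    """Snap each row of w to the FP4 grid and de-quantize (round-trip)."""
    # Per-output-channel absmax scaling maps each row into [-6, 6].
    scale = (w.abs().amax(dim=1, keepdim=True) / FP4_GRID.max()).clamp(min=1e-8)
    w_scaled = w / scale
    # Nearest-neighbor lookup into the 16 representable FP4 values.
    idx = (w_scaled.unsqueeze(-1) - FP4_LEVELS).abs().argmin(dim=-1)
    return FP4_LEVELS[idx] * scale

w = torch.randn(4, 8)
print((w - fake_quantize_fp4(w)).abs().max())  # worst-case quantization error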

Acknowledgements

We would like to express our gratitude to the AWQ project for their pioneering work in weight quantization for LLMs. Our work builds upon their insights and implementations.
