Skip to content

Issues: mit-han-lab/smoothquant

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

How to use this for custom model?
#98 opened Dec 7, 2024 by siddagra
Why only 4 layers?
#95 opened Sep 7, 2024 by VincentXWD
Support for Qwen2
#94 opened Jul 31, 2024 by JiaXinLI98
How to quantize llama3?
#92 opened Jul 22, 2024 by jpyo0803
quantify other models,
#90 opened Jul 9, 2024 by AlexMa0
best Alpha value for Qwen 1.5 72B
#89 opened Jun 26, 2024 by Riskin1999
Huggingface_Hub Issue
#87 opened May 23, 2024 by faize5
Can SmoothQuant be used on ViT models?
#86 opened Apr 24, 2024 by n9s8a
Inquiry about Int8 BMM overflow
#84 opened Apr 9, 2024 by luzai
adjust activations
#80 opened Mar 28, 2024 by muzi0111
Weight migration for Llama?
#77 opened Mar 14, 2024 by atyshka
Question about code
#76 opened Mar 6, 2024 by Lucky-Lance
ProTip! Follow long discussions with comments:>50.