Signed-off-by: changwangss <chang1.wang@intel.com>
⚡ Required checks status: All passing 🟢
Groups summary
🟢 Format Scan Tests workflow
These checks are required after the changes to
🟢 Optimize Unit Test workflow
These checks are required after the changes to
🟢 NeuralChat Unit Test
These checks are required after the changes to
🟢 Chat Bot Test workflow
These checks are required after the changes to
Thank you for your contribution! 💜
LGTM
After this PR, can we load an HF GPTQ/AWQ model on both CPU and GPU for inference?
Signed-off-by: Wang, Chang <chang1.wang@intel.com>
nf4, fp4, and int4_fullrange are waiting for their kernels to be ready.
Type of Change
Code Changes
RTN to Rtn, AWQ to Awq, TEQ to Teq, GPTQ no change, AUTOROUND to AutoRound
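For callers that still pass the old upper-case names, the renames listed under Code Changes can be sketched as a small lookup. This is only an illustration: `RENAMED_ALGORITHMS` and `normalize_algorithm` are hypothetical names and are not part of this PR or of the library's API.

```python
# Hypothetical helper (not from the PR): maps old upper-case algorithm
# names to the casing listed under "Code Changes".
RENAMED_ALGORITHMS = {
    "RTN": "Rtn",
    "AWQ": "Awq",
    "TEQ": "Teq",
    "GPTQ": "GPTQ",  # no change
    "AUTOROUND": "AutoRound",
}


def normalize_algorithm(name: str) -> str:
    """Return the new-style name for an old- or new-style algorithm name."""
    key = name.strip().upper()
    if key not in RENAMED_ALGORITHMS:
        raise ValueError(f"unknown quantization algorithm: {name!r}")
    return RENAMED_ALGORITHMS[key]
```

Because the lookup is case-insensitive, both `normalize_algorithm("RTN")` and `normalize_algorithm("Rtn")` resolve to the same new-style name, which would let old configs keep working during a migration.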
Local validation
Rtn
Awq
Teq
GPTQ
AutoRound
Description
detail description
JIRA ticket: xxx
Expected Behavior & Potential Risk
the expected behavior triggered by this PR
How has this PR been tested?
how to reproduce the test (including hardware information)
Dependency Change?
any library dependency introduced or removed