This repository has been archived by the owner on Oct 25, 2024. It is now read-only.

Improve WOQ model saving and loading #4355

call-inference-llama-2-7b-chat-hf / inference test

succeeded Mar 18, 2024 in 2m 34s