Skip to content

Improve cuBLAS performance by dequantizing on the GPU#1065

Merged
slaren merged 4 commits intoggerganov:masterfrom slaren:cuda-dqApr 20, 2023