Improve cuBLAS performance by dequantizing on the GPU #1065
Merged
slaren merged 4 commits into ggerganov:master from slaren:cuda-dq on Apr 20, 2023