Implement '--keep-split' to quantize model into several shards #10947

Annotations

1 error

windows-latest-cmake (avx512, -DLLAMA_NATIVE=OFF -DLLAMA_BUILD_SERVER=ON -DLLAMA_AVX512=ON -DBUIL...

succeeded Apr 23, 2024 in 21m 21s