Skip to content

feat(llama.cpp): expose cache_type_k and cache_type_v for quant of kv cache#4329

Merged
mudler merged 2 commits intomasterfrom feat/llama.cpp-quantcacheDec 6, 2024

Commits

Commits on Dec 6, 2024