Skip to content

Pull requests: ggerganov/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

server : use httplib status codes examples server
#11624 opened Feb 3, 2025 by danbev Loading…
HIP: force max threads per block to be 1024 ggml changes relating to the ggml tensor library for machine learning
#11621 opened Feb 3, 2025 by fxzjshm Loading…
HIP: add doc on small default launch bounds documentation Improvements or additions to documentation
#11619 opened Feb 3, 2025 by fxzjshm Loading…
ci: add bash script to check if llama-impl.h was included erroneously devops improvements to build systems and github actions script Script related
#11617 opened Feb 3, 2025 by mofosyne Loading…
Clean up Test Script + Update it to work on Instruct Tuned Models examples SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#11610 opened Feb 3, 2025 by Mr-Thack Loading…
tool-call: command r7b fix for normal responses examples python python script changes server testing Everything test related
#11608 opened Feb 3, 2025 by ochafik Loading…
tool-call: fix DeepSeek R1 Qwen distill (WIP) examples python python script changes server testing Everything test related
#11607 opened Feb 3, 2025 by ochafik Draft
2 of 8 tasks
Update common.cpp
#11605 opened Feb 2, 2025 by magicse Loading…
Change umlaut test python python script changes
#11600 opened Feb 2, 2025 by n00b001 Loading…
scripts: added inline script metadata per PEP 723 python python script changes script Script related
#11597 opened Feb 2, 2025 by isaac-mcfadyen Loading…
vulkan: add specific MMV kernels for IQ2 and IQ3 quants + optimizations devops improvements to build systems and github actions ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#11595 opened Feb 2, 2025 by remyoudompheng Draft
vulkan: add environment variable to avoid VRAM allocation ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#11592 opened Feb 2, 2025 by wbruna Loading…
try optimize llama_mmap::impl::impl
#11589 opened Feb 2, 2025 by lexasub Draft
NUMA-aware KV cache buffer type (experimental) ggml changes relating to the ggml tensor library for machine learning
#11580 opened Feb 1, 2025 by fairydreaming Draft
Load all MoE experts during warmup
#11571 opened Feb 1, 2025 by fairydreaming Loading…
Update CMakeLists.txt examples server
#11558 opened Jan 31, 2025 by magicse Loading…
Add support for Deepseek-R1 flash attention ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#11557 opened Jan 31, 2025 by siddartha-RE Loading…
vulkan: use smaller combined allocations to avoid fragmentation ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#11551 opened Jan 31, 2025 by jeffbolznv Loading…
vulkan: initial support for IQ1_S and IQ1_M quantizations devops improvements to build systems and github actions ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#11528 opened Jan 30, 2025 by remyoudompheng Draft
ProTip! Type g i on any issue or pull request to go back to the issue listing page.