Releases: NeoZhangJianyu/llama.cpp

b2466

20 Mar 03:49
d26e8b6
increase igpu cluster limit (#6159)

b2460

19 Mar 02:06
flake.lock: Update

Flake lock file updates:

• Updated input 'nixpkgs':
    'github:NixOS/nixpkgs/9df3e30ce24fd28c7b3e2de0d986769db5d6225d' (2024-03-06)
  → 'github:NixOS/nixpkgs/d691274a972b3165335d261cc4671335f5c67de9' (2024-03-14)

b2437

15 Mar 11:54
46acb36
fix set main gpu error (#6073)

b2431

15 Mar 06:37
4755afd
llama : fix integer overflow during quantization (#6063)

b2409

13 Mar 03:03
306d34b
ci : remove tidy-review (#6021)

b2408

12 Mar 13:07
8030da7
ggml : reuse quantum structs across backends (#5943)

* ggml : reuse quant blocks across backends

ggml-ci

* ggml : define helper constants only for CUDA and SYCL

ggml-ci

* ggml : define helper quantum constants for SYCL

ggml-ci

b2407

12 Mar 12:19
184215e
ggml : fix UB in IQ2_S and IQ3_S (#6012)

b2405

12 Mar 04:12
5cdb371
grammar : fix unnecessarily retained pointer to rules (#6003)

b2351

06 Mar 02:44
652ca2b
compare-llama-bench.py : remove mul_mat_q (#5892)

b2343

05 Mar 06:25
29eee40
fix speculative decoding build on windows (#5874)