This repository has been archived by the owner on Jun 24, 2024. It is now read-only.

Support GGJT v3 #252

Closed
philpax opened this issue May 19, 2023 · 2 comments
Labels: issue:enhancement (New feature or request), meta:breaking-change (Will require action on behalf of the developer)
Comments

philpax (Collaborator) commented May 19, 2023

There is a new quantization level in llama.cpp, which means models using it will be published in the near future. We will need to support this.

ggerganov/llama.cpp#1508

philpax added the issue:enhancement (New feature or request) label on May 19, 2023
philpax added this to the 0.2 milestone on May 20, 2023
philpax (Collaborator, Author) commented May 21, 2023

To clarify: this has landed as v3 of the format with quantization level 2. We don't currently support it, as we're still working out whether there's a good way to handle the breaking changes, but we're likely to update to it shortly.
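As a minimal sketch of what distinguishing format versions looks like on the loader side, the snippet below reads the GGJT header from a byte slice. It assumes the well-known GGJT layout (a 4-byte magic, `0x67676a74`, written little-endian, followed by a little-endian `u32` version field); the `ggjt_version` helper is our own illustration, not an API from the llm crate or llama.cpp.

```rust
/// Hypothetical helper: extract the GGJT format version from the first
/// eight bytes of a model file, or return None if it isn't a GGJT file.
fn ggjt_version(header: &[u8]) -> Option<u32> {
    if header.len() < 8 {
        return None;
    }
    // The magic 0x67676a74 ("ggjt") is stored little-endian on disk.
    let magic = u32::from_le_bytes(header[0..4].try_into().ok()?);
    if magic != 0x6767_6a74 {
        return None; // not a GGJT file
    }
    // The version (1, 2, or 3) immediately follows the magic.
    Some(u32::from_le_bytes(header[4..8].try_into().ok()?))
}

fn main() {
    // A fabricated v3 header: "ggjt" magic bytes, then version 3.
    let header = [0x74, 0x6a, 0x67, 0x67, 3, 0, 0, 0];
    println!("{:?}", ggjt_version(&header));
}
```

A loader that dispatches on this version is what makes the bump a breaking change: v3 files quantized with the new level cannot be decoded by a v2-only reader.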

philpax changed the title from "Support quantization level 2" to "Support GGJT v3" on May 21, 2023
philpax added the meta:breaking-change (Will require action on behalf of the developer) label on May 22, 2023
philpax closed this as completed on May 25, 2023
philpax (Collaborator, Author) commented May 25, 2023

This is now done, with the caveat that all of your v2 models will now be busted. #261 is the plan for that.
