Adding IQ2_S and IQ2_M to complete coverage of the 2-3 bit quantizati… #2

sorasoras · 2024-02-26T16:44:26Z

Adding IQ2_S and IQ2_M as a single cumulative commit
Update examples/quantize/quantize.cpp

…on range (#5721) * Adding IQ2_S and IQ2_M as a single cumulative commit * Update examples/quantize/quantize.cpp Co-authored-by: Georgi Gerganov <ggerganov@gmail.com> --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com> Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

* vulkan : do not use tensor->extra This patch allows using the Vulkan backend with the RPC backend as tensor->extra is no longer used. Ref: ggerganov#8536 * Adapt GGML_VULKAN_CHECK_RESULTS to extra removal (#2) --------- Co-authored-by: 0cc4m <picard12@live.de>

sorasoras closed this Feb 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding IQ2_S and IQ2_M to complete coverage of the 2-3 bit quantizati… #2

Adding IQ2_S and IQ2_M to complete coverage of the 2-3 bit quantizati… #2

sorasoras commented Feb 26, 2024

Adding IQ2_S and IQ2_M to complete coverage of the 2-3 bit quantizati… #2

Adding IQ2_S and IQ2_M to complete coverage of the 2-3 bit quantizati… #2

Conversation

sorasoras commented Feb 26, 2024