
Backport the performance improvement from llama.cpp #709

Closed
kovaacs opened this issue Apr 1, 2023 · 2 comments
Labels
performance CPU and memory usage - results and comparisons

Comments

@kovaacs

kovaacs commented Apr 1, 2023

It would be very cool if the performance improvements from ggerganov/llama.cpp#613 could be backported to this repo.

I couldn't find an existing issue for this; if there is one, I'm happy to close this.

@edwios

edwios commented Apr 2, 2023

Related: #702

Since whisper.cpp shares the same ggml.* files with llama.cpp, I think it is just one step away: convert the whisper model from the current ggml format into the new ggjt format.

@ggerganov
Owner

Just updated ggml and the performance should be much better.
I observe a more than 2x overall improvement on Apple Silicon.

No need to generate new models; everything should just work.

@ggerganov ggerganov added the performance CPU and memory usage - results and comparisons label Apr 10, 2023
darth-vader-lg added a commit to darth-vader-lg/whisper.cpp that referenced this issue Apr 16, 2023
jacobwu-b pushed a commit to jacobwu-b/Transcriptify-by-whisper.cpp that referenced this issue Oct 24, 2023
- About 2x overall performance improvement on Apple Silicon
- Results should now be the same for different numbers of threads (not tested)
landtanin pushed a commit to landtanin/whisper.cpp that referenced this issue Dec 16, 2023
- About 2x overall performance improvement on Apple Silicon
- Results should now be the same for different numbers of threads (not tested)
iThalay pushed a commit to iThalay/whisper.cpp that referenced this issue Sep 23, 2024
- About 2x overall performance improvement on Apple Silicon
- Results should now be the same for different numbers of threads (not tested)