
Try to speed up Whisper transcription inference #1539

Open
lfcnassif opened this issue Feb 23, 2023 · 2 comments · May be fixed by #2258

lfcnassif (Member) commented Feb 23, 2023
General recommendations:
https://pytorch.org/tutorials/recipes/recipes/tuning_guide.html

This one was found by @hauck-jvsh:
https://developer.nvidia.com/blog/accelerating-inference-up-to-6x-faster-in-pytorch-with-torch-tensorrt/

lfcnassif (Member, Author) commented:

Just had a simple idea that could bring some speedup. Currently we start 2 transcription processes per GPU. I thought about using 3, but GPU memory usage is already high: I see 20 GB used out of 24 GB. But maybe we could use Python threads to run 3 simultaneous transcriptions in the same Python process, reusing the same model loaded in memory instead of loading it once per process. GPU usage is already high, but maybe there is still room for some speedup.
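A minimal sketch of that shared-model idea, assuming the openai-whisper API (`whisper.load_model` / `model.transcribe`); the queue/worker names and `NUM_WORKERS` are illustrative, not existing IPED code:

```python
import queue
import threading

import whisper

NUM_WORKERS = 3  # 3 concurrent transcriptions sharing one loaded model

# Loaded once per process instead of once per worker process.
model = whisper.load_model("large-v3", device="cuda")

audio_queue = queue.Queue()
results = {}
results_lock = threading.Lock()

def worker():
    while True:
        path = audio_queue.get()
        if path is None:  # sentinel: stop this worker
            audio_queue.task_done()
            break
        # PyTorch releases the GIL while CUDA kernels run, so threads can
        # overlap GPU work without duplicating the model weights in memory.
        result = model.transcribe(path)
        with results_lock:
            results[path] = result["text"]
        audio_queue.task_done()

threads = [threading.Thread(target=worker, daemon=True) for _ in range(NUM_WORKERS)]
for t in threads:
    t.start()

# Usage: enqueue audio paths, then one None sentinel per worker and join.
for p in ["a.wav", "b.wav", "c.wav"]:
    audio_queue.put(p)
for _ in threads:
    audio_queue.put(None)
audio_queue.join()
```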

Running multiple transcriptions in inference batches is a common technique. But it would make the logic much more complex: we would have to group audios of similar duration, wait for them to accumulate, possibly group audios from the same client or from different ones, and decide how long to wait for more audios before closing a batch (see the sketch below).
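For illustration, a rough sketch of the duration-bucketing part of that logic; `MAX_BATCH`, `MAX_WAIT_SECONDS` and `BUCKET_SECONDS` are placeholder parameters, with no claim about where this would live in IPED:

```python
import time
from collections import defaultdict

MAX_BATCH = 8           # audios per inference batch
MAX_WAIT_SECONDS = 2.0  # how long to wait for more similar-length audios
BUCKET_SECONDS = 5      # audios within the same 5 s duration range share a bucket

# bucket key -> list of (audio path, enqueue time)
buckets = defaultdict(list)

def enqueue(path: str, duration: float) -> None:
    """Place an audio into the bucket matching its duration range."""
    key = int(duration // BUCKET_SECONDS)
    buckets[key].append((path, time.monotonic()))

def pop_ready_batch():
    """Return a batch that is either full or has waited long enough, else None."""
    now = time.monotonic()
    for key, items in buckets.items():
        full = len(items) >= MAX_BATCH
        expired = items and now - items[0][1] > MAX_WAIT_SECONDS
        if full or expired:
            batch = [path for path, _ in items[:MAX_BATCH]]
            del items[:MAX_BATCH]
            return batch
    return None
```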

@lfcnassif lfcnassif changed the title Try to optimize Wav2Vec2 transcription inference Try to speed up Whisper transcription inference Mar 12, 2024
lfcnassif (Member, Author) commented Apr 26, 2024

WhisperX uses batched inference (transcription of many audio parts at the same time) to speed up transcription by up to 10x on GPUs using just this technique. I think it is possible to change the WhisperX library to make it transcribe different audios at the same time using audio batches.
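For reference, this is the batched usage WhisperX documents in its README; today the batch is built from ~30 s segments of a single audio file, which is what the change proposed here would extend to segments coming from different audios:

```python
import whisperx

device = "cuda"
model = whisperx.load_model("large-v2", device, compute_type="float16")

audio = whisperx.load_audio("audio.mp3")
# batch_size controls how many ~30 s segments of this one audio are run
# through the model in a single forward pass.
result = model.transcribe(audio, batch_size=16)
print(result["segments"])
```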

@hauck-jvsh, since you know the transcription code and have already contributed fixes and improvements to it, would you like to help improve the WhisperX library (I just forked it into the sepinf-inc repo) and the IPED code to group audios of similar sizes before transcribing them? I think that would let us speed up our transcription service without the new hardware, whose purchase should take longer after the latest government budget restrictions...
