Running inference over a large batch of audio files #22

Hi! Firstly, thank you so much for this incredible work!

I have been running the tiny.en model on a large number of wav files stored in a folder. I am currently parallelizing the work over a multi-core machine using GNU parallel, running the following command:

    find input_data/eng_wav_data -name "*.wav" | parallel 'time ./main -m models/ggml-tiny.en.bin -nt -f {} -t 1 > {.}.txt'

I found that the model is currently loaded once per wav file to be transcribed. Is there a way I can circumvent this and load the model only once? Any help would be appreciated. Thank you, and apologies if this issue has been resolved already.

ggerganov added the enhancement (New feature or request) and good first issue (Good for newcomers) labels on Oct 5, 2022.

ggerganov added a commit that referenced this issue on Oct 5, 2022:

Just added support for providing multiple input files: all specified files will be processed with a single model load.
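
A minimal sketch of the resulting workflow, assuming the updated main accepts the -f flag repeated once per file (the exact argument handling lives in the commit above): one process, one model load, many files.

    ./main -m models/ggml-tiny.en.bin -nt -t 1 \
        -f input_data/eng_wav_data/a.wav \
        -f input_data/eng_wav_data/b.wav \
        -f input_data/eng_wav_data/c.wav

For a whole folder, the find output from the original command could be collected into a single invocation (for example via xargs) instead of spawning one process, and hence one model load, per file.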

ggerganov added a commit that referenced this issue on Oct 7, 2022.

ggerganov added a commit that referenced this issue on Nov 7, 2022:

Can be used to partially process a recording. We now have enough options to process batches of files and to split a long file into multiple jobs.
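
Assuming the options referred to here are the -ot/--offset-t and -d/--duration flags, both taking values in milliseconds (the flag names are an assumption, not stated in this thread), a long recording can be split into fixed-size jobs like this:

    # process a long file as three 10-minute (600000 ms) chunks,
    # one ./main invocation per chunk (flag names assumed as above)
    for off in 0 600000 1200000; do
        ./main -m models/ggml-tiny.en.bin -f long_recording.wav \
            -ot $off -d 600000 > part_${off}.txt
    done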

anandijain pushed a commit to anandijain/whisper.cpp that referenced this issue on Apr 28, 2023.

anandijain pushed a commit to anandijain/whisper.cpp that referenced this issue on Apr 28, 2023:

Allows processing of the input audio to start at some offset from the beginning. Useful for splitting a long job into multiple tasks.
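
This combines naturally with the GNU parallel setup from the original post. A sketch under the same assumed flag names as above, fanning one long file out across cores as offset-based tasks:

    # one 5-minute (300000 ms) chunk per parallel task
    parallel './main -m models/ggml-tiny.en.bin -f long_recording.wav -ot {} -d 300000 > chunk_{}.txt' \
        ::: $(seq 0 300000 1500000)

In practice the offset grid would be derived from the file's actual duration, and chunks could overlap slightly so that words falling on a chunk boundary are not cut off.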

anandijain pushed a commit to anandijain/whisper.cpp that referenced this issue on Apr 28, 2023:

Can be used to partially process a recording.

jacobwu-b pushed a commit to jacobwu-b/Transcriptify-by-whisper.cpp that referenced this issue on Oct 24, 2023:

Can be used to partially process a recording.