[Audio] Microphone Capture - Allow setting smaller chunk size for low latency #6526

virajkarandikar · 2023-11-21T17:11:59Z

I have searched to see if a similar issue already exists.

By default the streaming mic capture uses buffer/chunk size of 1 second. This adds a long latency in real time applications. Can the chunk size be made configurable/smaller?

Is your feature request related to a problem? Please describe.
Large buffer increases audio latency and makes application sluggish to use.

Describe the solution you'd like
Provide a parameter to configure chunk size when using streaming mic capture

Additional context
Add any other context or screenshots about the feature request here.

abidlabs · 2023-11-21T19:59:51Z

Hi @virajkarandikar can you provide sample code we can use to look at the issue?

virajkarandikar · 2023-11-23T12:38:37Z

Code is simple.

with gr.Blocks() as demo:
        audio = gr.Audio(streaming=True)

        def process_audio(audio):
            rate, data = audio
            print(f"rate: {rate}, samples: {len(data)}")

        audio.stream(process_audio, [audio], None)

Below is the log I get on console.

rate: 48000, samples: 24000
rate: 48000, samples: 48000
rate: 48000, samples: 24000
rate: 48000, samples: 24000
rate: 48000, samples: 24000
rate: 48000, samples: 24000

Log indicates - sample rate is 48000, channels is 1, chunk size varies between 24000 (0.5 sec) and 48000 (1 sec). This adds significant latency.

Also the uncompressed audio data at 48000Hz is streamed from the client to application and it adds some amount of network latency. My case model expects 16000 sample rate. So if I can specify sample rate for mic capture, it will reduce the amount of data transfer by 1/3rd. But for that I have filed another issue here #5848.

qianhuiliu · 2023-11-25T13:30:36Z

Hello, have you figured out how to do it? I have the same question.

virajkarandikar · 2023-11-29T03:34:51Z

Any update here?

gaborvecsei · 2024-01-04T13:42:31Z

I am also interested in this

virajkarandikar · 2024-01-25T19:20:49Z

Ping...

abidlabs · 2024-01-26T18:07:40Z

This is on our radar, but maybe will take a few weeks for us to get to as we have a lot of other issues we're tackling as well. We are happy to review any PRs if you'd like to contribute this fix.

cc @aliabid94

mcorroyer · 2024-05-23T18:24:59Z

I have the same issue, has it progressed?

adirajagopal · 2024-06-05T15:23:10Z

This is on our radar, but maybe will take a few weeks for us to get to as we have a lot of other issues we're tackling as well. We are happy to review any PRs if you'd like to contribute this fix.

cc @aliabid94

Hi, is there a specific part of the code base you could point to to suggest how we can reduce the chunk size of the stream? This would help with guiding the PR. Thanks!

JohanWork · 2024-08-30T08:09:21Z

@abidlabs could you point out where the chunk size is set? Happy to contribute

abidlabs · 2024-08-30T16:45:56Z

Actually, we've implemented this already in our 5.0-dev branch! Let me point you to the PR where you can install and try it out: #8941

Here's a simple transcription demo where you can set your own stream_every param: #8941 (comment)

abidlabs added the enhancement New feature or request label Nov 21, 2023

virajkarandikar changed the title ~~Microphone Capture - Allow setting smaller chunk size for low latency~~ [Audio] Microphone Capture - Allow setting smaller chunk size for low latency Nov 24, 2023

abidlabs closed this as completed Aug 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Audio] Microphone Capture - Allow setting smaller chunk size for low latency #6526

[Audio] Microphone Capture - Allow setting smaller chunk size for low latency #6526

virajkarandikar commented Nov 21, 2023

abidlabs commented Nov 21, 2023

virajkarandikar commented Nov 23, 2023 •

edited

Loading

qianhuiliu commented Nov 25, 2023

virajkarandikar commented Nov 29, 2023

gaborvecsei commented Jan 4, 2024

virajkarandikar commented Jan 25, 2024

abidlabs commented Jan 26, 2024

mcorroyer commented May 23, 2024

adirajagopal commented Jun 5, 2024

JohanWork commented Aug 30, 2024

abidlabs commented Aug 30, 2024

[Audio] Microphone Capture - Allow setting smaller chunk size for low latency #6526

[Audio] Microphone Capture - Allow setting smaller chunk size for low latency #6526

Comments

virajkarandikar commented Nov 21, 2023

abidlabs commented Nov 21, 2023

virajkarandikar commented Nov 23, 2023 • edited Loading

qianhuiliu commented Nov 25, 2023

virajkarandikar commented Nov 29, 2023

gaborvecsei commented Jan 4, 2024

virajkarandikar commented Jan 25, 2024

abidlabs commented Jan 26, 2024

mcorroyer commented May 23, 2024

adirajagopal commented Jun 5, 2024

JohanWork commented Aug 30, 2024

abidlabs commented Aug 30, 2024

virajkarandikar commented Nov 23, 2023 •

edited

Loading