Add docs for audio processing #3222

stevhliu · 2021-11-05T23:07:59Z

This PR adds documentation for the Audio feature. It describes:

The difference between loading path and audio, as well as use-cases/best practices for each of them.
Resampling audio files with cast_column, and then calling ds[0]["audio"] to automatically decode and resample to the desired sampling rate.
Resampling with map.

Preview here, let me know if I'm missing anything!

lhoestq

Cool thanks :)
also pinging @anton-l @patrickvonplaten @albertvillanova

docs/source/audio_process.rst

lhoestq · 2021-11-09T11:36:11Z

Nice ! love it this way. I guess you can set this PR to "ready for review" ?

docs/source/audio_process.rst

patrickvonplaten

That's a great document - thanks a lot for putting all of this together!

I left some tips on how the preprocessing transformers code example could be a bit simplified.

In short 99% of the use cases when Audio datasets is used for transformers is either:

a) A pretrained speech model is fine-tuned

or:

b) A fine-tuned speech model is evaluated / used in inference

For both a) and b) the feature_extractor is always defined. So we should always advocate to use AutoFeatureExtractor.from_pretrained(...) here IMO.

For a) the tokenizer is not defined and has to be created as described in the docs currently. For b) the tokenizer is also defined so that one can directly use Wav2Vec2Processor.from_pretrained(...)

Hope that helps a bit :-)

docs/source/audio_process.rst

lhoestq

Looks all good to me now :)
Let us know if you have more comments or if it's ready to merge

anton-l

LGTM, great reference the transformers examples!

lhoestq · 2021-11-24T15:35:49Z

I guess we can merge this one now :)

✨ add docs for audio processing

8f6d041

stevhliu added the documentation Improvements or additions to documentation label Nov 5, 2021

stevhliu requested review from albertvillanova, patrickvonplaten, anton-l and lhoestq November 5, 2021 23:07

Steven added 2 commits November 5, 2021 16:10

add new doc to toctree

41d32fb

minor fixes

8b4fba8

lhoestq reviewed Nov 8, 2021

View reviewed changes

docs/source/audio_process.rst Outdated Show resolved Hide resolved

docs/source/audio_process.rst Show resolved Hide resolved

docs/source/audio_process.rst Show resolved Hide resolved

Steven added 2 commits November 8, 2021 09:43

add feedback from review

5b8d960

improve gif

3652521

stevhliu marked this pull request as ready for review November 9, 2021 17:01

patrickvonplaten reviewed Nov 10, 2021

View reviewed changes

docs/source/audio_process.rst Outdated Show resolved Hide resolved

patrickvonplaten reviewed Nov 10, 2021

View reviewed changes

docs/source/audio_process.rst Outdated Show resolved Hide resolved

docs/source/audio_process.rst Outdated Show resolved Hide resolved

patrickvonplaten reviewed Nov 10, 2021

View reviewed changes

docs/source/audio_process.rst Outdated Show resolved Hide resolved

patrickvonplaten reviewed Nov 10, 2021

View reviewed changes

add feedback from review

e75350e

lhoestq approved these changes Nov 12, 2021

View reviewed changes

anton-l approved these changes Nov 12, 2021

View reviewed changes

lhoestq merged commit a8f96b3 into huggingface:master Nov 24, 2021

stevhliu deleted the audio-docs branch November 24, 2021 16:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add docs for audio processing #3222

Add docs for audio processing #3222

stevhliu commented Nov 5, 2021 •

edited

Loading

lhoestq left a comment

lhoestq commented Nov 9, 2021

patrickvonplaten left a comment

lhoestq left a comment

anton-l left a comment

lhoestq commented Nov 24, 2021

Add docs for audio processing #3222

Add docs for audio processing #3222

Conversation

stevhliu commented Nov 5, 2021 • edited Loading

lhoestq left a comment

Choose a reason for hiding this comment

lhoestq commented Nov 9, 2021

patrickvonplaten left a comment

Choose a reason for hiding this comment

lhoestq left a comment

Choose a reason for hiding this comment

anton-l left a comment

Choose a reason for hiding this comment

lhoestq commented Nov 24, 2021

stevhliu commented Nov 5, 2021 •

edited

Loading