Opus audio decoding support #5802

vadimkantorov · 2025-02-03T12:27:26Z

Is opus codec (in ogg container - typically with .opus file extension) supported by DALI in audio decoding?

https://docs.nvidia.com/deeplearning/dali/user-guide/docs/examples/audio_processing/audio_decoder.html says that "procedure can be used for most of the well-known digital audio coding formats as well", and opus is very popular as it's produced by webrtc streams.

https://docs.nvidia.com/deeplearning/dali/user-guide/docs/operations/nvidia.dali.fn.decoders.audio.html says that "It supports the following audio formats: wav, flac and ogg". Does it mean ogg-container/format or ogg-codec? The wording is a bit ambiguous.

Thank you!

My original usecase: using https://github.com/triton-inference-server/dali_backend/ and DALI for performant audio decoding in Triton

The text was updated successfully, but these errors were encountered:

jantonguirao · 2025-02-03T13:26:27Z

Yes, DALI should support both OGG Vorbis and OGG Opus for audio decoding. I agree that the wording in the documentation is unclear. Thank you for bringing this to our attention; we will update the documentation to clarify this.

Please let us know if you find any issues with it.

jantonguirao · 2025-02-03T13:34:24Z

Here is the PR for the documentation changes

vadimkantorov · 2025-02-03T13:39:06Z

Also maybe worth dropping vague "most well-known audio codecs" wording

E.g. are mp3/aac/m4a/mka supported? Even if not supported - better list all supported codecs/containers (and maybe even provide a battery of example supported audio files) - and if a popular codec is not supported, better also list it explicitly, as people will have questions about codec support matrix anyways...

vadimkantorov · 2025-02-03T13:45:24Z

For that example speech commands example docs - maybe best put there a hyperlink to the codec support matrix? (As if you evolve it, this place would need to be modified as well - and it's easy to forget such stuff)

JanuszL assigned jantonguirao Feb 3, 2025

jantonguirao added the question Further information is requested label Feb 3, 2025

JanuszL added this to the Release_1.47.0 milestone Feb 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Opus audio decoding support #5802

Opus audio decoding support #5802

vadimkantorov commented Feb 3, 2025 •

edited

Loading

jantonguirao commented Feb 3, 2025

jantonguirao commented Feb 3, 2025 •

edited

Loading

vadimkantorov commented Feb 3, 2025

vadimkantorov commented Feb 3, 2025

Opus audio decoding support #5802

Opus audio decoding support #5802

Comments

vadimkantorov commented Feb 3, 2025 • edited Loading

jantonguirao commented Feb 3, 2025

jantonguirao commented Feb 3, 2025 • edited Loading

vadimkantorov commented Feb 3, 2025

vadimkantorov commented Feb 3, 2025

vadimkantorov commented Feb 3, 2025 •

edited

Loading

jantonguirao commented Feb 3, 2025 •

edited

Loading