Missing audio after calling read_video() with `sec` as unit #3779

prabhat00155 · 2021-05-05T16:03:28Z

🐛 Bug

Calling read_video() on a video file returns audio tensor with shape[1] as 0, when start_pts and end_pts is passed with sec as unit.

To Reproduce

With pts as unit:

visual, audio, info = read_video(video_path, start_pts=10010, end_pts=15015, pts_unit='pts')
print('Visual:', visual.shape, 'Audio:', audio.shape, info)
write_video(
    'foo.mp4', video_array=visual, fps=info['video_fps'], audio_array=audio, audio_fps=info['audio_fps'],
    audio_codec='aac')

Output:

Visual: torch.Size([6, 256, 340, 3]) Audio: torch.Size([1, 5192]) {'video_fps': 29.97002997002997, 'audio_fps': 48000}

With sec as unit:

visual, audio, info = read_video(video_path, start_pts=0.3337, end_pts=0.5005, pts_unit='sec')
print('Visual:', visual.shape, 'Audio:', audio.shape, info)
write_video(
    'bar.mp4', video_array=visual, fps=info['video_fps'], audio_array=audio, audio_fps=info['audio_fps'],
    audio_codec='aac')

Output:

Visual: torch.Size([6, 256, 340, 3]) Audio: torch.Size([1, 0]) {'video_fps': 29.97002997002997, 'audio_fps': 48000}

Expected behavior

Similar audio output as returned with pts unit.

Environment

PyTorch version: 1.9.0.dev20210429
Is debug build: False
CUDA used to build PyTorch: None
ROCM used to build PyTorch: N/A

OS: macOS 11.3 (x86_64)
GCC version: Could not collect
Clang version: 12.0.5 (clang-1205.0.22.9)
CMake version: version 3.19.6

Python version: 3.9 (64-bit runtime)
Is CUDA available: False
CUDA runtime version: No CUDA
GPU models and configuration: No CUDA
Nvidia driver version: No CUDA
cuDNN version: No CUDA
HIP runtime version: N/A
MIOpen runtime version: N/A

Versions of relevant libraries:
[pip3] numpy==1.20.2
[pip3] torch==1.9.0.dev20210429
[pip3] torchvision==0.10.0a0+730c5e1
[conda] Could not collect

Additional context

cc @bjuncek

The text was updated successfully, but these errors were encountered:

prabhat00155 added bug module: video labels May 5, 2021

prabhat00155 self-assigned this May 6, 2021

This was referenced Jun 11, 2021

Fixed missing audio with video_reader backend #3934

Merged

Fixed missing audio with pyav backend #4064

Merged

prabhat00155 closed this as completed in #4064 Jul 25, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Missing audio after calling read_video() with `sec` as unit #3779

Missing audio after calling read_video() with `sec` as unit #3779

prabhat00155 commented May 5, 2021 •

edited by pytorch-probot bot

Loading

Missing audio after calling read_video() with sec as unit #3779

Missing audio after calling read_video() with sec as unit #3779

Comments

prabhat00155 commented May 5, 2021 • edited by pytorch-probot bot Loading

🐛 Bug

To Reproduce

Expected behavior

Environment

Additional context

Missing audio after calling read_video() with `sec` as unit #3779

Missing audio after calling read_video() with `sec` as unit #3779

prabhat00155 commented May 5, 2021 •

edited by pytorch-probot bot

Loading