video.py read_video_timestamps calculate pts without storing full frames #2202

mjunyent · 2020-05-11T16:13:48Z

To calculate the PTS the function is storing the full frame objects on memory. This makes it crash for longer videos as it can't fill everything. PTS can be calculated directly.

With this fix we don't call _read_from_stream. That's ok as most of the code deals with seeking which is not needed in this case, but video decoding is surrounded by a try/except that we're not doing here. I'm not sure if a try/except av.AVError should be added too as it's not done in the demux either. Also, do we really want to mute this exception?

sort pts

fmassa

Thanks a lot, this was an oversight I believe!

I do think that we had issues with AVError being raised for some cases though, so might be preferable to handle those exceptions in here as well.

My thinking is that if we have a corrupted file in the dataset, we should not break the training code right away but instead skip this file, which was handled in the previous implementation.

Let me know what you think

mjunyent · 2020-05-15T13:59:09Z

It makes sense. I don't have any problem in adding a check for AVError but then shouldn't we add one too when _can_read_timestamps_from_packets is True? Or _can_read_timestamps_from_packets ensures demux won't crash.

fmassa

Let's move forward with this as this is a net improvement I think.
But I think for a follow-up PR we should add another check for if the decoding fails, so that we can keep the behavior as before.

Can you send a follow-up PR?

fmassa · 2020-05-19T10:32:11Z

Thanks a lot!

* get pts directly instead of storing full frames to get pts later * fix linting * add initial pts value sort pts * catch decoding errors for read_video_timestamp

mjunyent added 3 commits May 11, 2020 18:07

get pts directly instead of storing full frames to get pts later

f5d84be

fix linting

e8465fd

add initial pts value

57380c0

sort pts

fmassa reviewed May 15, 2020

View reviewed changes

fmassa approved these changes May 19, 2020

View reviewed changes

fmassa merged commit e6b4078 into pytorch:master May 19, 2020

mjunyent mentioned this pull request May 27, 2020

video.py read_video_timestamps (follow-up PR #2202) #2268

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

video.py read_video_timestamps calculate pts without storing full frames #2202

video.py read_video_timestamps calculate pts without storing full frames #2202

mjunyent commented May 11, 2020

fmassa left a comment

mjunyent commented May 15, 2020

fmassa left a comment

fmassa commented May 19, 2020

video.py read_video_timestamps calculate pts without storing full frames #2202

video.py read_video_timestamps calculate pts without storing full frames #2202

Conversation

mjunyent commented May 11, 2020

fmassa left a comment

Choose a reason for hiding this comment

mjunyent commented May 15, 2020

fmassa left a comment

Choose a reason for hiding this comment

fmassa commented May 19, 2020