Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Access to youtube data labelled by the IDM #42

Open
roger-creus opened this issue Apr 21, 2024 · 4 comments
Open

Access to youtube data labelled by the IDM #42

roger-creus opened this issue Apr 21, 2024 · 4 comments
Labels
question Further information is requested

Comments

@roger-creus
Copy link

Hi!

I am wondering if you released the YouTube dataset with the labels given by the IDM. If so, how could we access it?

Thx!

@Miffyli
Copy link
Collaborator

Miffyli commented Apr 21, 2024

Hey. Unfortunately the original pretraining set of VPT was not released in any form. The only data that was released as part of VPT was the contractor data, and later the BASALT competition data, both of which are listed in the README.md.

@Miffyli Miffyli added the question Further information is requested label Apr 21, 2024
@brandonhoughton
Copy link
Collaborator

You may already know this but MineDojo released a YouTube index if you just want a large collection of Minecraft gameplay!

@roger-creus
Copy link
Author

Hey! Thanks for the info. The MineDojo dataset is nice but it's not action-labelled! It should be possible to use VPT's IDM to label it right?

@Miffyli
Copy link
Collaborator

Miffyli commented May 2, 2024

@roger-creus Potentially yes: VPT's IDM has generally quite robust to different preprocessing, but might not be perfect. You could also try different techniques to make use of the data (e.g., like what MineDojo did with MineCLIP, or what follow-up papers have done).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

3 participants