Images no longer available #3
Comments
Any news here?
Nothing ever came of this. I emailed the paper authors directly as well.
@cmishra Did the authors reply to you? Thanks.
Nope.
Hi,
I've tried to use this toolbox to download the full set of images. I've gotten ~4.5 million images, but a significant portion is still missing (distributed roughly evenly over all 100 data subsets).
On manual inspection, many of the missing images are no longer hosted, while a minority sit behind bot protection that simply adding a request header didn't get past. More seem to fall into the former category.
Are there any alternate sources for missing images? A torrent or tarball download? If not, can we at least temporarily set one up? It would be relatively inexpensive.
If I can get a copy of the dataset, I may be able to convince certain folks to host it long term.
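For anyone triaging their own failed downloads, here is a minimal sketch of the split described above (images that are gone versus hosts that block bots). It is not part of the toolbox: the function names, the status-code groupings, and the User-Agent value are all assumptions for illustration, and real hosts may respond differently.

```python
import urllib.error
import urllib.request
from collections import Counter

# Assumed groupings; adjust to what the hosts actually return.
GONE = {404, 410}          # image no longer hosted
BLOCKED = {401, 403, 429}  # likely bot protection or rate limiting


def probe(url, timeout=10):
    """Return the HTTP status for url, or None if the host is unreachable."""
    req = urllib.request.Request(
        url,
        headers={"User-Agent": "Mozilla/5.0"},  # the "simple header addition"
        method="HEAD",
    )
    try:
        with urllib.request.urlopen(req, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        return err.code
    except OSError:  # URLError, timeouts, DNS failures
        return None


def classify_failure(status):
    """Map a status code (or None) to a coarse bucket."""
    if status is None:
        return "unreachable"
    if status in GONE:
        return "gone"
    if status in BLOCKED:
        return "blocked"
    if 200 <= status < 300:
        return "ok"
    return "other"


def summarize(statuses):
    """Count how many statuses fall into each bucket."""
    return Counter(classify_failure(s) for s in statuses)
```

Running `summarize(probe(u) for u in failed_urls)` over a sample of failed URLs (`failed_urls` being your own list) gives a rough count per bucket, which helps show how much of the gap a mirror would need to cover.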