Added CUB2011 dataset #147

vadimkantorov · 2017-04-15T16:29:11Z

To test some metric learning methods, I created a wrapper for the CUB2011 dataset (inheriting from CIFAR10 for the download functionality and from ImageFolder for directory parsing).

The dataset is small, so it would be possible to serve it from memory, but I haven't needed that yet. I'm also not sure what the good practice is for integrity hashes, so I just included the whole-archive hash and a few image hashes.

Let me know what you think!
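
For readers following the integrity-hash question, here is a minimal sketch of how per-file MD5 checks against a list shaped like `train_list` might look. The helper names `file_md5` and `check_files` are hypothetical and are not part of torchvision or of this PR.

```python
import hashlib
from pathlib import Path


def file_md5(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_files(root, expected):
    """Return the relative paths that are missing or whose MD5 differs.

    `expected` is a list of [relative_path, md5] pairs, shaped like `train_list`.
    """
    bad = []
    for rel_path, md5 in expected:
        path = Path(root) / rel_path
        if not path.is_file() or file_md5(path) != md5:
            bad.append(rel_path)
    return bad
```

An empty return value from `check_files` means every listed file is present and matches. Hashing only a few images, as in this PR, catches a missing or truncated extraction but not corruption in files that are not listed.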

torchvision/datasets/cub2011.py

+from torchvision.datasets import ImageFolder
+from torchvision.datasets import CIFAR10
+
+class Cub2011(ImageFolder, CIFAR10):


szagoruyko · 2017-04-15T17:22:11Z

Can you add annotations?

vadimkantorov · 2017-04-16T18:28:50Z

I'll rename the class and fix the lint errors.

About the annotations, every image seems to have "15 Part Locations, 312 Binary Attributes, 1 Bounding Box".
Do we have an accepted format for bounding boxes? (x1-y1-x2-y2?)

I will check whether every image always has 15 part locations; that would simplify the encoding.
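
A check like the one described could be sketched as follows. It assumes the part annotations come as text lines of the form `<image_id> <part_id> <x> <y> <visible>`; that layout is an assumption here and is not confirmed anywhere in this thread. Both function names are hypothetical.

```python
from collections import Counter


def parts_per_image(lines):
    """Count annotation rows per image id.

    Assumes each non-blank line is '<image_id> <part_id> <x> <y> <visible>'.
    """
    counts = Counter()
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        counts[int(fields[0])] += 1
    return counts


def images_without_n_parts(lines, n=15):
    """Return the sorted image ids whose number of part rows differs from n."""
    return sorted(i for i, c in parts_per_image(lines).items() if c != n)
```

If `images_without_n_parts` returns an empty list for the whole file, the parts can be stored as a fixed-size array per image (for example 15 rows of x, y, visible) rather than a variable-length list.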

vadimkantorov · 2017-04-27T14:20:37Z

The VOC pull request indeed uses the "[xmin, ymin, xmax, ymax, ind]" format, so I'll stick to it.
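
As an illustration of that format, the sketch below converts a box given as (x, y, width, height) into `[xmin, ymin, xmax, ymax, ind]`. Several things are assumptions rather than facts from this thread: that the source annotations are lines of `<image_id> <x> <y> <width> <height>`, that `ind` is the class index, and that `xmax = x + width`. The function names are hypothetical.

```python
def parse_bounding_boxes(lines):
    """Map image_id -> (x, y, width, height).

    Assumes each line is '<image_id> <x> <y> <width> <height>'; other lines are skipped.
    """
    boxes = {}
    for line in lines:
        fields = line.split()
        if len(fields) != 5:
            continue
        boxes[int(fields[0])] = tuple(float(v) for v in fields[1:])
    return boxes


def to_voc_box(x, y, width, height, class_index):
    """Convert (x, y, width, height) to [xmin, ymin, xmax, ymax, ind].

    Assumes xmax = x + width; use x + width - 1 if inclusive pixel
    coordinates are wanted instead.
    """
    return [x, y, x + width, y + height, class_index]
```

Keeping the parsing and the conversion separate makes it easy to change the corner convention later without touching the file reader.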

torchvision/datasets/cub2011.py

+    tgz_md5 = '97eceeb196236b17998738112f37df78'
+
+    train_list = [
+        ['001.Black_footed_Albatross/Black_Footed_Albatross_0001_796111.jpg', '4c84da568f89519f84640c54b7fba7c2'],


torchvision/datasets/cub2011.py

+        ['001.Black_footed_Albatross/Black_Footed_Albatross_0001_796111.jpg', '4c84da568f89519f84640c54b7fba7c2'],
+        ['002.Laysan_Albatross/Laysan_Albatross_0001_545.jpg', 'e7db63424d0e384dba02aacaf298cdc0'],
+    ]
+    test_list = [


varunagrawal · 2019-03-12T19:52:59Z

Is this PR still relevant? @vadimkantorov seems to have deleted his torchvision fork.

vadimkantorov · 2019-03-12T20:08:41Z

I am not working on this. Please go ahead if you wish to pick this up (my original CUB2011 dataset code is here: https://github.com/vadimkantorov/metriclearningbench/blob/master/cub2011.py )

Some things you may want to look at:

Refactor the super-hacky class CUB2011(ImageFolder, CIFAR10) approach to reusing the dataset download steps. Maybe there's a better API for that now.
Actually implement loading of the "15 Part Locations, 312 Binary Attributes, 1 Bounding Box" annotations (but maybe CUB2011MetricLearning would suffice?)
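
One possible direction for the first point, sketched with the standard library only: keep the directory parsing in the dataset class and pass the integrity or download check in as a callable, so no second base class is needed. `Cub2011Sketch`, `find_classes`, and the `check_integrity` parameter are hypothetical names for illustration; this is not the torchvision API and not the code from this PR.

```python
import os


def find_classes(images_dir):
    """Return the sorted class folder names and a name -> index mapping."""
    classes = sorted(e.name for e in os.scandir(images_dir) if e.is_dir())
    return classes, {name: i for i, name in enumerate(classes)}


class Cub2011Sketch:
    """Hypothetical dataset: folder parsing only, with the integrity check injected."""

    def __init__(self, images_dir, check_integrity=None):
        # `check_integrity` is any callable that takes the directory and returns
        # a bool, so download and verification logic stays outside the class.
        if check_integrity is not None and not check_integrity(images_dir):
            raise RuntimeError("Dataset not found or corrupted.")
        self.classes, self.class_to_idx = find_classes(images_dir)
        self.samples = []
        for name in self.classes:
            class_dir = os.path.join(images_dir, name)
            for fname in sorted(os.listdir(class_dir)):
                if fname.lower().endswith(".jpg"):
                    path = os.path.join(class_dir, fname)
                    self.samples.append((path, self.class_to_idx[name]))

    def __len__(self):
        return len(self.samples)

    def __getitem__(self, index):
        # Returns (path, class_index); image decoding is left to the caller.
        return self.samples[index]
```

Composition avoids the method-resolution-order surprises of inheriting from two unrelated dataset classes, at the cost of a little duplicated folder-walking code.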

pmeier · 2021-06-28T07:46:28Z

@vadimkantorov

> I am not working on this.

Closing this in favor of #4126.

Added CUB2011 dataset

ec25f0e

szagoruyko reviewed Apr 15, 2017

alykhantejani suggested changes Sep 13, 2017

alykhantejani added the awaiting response label Sep 23, 2017

vishwakftw mentioned this pull request Oct 1, 2017

Added CUB200-2010 and 2011 version #279

Closed

shakeebmurtaza added a commit to shakeebmurtaza/vision that referenced this pull request Jun 28, 2021

New dataset added (Caltech-UCSD Birds 200) regarding issue pytorch#147

a32c057

shakeebmurtaza mentioned this pull request Jun 28, 2021

New dataset added (Caltech-UCSD Birds 200) regarding issue #147 #60829 #4126

Closed

pmeier closed this Jun 28, 2021
