- Download our proposals here and put them under this folder:

      ./proposals/
      ├── coco/
      │   ├── images/
      │   │   ├── train2017/*.pkl
- Each .pkl file contains a list of numpy.ndarray objects with shapes [(n_1, 5), (n_2, 5), ..., (n_m, 5)].
- The i-th numpy.ndarray holds the n_i proposals, each given as [xmin, ymin, xmax, ymax, score], for one category of the proposal detector.
- Each _info.pkl file contains image information.
- We provide an example image-caption dataset in image_caption_final.json.
- A training vocabulary containing the objects of interest is in object_vocab.json.
- Download the ALBEF pre-trained checkpoint ALBEF.pth and put it under this folder.
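
To make the proposal file layout described above concrete, here is a minimal sketch of reading one proposal file. It assumes the file is a standard Python pickle that unpickles to a list of (n_i, 5) arrays, one per detector category. The function name `load_proposals` and the `score_thresh` parameter are hypothetical and not part of this repository.

```python
import pickle


def load_proposals(pkl_path, score_thresh=0.0):
    """Return (category_index, box, score) tuples from one proposal file.

    Assumption: the file unpickles to a list of (n_i, 5) arrays, one per
    detector category, with rows laid out as [xmin, ymin, xmax, ymax, score].
    """
    with open(pkl_path, "rb") as f:
        per_category = pickle.load(f)

    proposals = []
    for cat_idx, rows in enumerate(per_category):
        # Each row is a single proposal for category `cat_idx`.
        for xmin, ymin, xmax, ymax, score in rows:
            if score >= score_thresh:
                box = (float(xmin), float(ymin), float(xmax), float(ymax))
                proposals.append((cat_idx, box, float(score)))
    return proposals
```

A call might look like `load_proposals("./proposals/coco/images/train2017/example.pkl", score_thresh=0.5)`, where `example.pkl` is a placeholder file name. NumPy must be installed for pickled arrays to load, and since `pickle.load` can execute arbitrary code, only open proposal files from a source you trust.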