To get started with compressing a VQA model, there are a few components that we need:
- Dataset and evaluation code: contains the dataset (questions, answers, images) and the official evaluation code
  – Google Drive [CDNNRIA/Datasets/VQA/vqa api] (enable link sharing).
- Image features: pre-extracted Faster R-CNN features are provided so you can focus solely on running the VQA model during training.
  – Google Drive [CDNNRIA/Datasets/VQA/vqa api/Features/trainval 36.h5].
- VQA model: pretrained DRAU model
  – Google Drive [CDNNRIA/Datasets/VQA/pretrained drau]
  – Paper: http://dx.doi.org/10.1016/j.cviu.2019.05.001 (also available on arXiv)
- VQA code: code to define, preprocess, and train the network
  – GitHub (WILL BE PUBLIC FOR RELEASE) [git@github.com:ahmedmagdiosman/compress-vqa.git]
  – SSH key (NOT NEEDED FOR PUBLIC RELEASE; everyone can clone with the included SSH key) [CDNNRIA/Datasets/VQA/github key/]
  – The code is also available on Google Drive [CDNNRIA/Datasets/VQA/vqa drau]
- Colab notebook: includes simple pruning for the VQA model
  – Google Drive [CDNNRIA/Datasets/VQA/vqa prune local.ipynb].
First, run the Colab notebook to make sure everything works correctly. The notebook takes care of mounting the data and copying everything to the right location. If you would like to try another pruning method, you can change the pruning algorithm directly in the notebook (sec. 2.6). For more complex compression, refer to the model architecture in the GitHub repo.
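As an illustration, here is a minimal sketch of a magnitude-pruning pass using PyTorch's torch.nn.utils.prune. The function name, the 30% sparsity level, and the restriction to Linear layers are assumptions for illustration, not the notebook's exact code:

import torch
import torch.nn.utils.prune as prune

def magnitude_prune(model: torch.nn.Module, amount: float = 0.3):
    # Zero out the `amount` fraction of smallest-magnitude weights
    # in every Linear layer of the model.
    for module in model.modules():
        if isinstance(module, torch.nn.Linear):
            prune.l1_unstructured(module, name="weight", amount=amount)
    # Fold the pruning masks back into the weight tensors so the model
    # can be saved and evaluated as a plain state dict afterwards.
    for module in model.modules():
        if isinstance(module, torch.nn.Linear):
            prune.remove(module, "weight")
    return model

Structured variants (e.g. prune.ln_structured) or per-layer sparsity levels can be swapped in at the same place in the notebook.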
Google Drive times out with large files or with directories containing a large number of files, which causes the code to exit unsuccessfully. For testing purposes, you can use a subset of the dataset by copying config_small.py over config.py in section 1.7.
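If you prefer to do this by hand rather than in the notebook cell, the copy amounts to the following (assuming both config files sit in the working directory of the cloned code):

import shutil
# Assumption: run from the directory that contains both config files.
shutil.copy("config_small.py", "config.py")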
The pip package list was extracted from the Colab environment. I recommend installing the packages into a fresh conda environment:
pip install --upgrade --force-reinstall -r colab_pip_req.txt
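After installation, an optional sanity check (not part of the repo) confirms that PyTorch is importable and that a GPU is visible:

import torch
# Print the installed PyTorch version and whether a CUDA device is visible.
print(torch.__version__)
print(torch.cuda.is_available())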
config.py contains the parameters that you can tweak for training. If you are fine-tuning, make sure to pass --RESUME and --RESUME_PATH to point to the pretrained network. It is also good practice to pin the GPU with CUDA_VISIBLE_DEVICES=GPU_ID.
To train:
CUDA_VISIBLE_DEVICES=0 python train_drau_glove.py
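Conceptually, resuming from the pretrained DRAU weights via --RESUME/--RESUME_PATH boils down to a standard PyTorch checkpoint load. A minimal sketch follows; the helper name and key layout are illustrative assumptions, so check train_drau_glove.py for the exact format:

import torch

def load_pretrained(model: torch.nn.Module, path: str) -> torch.nn.Module:
    # Load the checkpoint on CPU and restore the weights into the model.
    # The "state_dict" key is an assumption; some checkpoints store the
    # state dict directly.
    checkpoint = torch.load(path, map_location="cpu")
    state_dict = checkpoint.get("state_dict", checkpoint)
    model.load_state_dict(state_dict)
    return model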
This file contains the definition of the VQA model. I recommend starting from the forward() function and working your way down (top-down) to understand each component.
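When deciding what to compress, it can help to see where the parameters live. A generic PyTorch helper (not part of the repo) such as the following prints the leaf modules sorted by parameter count:

import torch

def summarize_parameters(model: torch.nn.Module):
    # Collect parameter counts for leaf modules only, then print the
    # largest layers first to spot good pruning/quantization targets.
    counts = []
    for name, module in model.named_modules():
        if not list(module.children()):
            n = sum(p.numel() for p in module.parameters())
            if n > 0:
                counts.append((n, name, type(module).__name__))
    for n, name, cls in sorted(counts, reverse=True):
        print(f"{name} ({cls}): {n:,} parameters")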
- Wojciech Samek (wojciech.samek@hhi.fraunhofer.de)
- Ahmed Osman (ahmed.osman@hhi.fraunhofer.de)