Skip to content

CVPR 2019 Translate-to-Recognize Networks for RGB-D Scene Recognition

Notifications You must be signed in to change notification settings

ownstyledu/Translate-to-Recognize-Networks

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Translate-to-Recognize Networks

Pytorch implementations of Translate-to-Recognize Networks for RGB-D Scene Recognition (CVPR 2019).

blockchain blockchain

Usage

  1. Download Reset18 pre-trained on Places dataset if necessary.

  2. Data processing.

    • We use ImageFolder format, i.e., [class1/images.., class2/images..], to store the data, use util.splitimages.py to help change the format if neccessary.
    • Use util.conc_modalities.py to concatenate each paired RGB and depth images to one image for more efficient data loading. An example is shown below(depth data is encoded using HHA format). blockchain
    • We provide links to download SUN RGB-D data in ImageFolder format and depth data has been encoded using HHA format.
    1. RGB and depth data is concatenated, link
    2. RGB and depth data is stored separately, link
    3. [Updated 2019.11.5] If the links above are not accessible, try (https://pan.baidu.com/s/1LZIF1hlT3k0oX76Ttp660w) The extraction code is: g5vp
  3. Configuration.
    Almost all the settings of experiments are configurable by the files in the config package.

  4. Train.
    python train.py or bash train.sh

  5. [!!New!!] New branch 'multi-gpu' has been uploaded, making losses calculated on each gpu for better balanced usage of multi gpus. You could use this version using this command:
    git clone -b multi-gpu https://github.com/ownstyledu/Translate-to-Recognize-Networks.git TrecgNet

  6. [!!New!!] In multi-gpu brach, we add more loss types in the training, e.g., GAN, pixel2pixel intensity. You could easily add these losses by modifying the config file.

  7. [!!New 2019.7.31!!] We added the fusion model, and reconstructed the code, including reusing the base model and making some modifications on decoder structure and config file. Due to the time limitation, we don't simultaneously update the multi-gpu branch. But you could still refer to it if you want.

Development Environment

  • NVIDIA TITAN XP
  • cuda 9.0
  • python 3.6.5
  • pytorch 0.4.1
  • torchvision 0.2.1
  • tensorboardX

Citation

Please cite the following paper if you feel this repository useful.

@inproceedings{du2019translate,
  title={Translate-to-Recognize Networks for RGB-D Scene Recognition},
  author={Du, Dapeng and Wang, Limin and Wang, Huiling and Zhao, Kai and Wu, Gangshan},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  pages={11836--11845},
  year={2019}
}

About

CVPR 2019 Translate-to-Recognize Networks for RGB-D Scene Recognition

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published