Depth GAN (DepGAN): Leveraging Depth Maps for Handling Occlusions and Transparency in Image Composition

DepGAN is a generative adversarial network tailored for image composition, and capable of handling occlusion and transparency while combining foreground and background images together.

Code Setup

The code was developed and tested on:

Ubuntu 22.04 / WSL
Python 3.10.13
Tensorflow 2.15.0
Cuda 12.1
Ryzen 7 3800x
Nvidia RTX 3090

Datasets

We have provided Shapenet's basket-bottle dataset in the data/bottle folder. Including the depth estimations for each background image in the depth folder. To create your own dataset, we recommend using MiDaS to estimate the depth images.

Dataset	Size
Bottle-Basket	13.1 MB
Chair-Desk	81.0 MB
Glasses-Face	33.4 MB
Aerial	NA
Real Car	NA

Pretrained Models

Dataset	Model	Size
Bottle-Basket	bottle_basket_75	239.0 MB
Chair-Desk	chair_desk_164	239.0 MB
Glasses-Face	glasses_face_15	239.8 MB
Aerial	Coming soon...	NA
Real Car	Coming soon...	NA

Environment

Please make sure you have Cuda 12.1 installed on your machine.

> conda create -n depgan python=3.10.13 -y && conda activate depgan
> pip install --extra-index-url https://pypi.nvidia.com tensorrt-bindings==8.6.1 tensorrt-libs==8.6.1
> pip install -U tensorflow[and-cuda]==2.15.0
> pip install -r requirements.txt

Running Code

We have put everything together into 2 jupyter notebooks, train.ipynb and predict.ipynb

To train depgan, run train.ipynb. This trains and saves the model at every epoch in the logs/models folder (will be automatically created). A plot with generated samples will be saved in the logs/plots folder (will be automatically created).

NOTE: You can expect the training run to take 10 minutes for 100 epochs on a 3090, we recommend 150 epochs.

To test depgan, run predict.ipynb, this will generate the predictions and save the output to logs/generated/{DATASET}.

All variables can be found and set in config.py. These include config variables to control GUI, dataset paths, and model hyperparameters. Please make sure you have the correct configuration for your train/test run.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Depth GAN (DepGAN): Leveraging Depth Maps for Handling Occlusions and Transparency in Image Composition

Code Setup

Datasets

Pretrained Models

Environment

Running Code

Files

README.md

Latest commit

History

README.md

File metadata and controls

Depth GAN (DepGAN): Leveraging Depth Maps for Handling Occlusions and Transparency in Image Composition

Code Setup

Datasets

Pretrained Models

Environment

Running Code