
Depth GAN (DepGAN): Leveraging Depth Maps for Handling Occlusions and Transparency in Image Composition

DepGAN is a generative adversarial network tailored for image composition, capable of handling occlusion and transparency while compositing foreground and background images.




Code Setup

The code was developed and tested on:

- Ubuntu 22.04 / WSL
- Python 3.10.13
- TensorFlow 2.15.0
- CUDA 12.1
- AMD Ryzen 7 3800X
- NVIDIA GeForce RTX 3090

Datasets

We provide ShapeNet's bottle-basket dataset in the data/bottle folder, including depth estimations for each background image in the depth folder. To create your own dataset, we recommend using MiDaS to estimate the depth maps.
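As a starting point, the sketch below shows one way to batch-estimate depth maps with MiDaS via torch.hub. The folder layout (data/bottle/background as input, data/bottle/depth as output) and the .png extension are assumptions; adjust them to match your own dataset.

```python
# Sketch: estimating depth maps for background images with MiDaS
# (https://github.com/isl-org/MiDaS) via torch.hub.
# Input/output paths below are illustrative, not taken from this repo.
import cv2
import torch
from pathlib import Path

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
midas = torch.hub.load("intel-isl/MiDaS", "MiDaS_small").to(device).eval()
transform = torch.hub.load("intel-isl/MiDaS", "transforms").small_transform

src_dir = Path("data/bottle/background")   # assumed input folder
out_dir = Path("data/bottle/depth")        # depth folder the README expects
out_dir.mkdir(parents=True, exist_ok=True)

for img_path in sorted(src_dir.glob("*.png")):
    img = cv2.cvtColor(cv2.imread(str(img_path)), cv2.COLOR_BGR2RGB)
    batch = transform(img).to(device)
    with torch.no_grad():
        pred = midas(batch)
        # Resize the raw prediction back to the input resolution.
        pred = torch.nn.functional.interpolate(
            pred.unsqueeze(1), size=img.shape[:2],
            mode="bicubic", align_corners=False,
        ).squeeze()
    # Normalize to 0-255 and save as an 8-bit grayscale depth image.
    depth = cv2.normalize(pred.cpu().numpy(), None, 0, 255, cv2.NORM_MINMAX)
    cv2.imwrite(str(out_dir / img_path.name), depth.astype("uint8"))
```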

| Dataset | Size |
| --- | --- |
| Bottle-Basket | 13.1 MB |
| Chair-Desk | 81.0 MB |
| Glasses-Face | 33.4 MB |
| Aerial | NA |
| Real Car | NA |

Pretrained Models

| Dataset | Model | Size |
| --- | --- | --- |
| Bottle-Basket | bottle_basket_75 | 239.0 MB |
| Chair-Desk | chair_desk_164 | 239.0 MB |
| Glasses-Face | glasses_face_15 | 239.8 MB |
| Aerial | Coming soon... | NA |
| Real Car | Coming soon... | NA |

Environment

Please make sure you have CUDA 12.1 installed on your machine.

> conda create -n depgan python=3.10.13 -y && conda activate depgan
> pip install --extra-index-url https://pypi.nvidia.com tensorrt-bindings==8.6.1 tensorrt-libs==8.6.1
> pip install -U tensorflow[and-cuda]==2.15.0
> pip install -r requirements.txt
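After installation, a quick sanity check like the one below confirms that TensorFlow can see the GPU; if the device list comes back empty, revisit the CUDA setup.

```python
# Sanity check: verify the TensorFlow build and GPU visibility.
import tensorflow as tf

print(tf.__version__)                           # expect 2.15.0
print(tf.config.list_physical_devices("GPU"))   # expect at least one GPU entry
```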

Running Code

We have put everything together into two Jupyter notebooks: train.ipynb and predict.ipynb.

To train DepGAN, run train.ipynb. This trains the model and saves a checkpoint after every epoch in the logs/models folder (created automatically). A plot of generated samples is saved in the logs/plots folder (also created automatically).

NOTE: You can expect a 100-epoch training run to take about 10 minutes on an RTX 3090; we recommend training for 150 epochs.

To test DepGAN, run predict.ipynb; this generates the predictions and saves the output to logs/generated/{DATASET}.
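If you want to inspect a pretrained checkpoint outside the notebook, here is a minimal sketch, assuming the checkpoints load as standard Keras models and that you placed bottle_basket_75 under logs/models/; predict.ipynb remains the authoritative reference.

```python
# Minimal sketch: inspect a pretrained DepGAN checkpoint.
# The path is an assumption based on the logs/models layout above.
import tensorflow as tf

gen = tf.keras.models.load_model("logs/models/bottle_basket_75", compile=False)
gen.summary()  # check the expected input shapes before running predictions
```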

All configuration variables can be found and set in config.py, including options that control the GUI, dataset paths, and model hyperparameters. Please make sure the configuration matches your train/test run.
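For orientation, the excerpt below illustrates the kind of settings to expect; every name here is hypothetical, so check config.py itself for the real variable names and defaults.

```python
# Hypothetical excerpt illustrating the kind of settings in config.py.
# None of these names are taken from the actual file.
USE_GUI = False        # toggle the GUI mentioned above
DATASET = "bottle"     # selects data/{DATASET} and logs/generated/{DATASET}
EPOCHS = 150           # the README recommends 150 epochs
BATCH_SIZE = 16        # typical GAN batch size; tune to GPU memory
LEARNING_RATE = 2e-4   # a common Adam default for GAN training
```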
