Image Generation and Reconstruction with Convolutional Variational Autoencoder (VAE) in PyTorch

Implementation Details

This repository provides a PyTorch implementation of the standard Variational Autoencoder (VAE). The amortized inference model (encoder) is parameterized by a convolutional network, while the generative model (decoder) is parameterized by a transposed convolutional network. The approximate posterior is a fully factorized Gaussian, i.e. a Gaussian with diagonal covariance.
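The architecture described above can be sketched roughly as follows. This is a minimal illustration, not the code in this repository: the class name `ConvVAE`, the helper `vae_loss`, the layer widths, the standard normal prior, and the mean-squared-error reconstruction term are all assumptions chosen to keep the example short.

```python
import torch
from torch import nn
import torch.nn.functional as F


class ConvVAE(nn.Module):
    """Illustrative convolutional VAE for 3 x 64 x 64 images (hypothetical layer sizes)."""

    def __init__(self, latent_dim: int = 128):
        super().__init__()
        # Encoder: each stride-2 convolution halves the spatial size (64 -> 32 -> 16 -> 8).
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, 128, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.Flatten(),
        )
        # Two heads: mean and log-variance of the diagonal Gaussian posterior.
        self.fc_mu = nn.Linear(128 * 8 * 8, latent_dim)
        self.fc_logvar = nn.Linear(128 * 8 * 8, latent_dim)
        # Decoder: transposed convolutions double the spatial size (8 -> 16 -> 32 -> 64).
        self.fc_dec = nn.Linear(latent_dim, 128 * 8 * 8)
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(128, 64, kernel_size=3, stride=2, padding=1, output_padding=1), nn.ReLU(),
            nn.ConvTranspose2d(64, 32, kernel_size=3, stride=2, padding=1, output_padding=1), nn.ReLU(),
            nn.ConvTranspose2d(32, 3, kernel_size=3, stride=2, padding=1, output_padding=1), nn.Sigmoid(),
        )

    def encode(self, x):
        h = self.encoder(x)
        return self.fc_mu(h), self.fc_logvar(h)

    def reparameterize(self, mu, logvar):
        # Reparameterization trick: z = mu + sigma * eps, with eps ~ N(0, I).
        std = torch.exp(0.5 * logvar)
        return mu + std * torch.randn_like(std)

    def decode(self, z):
        h = self.fc_dec(z).view(-1, 128, 8, 8)
        return self.decoder(h)

    def forward(self, x):
        mu, logvar = self.encode(x)
        z = self.reparameterize(mu, logvar)
        return self.decode(z), mu, logvar


def vae_loss(recon, x, mu, logvar):
    """Negative ELBO per image: reconstruction error plus KL(q(z|x) || N(0, I))."""
    recon_loss = F.mse_loss(recon, x, reduction="sum") / x.size(0)
    # Closed-form KL divergence between a diagonal Gaussian and a standard normal.
    kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp()) / x.size(0)
    return recon_loss + kl
```

With a model like this, reconstruction is a full forward pass through `model(x)`, while generation only needs the decoder: draw `z = torch.randn(n, latent_dim)` from the prior and call `model.decode(z)`.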

This implementation supports model training on the CelebA dataset. Because the project serves as a proof of concept, the original images (178 x 218) are scaled and cropped to 64 x 64 in order to speed up training. For ease of access, the zip file containing the dataset can be downloaded from: https://s3-us-west-1.amazonaws.com/udacity-dlnfd/datasets/celeba.zip

The VAE model was evaluated on several downstream tasks, such as image reconstruction and image generation. Some sample results can be found in the Results section.

Figure 1: Visual Representation of VAE. Image source: LearnOpenCV

Requirements

Python >= 3.9
PyTorch >= 1.9

Installation Guide

$ git clone https://github.com/julian-8897/Conv-VAE-PyTorch.git
$ cd Conv-VAE-PyTorch
$ pip install -r requirements.txt

Usage

Training

To train the model, please modify the config.json configuration file, and run:

python train.py --config config.json

Resuming Training

To resume training of the model from a checkpoint, you can run the following command:

python train.py --resume path/to/checkpoint

Testing

To test the model, you can run the following command:

python test.py --resume path/to/checkpoint

Generated plots are stored in the 'Reconstructions' and 'Samples' folders.

Results

128 Latent Dimensions

Reconstructed Samples	Generated Samples

256 Latent Dimensions

Reconstructed Samples	Generated Samples

References

Original VAE paper "Auto-Encoding Variational Bayes" by Kingma & Welling: https://arxiv.org/abs/1312.6114
Various implementations of VAEs in PyTorch: https://github.com/AntixK/PyTorch-VAE
PyTorch template used in this project: https://github.com/victoresque/pytorch-template
A comprehensive introduction to VAEs: https://arxiv.org/pdf/1906.02691.pdf
