GAN Compression

project | paper | demo

[NEW!] The lite pipeline (GAN Compression Lite) is updated, which could produce comparable results as the full pipeline! The lite version of map2sat is released!

[NEW!] The lite pipeline (GAN Compression Lite) is released! Check the tutorial for the pipeline.

[NEW!] GauGAN training code and tutorial is released! Check the tutorial to compress GauGAN.

[NEW!] Correct metric naming and update the evaluation protocol. Support MACs budget for searching.

[NEW!] The compressed model and test codes of GauGAN are released! Check here to use our models.

We introduce GAN Compression, a general-purpose method for compressing conditional GANs. Our method reduces the computation of widely-used conditional GAN models, including pix2pix, CycleGAN, and GauGAN, by 9-21x while preserving the visual fidelity. Our method is effective for a wide range of generator architectures, learning objectives, and both paired and unpaired settings.

GAN Compression: Efficient Architectures for Interactive Conditional GANs
Muyang Li, Ji Lin, Yaoyao Ding, Zhijian Liu, Jun-Yan Zhu, and Song Han
MIT, Adobe Research, SJTU
In CVPR 2020.

Demos

Overview

GAN Compression framework: ① Given a pre-trained teacher generator G', we distill a smaller “once-for-all” student generator G that contains all possible channel numbers through weight sharing. We choose different channel numbers for the student generator G at each training step. ② We then extract many sub-generators from the “once-for-all” generator and evaluate their performance. No retraining is needed, which is the advantage of the “once-for-all” generator. ③ Finally, we choose the best sub-generator given the compression ratio target and performance target (FID or mIoU), perform fine-tuning, and obtain the final compressed model.

Colab Notebook

PyTorch Colab notebook: CycleGAN and pix2pix.

Prerequisites

Linux
Python 3
CPU or NVIDIA GPU + CUDA CuDNN

Getting Started

Installation

Clone this repo:

git clone git@github.com:mit-han-lab/gan-compression.git
cd gan-compression

Install PyTorch 1.4 and other dependencies (e.g., torchvision).
- For pip users, please type the command pip install -r requirements.txt.
- For Conda users, we provide an installation script scripts/conda_deps.sh. Alternatively, you can create a new Conda environment using conda env create -f environment.yml.

CycleGAN

Setup

Download the CycleGAN dataset (e.g., horse2zebra).

bash datasets/download_cyclegan_dataset.sh horse2zebra

Get the statistical information for the ground-truth images for your dataset to compute FID. We provide pre-prepared real statistic information for several datasets. For example,
```
bash datasets/download_real_stat.sh horse2zebra A
bash datasets/download_real_stat.sh horse2zebra B
```

Apply a Pre-trained Model

Download the pre-trained models.

python scripts/download_model.py --model cyclegan --task horse2zebra --stage full
python scripts/download_model.py --model cyclegan --task horse2zebra --stage compressed

Test the original full model.

bash scripts/cycle_gan/horse2zebra/test_full.sh

Test the compressed model.

bash scripts/cycle_gan/horse2zebra/test_compressed.sh

Measure the latency of the two models.

bash scripts/cycle_gan/horse2zebra/latency_full.sh
bash scripts/cycle_gan/horse2zebra/latency_compressed.sh

Pix2pix

Setup

Download the pix2pix dataset (e.g., edges2shoes).

bash datasets/download_pix2pix_dataset.sh edges2shoes-r

Get the statistical information for the ground-truth images for your dataset to compute FID. We provide pre-prepared real statistics for several datasets. For example,
```
bash datasets/download_real_stat.sh edges2shoes-r B
```

Apply a Pre-trained Model

Download the pre-trained models.

python scripts/download_model.py --model pix2pix --task edges2shoes-r --stage full
python scripts/download_model.py --model pix2pix --task edges2shoes-r --stage compressed

Test the original full model.

bash scripts/pix2pix/edges2shoes-r/test_full.sh

Test the compressed model.

bash scripts/pix2pix/edges2shoes-r/test_compressed.sh

Measure the latency of the two models.

bash scripts/pix2pix/edges2shoes-r/latency_full.sh
bash scripts/pix2pix/edges2shoes-r/latency_compressed.sh

GauGAN

Setup

Prepare the cityscapes dataset. Check here for preparing the cityscapes dataset.
Get the statistical information for the ground-truth images for your dataset to compute FID. We provide pre-prepared real statistics for several datasets. For example,
```
bash datasets/download_real_stat.sh cityscapes A
```

Apply a Pre-trained Model

Download the pre-trained models.

python scripts/download_model.py --model gaugan --task cityscapes --stage full
python scripts/download_model.py --model gaugan --task cityscapes --stage compressed

Test the original full model.

bash scripts/gaugan/cityscapes/test_full.sh

Test the compressed model.

bash scripts/gaugan/cityscapes/test_compressed.sh

Measure the latency of the two models.

bash scripts/gaugan/cityscapes/latency_full.sh
bash scripts/gaugan/cityscapes/latency_compressed.sh

Cityscapes Dataset

For the Cityscapes dataset, we cannot provide it due to license issue. Please download the dataset from https://cityscapes-dataset.com and use the script prepare_cityscapes_dataset.py to preprocess it. You need to download gtFine_trainvaltest.zip and leftImg8bit_trainvaltest.zip and unzip them in the same folder. For example, you may put gtFine and leftImg8bit in database/cityscapes-origin. You need to prepare the dataset with the following commands:

python datasets/get_trainIds.py database/cityscapes-origin/gtFine/
python datasets/prepare_cityscapes_dataset.py \
--gtFine_dir database/cityscapes-origin/gtFine \
--leftImg8bit_dir database/cityscapes-origin/leftImg8bit \
--output_dir database/cityscapes \
--table_path datasets/table.txt

You will get a preprocessed dataset in database/cityscapes and a mapping table (used to compute mIoU) in dataset/table.txt.

To support mIoU computation, you need to download a pre-trained DRN model drn-d-105_ms_cityscapes.pth from http://go.yf.io/drn-cityscapes-models. By default, we put the drn model in the root directory of our repo. Then you can test our compressed models on cityscapes after you have downloaded our compressed models.

Training

Please refer to our training tutorial on how to train models on our datasets and your own.

FID Computation

To compute the FID score, you need to get some statistical information from the groud-truth images of your dataset. We provide a script get_real_stat.py to extract statistical information. For example, for the edges2shoes dataset, you could run the following command:

python get_real_stat.py \
--dataroot database/edges2shoes-r \
--output_path real_stat/edges2shoes-r_B.npz \
--direction AtoB

For paired image-to-image translation (pix2pix and GauGAN), we calculate the FID between generated test images to real test images. For unpaired image-to-image translation (CycleGAN), we calculate the FID between generated test images to real training+test images. This allows us to use more images for a stable FID evaluation, as done in previous unconditional GANs research. The difference of the two protocols is small. The FID of our compressed CycleGAN model increases by 4 when using real test images instead of real training+test images.

Code Structure

To help users better understand and use our code, we briefly overview the functionality and implementation of each package and each module.

Citation

If you use this code for your research, please cite our paper.

@inproceedings{li2020gan,
  title={GAN Compression: Efficient Architectures for Interactive Conditional GANs},
  author={Li, Muyang and Lin, Ji and Ding, Yaoyao and Liu, Zhijian and Zhu, Jun-Yan and Han, Song},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year={2020}
}

Acknowledgements

Our code is developed based on pytorch-CycleGAN-and-pix2pix and SPADE.

We also thank pytorch-fid for FID computation and drn for mIoU computation.

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
configs		configs
data		data
datasets		datasets
distillers		distillers
docs		docs
imgs		imgs
metric		metric
models		models
options		options
scripts		scripts
supernets		supernets
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
cycle_gan.ipynb		cycle_gan.ipynb
distill.py		distill.py
environment.yaml		environment.yaml
export.py		export.py
get_real_stat.py		get_real_stat.py
get_test_opt.py		get_test_opt.py
latency.py		latency.py
merge.py		merge.py
pix2pix.ipynb		pix2pix.ipynb
remove_spectral_norm.py		remove_spectral_norm.py
requirements.txt		requirements.txt
search.py		search.py
search_multi.py		search_multi.py
select_arch.py		select_arch.py
test.py		test.py
tox.ini		tox.ini
train.py		train.py
train_supernet.py		train_supernet.py
trainer.py		trainer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GAN Compression

project | paper | demo

Demos

Overview

Colab Notebook

Prerequisites

Getting Started

Installation

CycleGAN

Setup

Apply a Pre-trained Model

Pix2pix

Setup

Apply a Pre-trained Model

GauGAN

Setup

Apply a Pre-trained Model

Cityscapes Dataset

Training

FID Computation

Code Structure

Citation

Acknowledgements

About

Releases

Packages

Languages

License

dltech-xyz/gan-compression

Folders and files

Latest commit

History

Repository files navigation

GAN Compression

project | paper | demo

Demos

Overview

Colab Notebook

Prerequisites

Getting Started

Installation

CycleGAN

Setup

Apply a Pre-trained Model

Pix2pix

Setup

Apply a Pre-trained Model

GauGAN

Setup

Apply a Pre-trained Model

Cityscapes Dataset

Training

FID Computation

Code Structure

Citation

Acknowledgements

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages