This repo gives a brief introduction and the code to reproduce the experiments of our paper: Kiran Koshy Thekumparampil, Ashish Khetan, Zinan Lin, and Sewoong Oh. Robustness of conditional GANs to noisy labels, NIPS 2018.
A variant of GAN that is popular in many practical applications is the conditional GAN (cGAN), where the input examples are labelled and the goal is to generate realistic examples conditioned on the label chosen by the end-user. That is, given a set of samples {(xi, yi)} from distribution P(X,Y), cGAN should learn to sample from the conditional distributions P(X|Y=y), for any y.
For example, the MNIST dataset of hand written digit images, {xi}, is divided into 10 classes (0-9), according to the digit the image represents. Using these images {xi}, and their corresponding class labels (digits) {yi}, we can then train a cGAN to generate hand written digits for any digit in {0,1,...,9}. In the figure below, we show ten images generated by a cGAN for each digit class (row).
One of the merits of conditional GANs is that conditioning improves sample quality. Empirical evidences in the literature show that conditional GANs achieve better image quality compared to the (unconditioned) GANs trained on the same data but without the labels.
Due to either human errors in annotation or malicious adversaries, labels might be corrupted. If we train a cGAN using these noisy labels, as expected, the generated samples are biased. For example, if we randomly and uniformly corrupt the true labels with a chance of 50%, we see that the trained cGAN on such a data will only produce the correct digit only 50% time as show in the figure below.
Apart from the bias in the generated examples, we note the decline in the quality of these samples as compared to the noise-less case.
- Samples are biased, generating examples from wrong classes.
- Samples quality degrades with noise.
Can we make cGAN training robust against noise in the labels?
We propose RCGAN architecture when we know the noise model C which corrupted the real labels. In RCGAN, before passing the samples (x, y) generated by the generator G, to the discriminator D, we pass the generated label y through the same noisy channel C to corrupt them. Effectively, the adversarial loss D minimizes the distance between the distributions of real noisy samples and generated noisy samples. In the paper we prove that under-certain assumptions on the discriminator RCGAN learns the noiseless "true" conditional distributions.
When the noise model C, is unknown we propose RCGAN-U, which has a similar architecture as that of RCGAN but, along with the generator G, the noise model C is learned as a parametric model. For identifiability of C, we add the permutation regularizer to the architecture.
As an immediate application of RCGAN, we also use these learned cGANs to recover the hidden true real labels with high accuracy. We compare our recovery accuracy with a state-of-the-art un-biased classifier.
In addition to the baseline biased GAN, we provide another baseline: un-biased GAN, which is inspired from un-biased classifier (see our paper for details). We see that our proposed RCGAN and RCGAN-U consistently beat the baselines in all metrics and sample quality.
For MNIST, we also propose another RCGAN variant named RCGAN+y, which has the same discriminator as that of RCGAN but additionally a one-hot representation of the label y is concatenated with the input to its first layer. RCGAN+y is challenging to train (see appendix of our paper for details).
In the following plots we chart:
- generator label accuracy: rate at which the generator produces the correct digit, and,
- label recovery accuracy: recovery accuracy of corresponding cGAN-label-recovery algorithm.
In the following plots we chart:
- inception score: , and,
- generator label accuracy: rate at which the generator produces the correct class.
- Linux or OSX
- python 3.5
- tensorflow 1.5
- numpy
- scipy
Please download MNIST and CIFAR-10 dataset to the appropriate folders using the follow commands.
python download mnist
python download cifar10
Now you can use the following scripts to train different settings. All the scripts use only one GPU, but we have observed that using 2 GPUs can accelerate the training for CIFAR-10.
Code is adapted from https://github.com/carpedm20/DCGAN-tensorflow.
Before running the MNIST experiments make mnist/
your current directory: cd mnist/
- RCGAN:
./run_rcgan.sh
- RCGAN-U:
./run_rcganu.sh
- RCGAN+y:
./run_rcgany.sh
- Biased GAN:
./run_biased.sh
- Un-biased GAN:
./run_unbiased.sh
- Ambient GAN:
./run_ambient.sh
Code is adapted from https://github.com/watsonyanghx/GAN_Lib_Tensorflow.
Before running the MNIST experiments make cifar10/
your current directory: cd cifar10/
.
- RCGAN:
./run_rcgan.sh
- RCGAN-U:
./run_rcganu.sh
- Biased GAN:
./run_biased.sh
- Un-biased GAN:
./run_unbiased.sh