PCA Color Augmentation as described in the AlexNet paper. It runs with TensorFlow, Keras, and NumPy.
It is a form of data augmentation based on principal component analysis. By computing the eigenvectors and eigenvalues of the data, noise matching the data distribution can be added, which behaves as if there were many more images (samples).
The original PCA augmentation applies only to images, but I noticed it can also be applied to structured data, so I implemented that.
Since it is implemented as a tensor computation, structured data, not just images, can be augmented, optionally per category.
AlexNet predates BatchNorm, so in addition to PCA Augmentation I also check the effect of BatchNorm.
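The image version of the idea can be sketched in NumPy as follows. This is a minimal sketch, not the repo's implementation: the function name and the `alpha_std` parameter are my own, and the per-image covariance/noise scale may differ from `pca_aug_numpy_tensor.py`.

```python
import numpy as np

def pca_color_augmentation(image, alpha_std=0.1, rng=None):
    """AlexNet-style PCA color augmentation (sketch).

    image: (H, W, 3) uint8 array. Adds noise along the principal
    components of the RGB pixel distribution, scaled by the
    eigenvalues, as in the AlexNet paper.
    """
    rng = np.random.default_rng() if rng is None else rng
    img = image.astype(np.float64) / 255.0
    pixels = img.reshape(-1, 3)
    # Covariance of the three color channels over all pixels
    cov = np.cov(pixels, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)
    # One random coefficient per principal component
    alpha = rng.normal(0.0, alpha_std, size=3)
    # Noise = [v1 v2 v3] @ [a1*l1, a2*l2, a3*l3]^T
    noise = eigvecs @ (alpha * eigvals)
    augmented = np.clip(img + noise, 0.0, 1.0)
    return (augmented * 255).astype(np.uint8)
```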
PCA Augmentation | Batch Norm | Train Acc | Validation Acc | s/epc (GPU) | s/epc (CPU) |
---|---|---|---|---|---|
Yes | Yes | 0.9931 | 0.7618 | 94 | 14 |
Yes | No | 0.9208 | 0.6651 | 92 | 14 |
No | Yes | 1.0000 | 0.7762 | 6 | - |
No | No | 0.9843 | 0.6507 | 6 | - |
BatchNorm is very strong: in this evaluation, its effect cancels out that of the augmentation. Without BatchNorm, the effect of PCA Augmentation can clearly be confirmed.
s/epc (GPU) is seconds per epoch when PCA Augmentation runs on the GPU; s/epc (CPU) is the same with it on the CPU. Training itself runs on the GPU in both cases.
The GPU version is slow because TensorFlow's GPU SVD is very slow. Related issue.
So I recommend running PCA Augmentation with the NumPy tensor version on the CPU (pca_aug_numpy_tensor.py).
The Scikit-learn wine dataset is used. To reproduce overfitting, I change the train/test split ratio from the usual 7:3 to 3:7.
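The deliberately small split can be reproduced like this (a sketch; `random_state` and `stratify` are my own choices, not necessarily the repo's):

```python
from sklearn.datasets import load_wine
from sklearn.model_selection import train_test_split

X, y = load_wine(return_X_y=True)  # 178 samples, 13 features, 3 classes
# Only 30% of the data goes to training, to provoke overfitting
X_train, X_test, y_train, y_test = train_test_split(
    X, y, train_size=0.3, random_state=0, stratify=y)
```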
PCA Augmentation | # of PCA augmentations | Max Val Acc |
---|---|---|
No | - | 0.9440 |
Yes(Total) | 5 | 0.9600 |
Yes(Total) | 20 | 0.9760 |
Yes(Categorical) | 5 | 0.9600 |
Yes(Categorical) | 20 | 0.9680 |
PCA Augmentation works well on structured data! Per-category (categorical) augmentation works well too.
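The per-category variant can be sketched as below: principal components are computed separately within each category, so the added noise follows each category's own distribution. This is my own minimal sketch (function name and `alpha_std` are assumptions), not the repo's tensor implementation.

```python
import numpy as np

def pca_augment_by_category(X, labels, alpha_std=0.1, rng=None):
    """Per-category PCA augmentation for structured data (sketch).

    X: (N, D) feature matrix; labels: (N,) category ids.
    Assumes each category has at least two samples, so that a
    covariance matrix can be estimated within it.
    """
    rng = np.random.default_rng() if rng is None else rng
    X_aug = X.astype(np.float64).copy()
    for c in np.unique(labels):
        idx = np.where(labels == c)[0]
        # Covariance and eigendecomposition within this category only
        cov = np.cov(X[idx], rowvar=False)
        eigvals, eigvecs = np.linalg.eigh(cov)
        eigvals = np.clip(eigvals, 0.0, None)  # guard tiny negative values
        # One coefficient per component, per sample in the category
        alpha = rng.normal(0.0, alpha_std, size=(len(idx), X.shape[1]))
        # Each row of the noise is sum_j alpha_j * lambda_j * v_j
        X_aug[idx] += (alpha * eigvals) @ eigvecs.T
    return X_aug
```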