# interpretability_for_adversarial_detection

Work exploring the use of interpretability techniques, such as saliency maps, to help detect adversarial attacks on machine learning models.

The training data generation code is written for Python 3.

After installing the required Python modules, copy the files found in `foolbox_replacement_files/models` into `foolbox/models` in your site-packages directory. (Because this step modifies the installed package, using a virtual environment is recommended.)
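The copy step can also be scripted. This is a minimal sketch (assuming `foolbox` is already installed in the active environment and the command is run from the repository root), not a script shipped with the repo:

```python
# Sketch: copy the replacement model files into the installed foolbox
# package. Assumes foolbox is importable and this runs from the repo root.
import os
import shutil

import foolbox

src = "foolbox_replacement_files/models"
dest = os.path.join(os.path.dirname(foolbox.__file__), "models")

for name in os.listdir(src):
    shutil.copy(os.path.join(src, name), os.path.join(dest, name))
```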

Run `cifar_util.py` with Python 2 (the CIFAR-10 image batches were pickled under Python 2) to produce the CIFAR-10 images used to build the adversarial-detector training data.
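The Python 2 requirement comes from how the CIFAR-10 batches were pickled: they unpickle directly under Python 2, while Python 3 needs a byte-encoding workaround. A minimal sketch of loading one batch (the path assumes the standard CIFAR-10 python-version download layout; `cifar_util.py` itself may differ):

```python
# Python 2 sketch: unpickle one CIFAR-10 batch. Under Python 3 the same
# call would need pickle.load(f, encoding='bytes'), hence the Python 2
# requirement above.
import cPickle

with open("cifar-10-batches-py/data_batch_1", "rb") as f:
    batch = cPickle.load(f)

images = batch["data"].reshape(-1, 3, 32, 32)  # 10000 uint8 images, NCHW
labels = batch["labels"]                       # list of 10000 class ids
```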

MNIST training data generation: `generate_training_images.py` (Python 3)

CIFAR-10 training data generation: `cifar_generate_training_images.py` (Python 3)
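For context, the interpretability signal the detector training data is built on is a saliency map: the gradient of the predicted class score with respect to the input pixels. The following is a hedged sketch in modern TensorFlow with a placeholder untrained model, not the repo's own implementation:

```python
# Sketch: vanilla gradient saliency for one input. `model` is a
# placeholder untrained classifier; the repo's scripts would use their
# trained MNIST/CIFAR-10 models instead.
import numpy as np
import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.layers.Flatten(input_shape=(28, 28, 1)),
    tf.keras.layers.Dense(10),  # class logits
])

x = tf.convert_to_tensor(np.random.rand(1, 28, 28, 1).astype(np.float32))
with tf.GradientTape() as tape:
    tape.watch(x)
    logits = model(x)
    score = tf.reduce_max(logits, axis=1)  # predicted-class score

saliency = tf.abs(tape.gradient(score, x))[0]  # |d score / d pixel|
```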
