Rare Gems: Finding Lottery Tickets at Initialization

Overview


It has been widely observed that large neural networks can be pruned to a small fraction of their original size, with little loss in accuracy, by following a time-consuming "train, prune, re-train" approach. Frankle & Carbin (2018) conjecture that we can avoid this by training lottery tickets, i.e., special sparse subnetworks found at initialization, that can be trained to high accuracy. However, a subsequent line of work presents concrete evidence that current algorithms for finding trainable networks at initialization fail simple baseline comparisons, e.g., against training random sparse subnetworks. Finding lottery tickets that train to better accuracy than these simple baselines remains an open problem. In this work, we partially resolve this open problem by discovering rare gems: subnetworks at initialization that attain considerable accuracy even before training. Refining these rare gems via fine-tuning beats current baselines and leads to accuracy competitive with or better than magnitude pruning methods.

Dependencies (tentative)


Tested stable dependencies:

  • python 3.6.5 (Anaconda)
  • PyTorch 1.1.0
  • torchvision 0.2.2
  • CUDA 10.0.130
  • cuDNN 7.5.1
  • tensorboard
  • tqdm
  • ffcv (only needed if you want to run the FFCV ImageNet pipeline)
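
The following is a minimal sketch of how such an environment could be set up with Anaconda. The package/channel names and the pairing of cudatoolkit with this PyTorch release are assumptions, so adjust them to your local CUDA installation.

# Create and activate a Python 3.6 environment (assumes Anaconda is available)
conda create -n rare-gems python=3.6.5
conda activate rare-gems

# PyTorch 1.1.0 and torchvision 0.2.2 built against CUDA 10.0
conda install pytorch==1.1.0 torchvision==0.2.2 cudatoolkit=10.0 -c pytorch

# Logging and progress-bar utilities
pip install tensorboard tqdm

# ffcv is only needed for the FFCV ImageNet runs; follow the FFCV
# installation instructions separately, since it has its own dependencies.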

Data Preparation


  1. For TinyImageNet, run load_tiny_imagenet.sh (see the example commands below).
  2. For ImageNet, download the dataset yourself and specify its path in data/imagenet.py (currently in a separate branch; it will be merged soon).
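
For example (the ImageNet path below is a placeholder, not a path the repository expects):

# Download and unpack Tiny ImageNet via the provided script
bash load_tiny_imagenet.sh

# For ImageNet, download the dataset manually, e.g. to /path/to/imagenet,
# and point the loader in data/imagenet.py at that directory.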

Running Experiments


The main script is main.py. To launch jobs, we provide the scripts ./cifar_exec.sh and ./imp_exec.sh. A description of the main arguments follows; for more detailed descriptions, refer to args_helper.py.

Argument                 Description
algo                     Algorithm to run (e.g. `hc`, `hc_iter`).
lr                       Initial learning rate used during the pruning process.
fine_tune_lr             Initial learning rate used during the fine-tuning process.
batch-size               Batch size for the optimizer, e.g. SGD or Adam.
optimizer                `sgd` or `adam`.
dataset                  Dataset to use.
arch                     Model architecture to use.
gamma                    Learning-rate decay factor, i.e. the effective learning rate is lr * gamma^t.
iter_period              For `hc_iter` only: how often to run iterative thresholding.
conv_type                Will almost always be `SubnetConv` for pruning.
target_sparsity          Target sparsity of the ticket.
unflag_before_finetune   Restore weights if the regularizer killed too many.
init                     Weight initialization distribution.
score_init               Score initialization distribution.
hc_quantized             Enable for GM, since it rounds scores on the forward pass.
regularization           Regularization type (e.g. `L2`).
lmbda                    Regularization weight.
gpu                      Which GPU to run on.
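
As an illustration, a direct invocation that overrides a few of these arguments could look like the sketch below. The flag spellings are an assumption (they are taken from the table above and assumed to be exposed as command-line options); check args_helper.py and the provided *_exec.sh scripts for the exact interface.

# Hypothetical direct launch; exact flag names may differ (see args_helper.py)
python main.py \
    --algo hc_iter \
    --arch resnet20 \
    --dataset CIFAR10 \
    --optimizer sgd \
    --lr 0.1 \
    --target_sparsity 0.59 \
    --gpu 0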

Configs

Note that the workflow is managed by specifying the above arguments in .yml files under the configs/ directory. Please refer to them when creating new configs, e.g. configs/resnet20/resnet20_sparsity_0_59_unflagT.yml.

Sample Config

# subfolder: target_sparsity_0_59_unflagT

# Hypercube optimization
algo: 'hc_iter'
iter_period: 5

# Architecture
arch: resnet20

# ===== Dataset ===== #
dataset: CIFAR10
name: resnet20_quantized_iter_hc

# ===== Learning Rate Policy ======== #
optimizer: sgd
lr: 0.1 #0.01
lr_policy: cosine_lr #constant_lr #multistep_lr
fine_tune_lr: 0.01
fine_tune_lr_policy: multistep_lr

# ===== Network training config ===== #
epochs: 150 
wd: 0.0 
momentum: 0.9
batch_size: 128

# ===== Sparsity =========== #
conv_type: SubnetConv
bn_type: NonAffineBatchNorm
freeze_weights: True
prune_type: BottomK
# enter target sparsity here
target_sparsity: 0.59
# decide if you want to "unflag"
unflag_before_finetune: True
init: signed_constant
score_init: unif #skew #half #bimodal #skew # bern
scale_fan: False #True

# ===== Rounding ===== #
round: naive 
noise: True
noise_ratio: 0 

# ===== Quantization ===== #
hc_quantized: True
quantize_threshold: 0.5

# ===== Regularization ===== #
regularization: L2
lmbda: 0.0001 # 1e-4

# ===== Hardware setup ===== #
workers: 4
gpu: 0

# ===== Checkpointing ===== #
checkpoint_at_prune: False

# ==== sanity check ==== #
skip_sanity_checks: False
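
With a config like the one above saved under configs/, a run can then be launched through main.py. The --config flag below is an assumption about how args_helper.py wires in the .yml files; if it is named differently, the provided cifar_exec.sh script shows the intended invocation.

# Launch the hc_iter run described by the sample config above
python main.py --config configs/resnet20/resnet20_sparsity_0_59_unflagT.yml --gpu 0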
