
TurboPrune: High-Speed Distributed Lottery Ticket Training

TurboPrune is a harness for training and finding lottery tickets in PyTorch, with support for multiple pruning techniques, distributed training, FFCV, and AMP.

  • PyTorch Distributed Data Parallel (DDP)-based training harness for training the network (post-pruning) as fast as possible.
  • FFCV integration for super-fast training on ImageNet (1:09 min/epoch on 4x A100 GPUs with ResNet18).
  • Support for most (if not all) torchvision models (Transformers will be added later).
  • Multiple pruning techniques, listed below.
  • Simple harness built on fastargs -- easily extensible.
  • Logging to CSV (nothing fancy, but you can easily integrate wandb/comet/your own system).
  • End-to-end pipeline easily configurable using fastargs (see the sketch below).
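
To give a flavor of the fastargs-based configuration, here is a minimal, self-contained sketch of how parameters can be declared and consumed. The section and parameter names are illustrative (dataset.data_root mirrors the usage further down); see harness_params.py for the actual definitions.

from argparse import ArgumentParser
from fastargs import Section, Param, get_current_config
from fastargs.decorators import param

# Illustrative section -- the real ones live in harness_params.py.
Section('dataset', 'Dataset configuration').params(
    data_root=Param(str, 'path to the dataset folder', required=True),
    name=Param(str, 'dataset to train on', default='cifar10'),
)

@param('dataset.data_root')
@param('dataset.name')
def build_loaders(data_root, name):
    # Construct and return the train/val dataloaders here.
    print(f'Loading {name} from {data_root}')

if __name__ == '__main__':
    config = get_current_config()
    parser = ArgumentParser(description='Illustrative fastargs setup')
    config.augment_argparse(parser)
    config.collect_argparse_args(parser)  # picks up e.g. --dataset.data_root
    config.validate()
    build_loaders()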

Timing Comparison

The numbers below were obtained on clusters with a comparable computational configuration; the only variations were the data-loading method and whether AMP was enabled. The GPU model used was the NVIDIA A100 (40GB).

The model used was ResNet50, and the effective batch size in each case was 512.

[Chart: timing comparison across data-loading methods and AMP settings]

Datasets Supported

  1. CIFAR10
  2. CIFAR100
  3. ImageNet
  4. SVHN (to be added)

Networks Supported

As it stands, ResNet and VGG variants should work out of the box. If you run into issues with any other variant, I'm happy to look into it. For CIFAR-based datasets, there are modifications to the basic architecture based on tuning and references such as this repository; a sketch of the typical change follows.
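
For reference, a common CIFAR adaptation (a sketch of the usual tuning, not necessarily this repository's exact change) swaps the ImageNet 7x7 stem for a 3x3 convolution and drops the initial max-pool, since 32x32 inputs are otherwise downsampled too aggressively:

import torch.nn as nn
from torchvision.models import resnet18

def resnet18_for_cifar(num_classes=10):
    # Typical CIFAR stem: 3x3/stride-1 conv, no max-pool.
    model = resnet18(num_classes=num_classes)
    model.conv1 = nn.Conv2d(3, 64, kernel_size=3, stride=1, padding=1, bias=False)
    model.maxpool = nn.Identity()
    return model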

Pruning Algorithms included:

Repository structure:

  1. harness.py: contains the training harness for actually training the network; has the requisite setup for DDP.
  2. harness_params.py: defines the parameters to be provided via config (fastargs). Please refer to the code or documentation for usage directions.
  3. harness_utils.py: contains methods used for rewinding the weights, the optimizer, and other nice things we need to make this training work.
  4. utils/conv_type: has the layer definitions and the model pre-processing function that inserts the mask parameters as a buffer into those layers (a minimal sketch follows this list). This is what you probably want to edit to add support for <insert custom SOTA architecture here>.
  5. utils/dataset.py: definitions for CIFAR10/CIFAR100, DDP or otherwise.
  6. utils/schedulers.py: learning rate schedulers, for when you need to use them.
  7. utils/pruning_utils.py: the pruning harness.
  • Pruning within the training harness itself is not very stable w.r.t. DDP and probably not the right way to do it (the harness is supposed to just train the network anyway).
  • This file contains all the criteria we use for pruning (those in the previous section) and a pruning harness with a method to call them -- hopefully this is useful.
  • The pruning harness can be called:
    • at initialization, for Pruning at Initialization (PaI),
    • once, for one-shot pruning, or
    • at the beginning of each level, for an iterative method. Where necessary, it will use a GPU/dataset.
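
To give an idea of what utils/conv_type does, here is a minimal sketch (an assumption about the shape of the real implementation, not a copy of it) of a conv layer that keeps its pruning mask as a buffer -- the mask is saved in the state dict and follows the module across devices/DDP, but receives no gradients:

import torch
import torch.nn as nn

class MaskedConv2d(nn.Conv2d):
    """Conv2d whose weight is elementwise-gated by a pruning mask."""

    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs)
        # Buffer, not Parameter: persisted and moved with the module,
        # broadcast by DDP, but never updated by the optimizer.
        self.register_buffer('mask', torch.ones_like(self.weight))

    def forward(self, x):
        return self._conv_forward(x, self.weight * self.mask, self.bias)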

Important Pre-requisites

  • To run ImageNet experiments, you obviously need ImageNet downloaded; in addition, since we use FFCV, you need to generate .beton files as per the instructions here (a writer sketch follows below).
  • CIFAR10, CIFAR100 and other datasets are handled using torchvision -- thank you, torchvision!
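
For reference, writing a .beton with FFCV looks roughly like this; the paths and field settings are illustrative, so follow the official FFCV ImageNet instructions for the exact configuration:

from ffcv.writer import DatasetWriter
from ffcv.fields import RGBImageField, IntField
from torchvision.datasets import ImageFolder

# Illustrative paths -- point these at your ImageNet split.
dataset = ImageFolder('/path/to/imagenet/train')
writer = DatasetWriter('/path/to/train.beton', {
    'image': RGBImageField(max_resolution=256),
    'label': IntField(),
})
writer.from_indexed_dataset(dataset)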

Usage

Now to the fun part:

To start an experiment, ensure there is appropriate (sufficient) compute available (or it might take a while -- it's going to anyway) and, in the case of ImageNet, that the appropriate betons are available.

pip install -r requirements.txt
python harness.py --config configs/imagenet_lrr_resnet18.yaml --dataset.data_root <PATH_TO_FOLDER>

and it should start.
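
Since everything is declared through fastargs, any configured parameter can be overridden on the command line in the same --section.parameter form as dataset.data_root above. The optimizer.lr name below is hypothetical -- check harness_params.py (or --help) for the real parameter names:

python harness.py --config configs/imagenet_lrr_resnet18.yaml --dataset.data_root <PATH_TO_FOLDER> --optimizer.lr 0.2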

Baselines

The configs provided in configs/ are for some tuned baselines, but if you find a better configuration, please feel free to make a pull request. Some results:

ImageNet Baseline

[Plot: ImageNet baseline results]

CIFAR10 Baseline

[Plot: CIFAR10 baseline results]

CIFAR100 Baseline

[Plot: CIFAR100 baseline results]

If you use this code in your research and find it useful in general, please consider citing it using:

@software{Nelaturu_TurboPrune_High-Speed_Distributed,
  author = {Nelaturu, Sree Harsha and Gadhikar, Advait and Burkholz, Rebekka},
  license = {Apache-2.0},
  title = {{TurboPrune: High-Speed Distributed Lottery Ticket Training}},
  url = {https://github.com/nelaturuharsha/TurboPrune}
}

Footnotes and Acknowledgments:

  • This code was built with reference to the substantial hard work put in by Advait Gadhikar.
  • Thank you to Dr. Rebekka Burkholz for the opportunity to build this :)
  • I was heavily influenced by the code style here. Just a general thanks and shout-out to the FFCV team for all they've done!
  • All credit/references for the original methods and reference implementations are due to the original authors of the work :)
  • Thank you Andrej, Bhavnick, and Akanksha for the feedback :)
