This repository has been archived by the owner on Sep 18, 2024. It is now read-only.

Constraint-aware one-shot pruners #2657

Merged
merged 111 commits into microsoft:master on Sep 21, 2020

Conversation

zheng-ningxin
Contributor

In this PR, we add three constraint-aware one-shot pruners to NNI: Constrained_L1FilterPruner, Constrained_L2FilterPruner, and ConstrainedActivationMeanRankFilterPruner.
These pruners are aware of the channel-dependency and group-dependency constraints and prune the model under those constraints, so that we can better harvest the speed benefit from model pruning. In the original version, L1FilterPruner prunes the model based only on L1 norm values, and many of the resulting pruned models violate these constraints (channel dependency / group dependency). As a result, the benefits of model pruning cannot be obtained through the speedup module.
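For readers unfamiliar with the constraint, here is a minimal sketch (not code from this PR) of the kind of structure that creates a channel dependency: the two conv outputs below are added element-wise, so if one conv prunes output channel k, the other must prune the same channel, otherwise the speedup module cannot physically remove the channels.

```python
import torch
import torch.nn as nn

class TwoBranchBlock(nn.Module):
    """Hypothetical block used only to illustrate channel dependency."""
    def __init__(self):
        super().__init__()
        self.conv1 = nn.Conv2d(16, 32, kernel_size=3, padding=1)
        self.conv2 = nn.Conv2d(16, 32, kernel_size=3, padding=1)

    def forward(self, x):
        # The element-wise add requires conv1 and conv2 to keep the same set of
        # output channels -- exactly the constraint the new pruners respect.
        return self.conv1(x) + self.conv2(x)
```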

Ningxin added 12 commits July 5, 2020 07:05
@zheng-ningxin
Contributor Author

#2616

Ningxin added 4 commits July 8, 2020 03:23
Ningxin added 11 commits July 20, 2020 06:57
@QuanluZhang mentioned this pull request Sep 14, 2020
def update_mask(self):
    if not self.dependency_aware:
        # if we use the normal way to update the mask,
        # then call the updata_mask of the father class
Contributor

updata -> update

def _dependency_update_mask(self):
    """
    In the original update_mask, the wraper of each layer will update its
    mask own mask according to the sparsity specified in the config_list. However, in
Contributor

mask own mask -> own mask

    The list of the wrappers that are in the same channel dependency
    set.
wrappers_idx : list
    The list of the indexes of the wrappers.
Contributor

please also write "Returns" here in docstring

@@ -47,6 +79,9 @@ def calc_mask(self, sparsity, wrapper, wrapper_idx=None):
layer wrapper of this layer
wrapper_idx: int
index of this wrapper in pruner's all wrappers
channel_masks: Tensor
channel_mask indicates the channels that we should at least mask.
the finnal masked channels should include these channels.
Contributor

finnal -> final

    num_prune = num_total - num_preserve

    if num_total < 2 or num_prune < 1:
        return mask
    # weight*mask_weight: apply base mask for iterative pruning
-   return self.get_mask(mask, weight*mask_weight, num_prune, wrapper, wrapper_idx)
+   return self.get_mask(mask, weight*mask_weight, num_prune, wrapper, wrapper_idx, channel_masks)
Contributor

what is the difference between mask and channel_masks?

Contributor Author

channel_masks indicates the output channels that we should at least mask in this conv layer; it actually holds the common channels that all the layers in the dependency group should prune. mask is just the mask of this conv layer. Maybe we can change the name to make this clearer?
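(For illustration only, a minimal sketch of how such a common channel mask could be folded into a layer's own mask; the helper name and the 0/1 convention are assumptions, not the PR's actual implementation.)

```python
import torch

def apply_common_channel_mask(layer_mask: torch.Tensor, channel_masks: torch.Tensor) -> torch.Tensor:
    """Hypothetical helper: force the output channels that the whole dependency
    group must prune (channel_masks == 0) to also be pruned in this layer's mask.

    layer_mask:    (out_channels, in_channels, kh, kw), 1 = keep, 0 = prune
    channel_masks: (out_channels,),                     1 = keep, 0 = prune
    """
    return layer_mask * channel_masks[:, None, None, None]
```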

Contributor

yes, you can directly remove channel_masks in this call, and also remove it from _normal_calc_mask's argument list

# find the max number of the filter groups of the dependent
# layers. The group constraint of this dependency set is decided
# by the layer with the max groups.
max_group = max(groups)
Contributor

why should we choose the maximum value of the groups?

## Evaluation
To compare the performance of the pruner with and without the dependency-aware mode, we use L1FilterPruner to prune Mobilenet_v2 with the dependency-aware mode turned on and off. To simplify the experiment, we use uniform pruning, which means we allocate the same sparsity to all convolutional layers in the model.
We trained a Mobilenet_v2 model on the CIFAR-10 dataset and pruned the model based on this pretrained checkpoint. The following figure shows the accuracy and FLOPs of the models pruned by the different pruners.
![](../../img/mobilev2_l1_cifar.jpg)
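(A minimal sketch of the uniform-sparsity setup described above; the sparsity value of 0.5 and the surrounding `model` / `dummy_input` variables are assumptions, not taken from the PR.)

```python
from nni.compression.torch import L1FilterPruner

# uniform pruning: the same sparsity is assigned to every convolutional layer
config_list = [{'sparsity': 0.5, 'op_types': ['Conv2d']}]

# baseline: dependency-aware mode off
pruner = L1FilterPruner(model, config_list)
# dependency-aware mode on (dummy_input is required in this mode):
# pruner = L1FilterPruner(model, config_list, dependency_aware=True, dummy_input=dummy_input)
pruner.compress()
```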
Contributor

this figure looks great!

Comment on lines 30 to 100
```python
from nni.compression.torch import L1FilterPruner
config_list = [{ 'sparsity': 0.8, 'op_types': ['Conv2d'] }]
# dummy_input is necessary for the dependency_aware mode
dummy_input = torch.ones(1, 3, 224, 224).cuda()
pruner = L1FilterPruner(model, config_list, dependency_aware=True, dummy_input=dummy_input)
pruner.compress()
```

To enable the dependency-aware mode for `L2FilterPruner`:
```python
from nni.compression.torch import L2FilterPruner
config_list = [{ 'sparsity': 0.8, 'op_types': ['Conv2d'] }]
# dummy_input is necessary for the dependency_aware mode
dummy_input = torch.ones(1, 3, 224, 224).cuda()
pruner = L2FilterPruner(model, config_list, dependency_aware=True, dummy_input=dummy_input)
pruner.compress()
```

To enable the dependency-aware mode for `FPGMPruner`:
```python
from nni.compression.torch import FPGMPruner
config_list = [{
'sparsity': 0.5,
'op_types': ['Conv2d']
}]
# dummy_input is necessary for the dependency_aware mode
dummy_input = torch.ones(1, 3, 224, 224).cuda()
pruner = FPGMPruner(model, config_list, dependency_aware=True, dummy_input=dummy_input)
pruner.compress()
```

To enable the dependency-aware mode for `ActivationAPoZRankFilterPruner`:
```python
from nni.compression.torch import ActivationAPoZRankFilterPruner
config_list = [{
'sparsity': 0.5,
'op_types': ['Conv2d']
}]
# dummy_input is necessary for the dependency_aware mode
dummy_input = torch.ones(1, 3, 224, 224).cuda()
pruner = ActivationAPoZRankFilterPruner(model, config_list, statistics_batch_num=1, dependency_aware=True, dummy_input=dummy_input)
pruner.compress()
```

To enable the dependency-aware mode for `ActivationMeanRankFilterPruner`:

```python
from nni.compression.torch import ActivationMeanRankFilterPruner
config_list = [{
'sparsity': 0.5,
'op_types': ['Conv2d']
}]
# dummy_input is necessary for the dependency-aware mode and
# should be on the same device as the model
dummy_input = torch.ones(1, 3, 224, 224).cuda()
pruner = ActivationMeanRankFilterPruner(model, config_list, statistics_batch_num=1, dependency_aware=True, dummy_input=dummy_input)
pruner.compress()
```

To enable the dependency-aware mode for `TaylorFOWeightFilterPruner`:
```python
from nni.compression.torch import TaylorFOWeightFilterPruner
config_list = [{
'sparsity': 0.5,
'op_types': ['Conv2d']
}]
dummy_input = torch.ones(1, 3, 224, 224).cuda()
pruner = TaylorFOWeightFilterPruner(model, config_list, statistics_batch_num=1, dependency_aware=True, dummy_input=dummy_input)
pruner.compress()
```
Contributor

we can simplify this example code

Contributor Author

Sure, how about we just keep the example of L1FilterPruner and remove the others?

Contributor

good point. Above the line `pruner = L1FilterPruner(model, config_list, dependency_aware=True, dummy_input=dummy_input)` you can use comments to show the usage of the other pruners, one comment line for each pruner
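A sketch of what the simplified documentation example could look like after this suggestion (hypothetical, assembled from the existing snippets above; not the final merged text):

```python
from nni.compression.torch import L1FilterPruner
config_list = [{ 'sparsity': 0.8, 'op_types': ['Conv2d'] }]
# dummy_input is necessary for the dependency_aware mode and should be
# on the same device as the model
dummy_input = torch.ones(1, 3, 224, 224).cuda()
# The other filter pruners are enabled the same way, for example:
# pruner = L2FilterPruner(model, config_list, dependency_aware=True, dummy_input=dummy_input)
# pruner = FPGMPruner(model, config_list, dependency_aware=True, dummy_input=dummy_input)
# pruner = ActivationMeanRankFilterPruner(model, config_list, statistics_batch_num=1,
#                                         dependency_aware=True, dummy_input=dummy_input)
pruner = L1FilterPruner(model, config_list, dependency_aware=True, dummy_input=dummy_input)
pruner.compress()
```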

threshold = torch.topk(w_abs_structured.view(-1), num_prune, largest=False)[0].max()
mask_weight = torch.gt(w_abs_structured, threshold)[:, None, None, None].expand_as(weight).type_as(weight)
mask_bias = torch.gt(w_abs_structured, threshold).type_as(weight).detach() if base_mask['bias_mask'] is not None else None
return w_abs_structured
Contributor

here we only support output channel, right?

Contributor Author

Yes, the L1FilterPruner only prunes the filters (output channel).
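(A minimal sketch, not the PR's code, of what pruning filters by their L1 norm along the output-channel dimension means:)

```python
import torch

# Conv2d weight layout: (out_channels, in_channels, kh, kw)
weight = torch.randn(32, 16, 3, 3)
# L1 norm of each filter, i.e. of each output channel
filter_l1_norm = weight.abs().view(weight.size(0), -1).sum(dim=1)
# the filters with the smallest norms are the pruning candidates
prune_idx = torch.topk(filter_l1_norm, k=8, largest=False).indices
```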

@QuanluZhang
Contributor

QuanluZhang commented Sep 18, 2020

@chicm-ms , I didn't fully check the modifications of XXXPrunerMasker, as I know little about them.

return None
mean_activation = self._cal_mean_activation(activations)
if channel_masks is not None:
Contributor

Let's remove this special handling for mean activation as discussed.

Contributor Author

Updated~

@chicm-ms chicm-ms merged commit ec5af41 into microsoft:master Sep 21, 2020
zheng-ningxin added a commit to zheng-ningxin/nni that referenced this pull request Nov 18, 2020