Conversation
@mxnet-label-bot add [pr-awaiting-review]
@larroy please take another look. This is an important op for the quantization flow, so we hope it can be merged before the 1.4 code freeze.
@zheng-da @reminisce @szha @eric-haibin-lin @apeforest
NNVM_REGISTER_OP(Concat)
.set_attr<FQuantizedOp>("FQuantizedOp", [](const NodeAttrs& attrs) {
    nnvm::NodePtr node = nnvm::Node::Create();
    node->attrs.op = Op::Get("_contrib_quantized_concat");
It seems the Concat operator will call the MKLDNN version of the operator. Is this intended?
Yes, none of the quantized ops have a default CPU implementation; MKLDNN is the only CPU implementation. I guess that's why they all have the _contrib_ prefix.
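For context, the quoted FQuantizedOp lambda typically completes along the following lines. This is a sketch of the common pattern used by other quantized ops, not necessarily this PR's exact code; in particular, the "quantized_" name prefix and the attr_parser re-run are assumptions.

NNVM_REGISTER_OP(Concat)
.set_attr<FQuantizedOp>("FQuantizedOp", [](const NodeAttrs& attrs) {
    // Create a new node that points at the quantized counterpart of Concat.
    nnvm::NodePtr node = nnvm::Node::Create();
    node->attrs.op = Op::Get("_contrib_quantized_concat");
    node->attrs.name = "quantized_" + attrs.name;  // assumed naming convention
    node->attrs.dict = attrs.dict;                 // reuse the float op's parameters
    if (node->op()->attr_parser != nullptr) {
      node->op()->attr_parser(&(node->attrs));     // re-parse params for the new op
    }
    return node;
  });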
What if MKLDNN is not enabled and the user invokes this operator? Will any error message be given?
I'm not sure. It should follow the framework's default behavior. Quantized ops are all implemented through FComputeEx, so basically they should only be used when MKLDNN is on. One way to avoid this is to define quantized_concat as an MKLDNN-specific op by declaring it inside the MXNET_USE_MKLDNN macro (sketched below), but we don't have any backend-specific ops like that yet. Do you have any suggestions?
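The MKLDNN-only registration I have in mind would look roughly like this. It is a sketch only; MKLDNNQuantizedConcatForward is a placeholder name for the MKLDNN forward function, not an existing symbol.

#if MXNET_USE_MKLDNN == 1
// Register the op (and its CPU kernel) only in MKLDNN-enabled builds,
// so the op simply does not exist otherwise.
NNVM_REGISTER_OP(_contrib_quantized_concat)
.set_attr<FComputeEx>("FComputeEx<cpu>", MKLDNNQuantizedConcatForward);
#endif  // MXNET_USE_MKLDNN == 1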
I think we should issue an error message stating that the quantized op is not supported in a non-MKLDNN build.
Yes, I agree with that, but it's beyond the scope of this PR. Currently all quantized ops have this issue. We need to create another PR that adds a framework-level error message for every op that lacks a default implementation (see the sketch below).
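Something along these lines in the pass that attaches op executors, sketched with illustrative variable names (fcompute and fcomp_ex stand for the looked-up FCompute and FComputeEx functions); this is the follow-up idea, not part of this PR:

// Fail fast with a clear message when an op has no implementation
// for the current build, instead of only logging a warning.
if (fcompute == nullptr && fcomp_ex == nullptr) {
  LOG(FATAL) << "Neither FCompute nor FComputeEx registered for operator "
             << op->name << ". If this is a quantized operator, rebuild "
             << "MXNet with MKLDNN enabled (USE_MKLDNN=1).";
}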
Can we have that PR first, before we merge this one? Hitting this limitation without any clear message could create a bad user experience.
@apeforest Building MXNet with make USE_OPENCV=1 USE_BLAS=openblas (i.e. without MKLDNN) and running a quantized model reports the following messages:
[10:54:08] src/executor/attach_op_execs_pass.cc:351: Neither FCompute nor FComputeEx registered _contrib_quantized_concat
[10:54:08] src/executor/attach_op_execs_pass.cc:351: Neither FCompute nor FComputeEx registered _contrib_quantized_pooling
[10:54:08] src/executor/attach_op_execs_pass.cc:351: Neither FCompute nor FComputeEx registered _contrib_quantized_conv
I think the framework can handle this case properly.
Codecov Report
@@            Coverage Diff             @@
##           master    #13297       +/-   ##
===========================================
- Coverage   79.72%    68.66%    -11.07%
===========================================
  Files         749       652        -97
  Lines       81176     70894     -10282
  Branches     3164      3164
===========================================
- Hits        64714     48676     -16038
- Misses      15606     21923      +6317
+ Partials      856       295       -561
Continue to review full report at Codecov.
Some changes are still needed.
@apeforest All comments are addressed. Can you review again? Thanks a lot for reviewing round after round.
LGTM. Thanks for the detailed explanation.
Thanks for the contribution. Now merging.
Description
This PR adds the quantized concat op and its MKLDNN implementation.
@pengzhao-intel @TaoLv
Checklist
Essentials
Please feel free to remove inapplicable items for your PR.
Changes
Comments