[QNN][TFLite] Added support for fused-bias and quantized input in TRANSPOSE_CONV for TFLite. #6523
Conversation
…NSPOSE_CONV for TFLite.

* Added dilation_value attribute to the dilate operator of Relay/TOPI (enables a custom value for dilation, instead of always 0).
* Added tests for dilation_value of the dilate operator in Relay and TOPI.
* Added support for quantized input in the TRANSPOSE_CONV operator of TFLite.
* Added tests for quantized input in the TRANSPOSE_CONV operator of TFLite.
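For illustration, the effect of the new dilation_value attribute can be sketched in NumPy. This is a minimal stand-in for what the Relay/TOPI dilate operator computes, not the actual TVM implementation:

```python
import numpy as np

def dilate(data, strides, dilation_value=0.0):
    """Insert (stride - 1) copies of dilation_value between neighbouring
    elements along each axis. With dilation_value=0.0 this matches the
    old fixed behaviour; the PR makes the fill value configurable."""
    out_shape = tuple((s - 1) * st + 1 for s, st in zip(data.shape, strides))
    out = np.full(out_shape, dilation_value, dtype=data.dtype)
    # Original elements land on every stride-th position; the rest keep
    # the fill value.
    out[tuple(slice(None, None, st) for st in strides)] = data
    return out

x = np.array([[1, 2], [3, 4]], dtype=np.float32)
y = dilate(x, strides=(2, 2), dilation_value=-1.0)
# y:
# [[ 1. -1.  2.]
#  [-1. -1. -1.]
#  [ 3. -1.  4.]]
```

A non-zero fill value matters for quantized models, where "zero" in the real domain corresponds to the zero point, not to the integer 0.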
Looks almost there to me. Could we see if there's a hosted model somewhere with a transpose convolution in it that we can test with? Also pinging @giuseros, as I know you're familiar with the maths behind this.
Also pinging @siju-samuel.
The dilation part is good. I am not sure about the conv2d_transpose portion. My concern is that we now have to replicate the logic across different framework parsers. My suggestion would be to add … For now, we can make the transformation for all targets, not just specifically for Arm. This keeps the option open to improve the schedule of conv2d_transpose as a whole if needed.
The quantized transpose convolution code needs some changes, so bringing …
* Add initial support for quantized transpose convolution in Relay

This work is based on @jainris initial PR: #6523. I added a relay.qnn.conv2d_transpose node. The strategy I followed is to convert to int16 and invoke nn.conv2d_transpose (which already exists in Relay). Main changes:
- The node declaration lives in relay/qnn/op/convolution_transpose.cc.
- The int8 -> int16 cast and subsequent offset removal are in tvm/relay/qnn/op/legalizations.py.
- I added and tested the operator in the TFLite front-end.
- I added a unit test in Relay for qnn.conv2d_transpose.

Co-authored-by: Rishabh Jain <jainris@users.noreply.github.com>

* Fix linting
* Addressing review comments

Co-authored-by: Rishabh Jain <jainris@users.noreply.github.com>
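The cast-and-offset-removal step described in the commit message can be sketched in NumPy. The function name here is illustrative, not TVM's actual API; it only shows the arithmetic the legalization performs before handing off to the regular nn.conv2d_transpose:

```python
import numpy as np

def widen_and_remove_offsets(data_i8, kernel_i8, data_zero_point, kernel_zero_point):
    """Widen int8 tensors to int16 and subtract the quantization zero
    points, so a plain (non-quantized) transpose convolution can run on
    zero-centered values. int16 is wide enough: an int8 value minus an
    int8-range zero point always stays within [-255, 255]."""
    data_i16 = data_i8.astype(np.int16) - np.int16(data_zero_point)
    kernel_i16 = kernel_i8.astype(np.int16) - np.int16(kernel_zero_point)
    return data_i16, kernel_i16

data, kernel = widen_and_remove_offsets(
    np.array([[-3, 5]], dtype=np.int8),
    np.array([[1, -1]], dtype=np.int8),
    data_zero_point=2,
    kernel_zero_point=0,
)
# data is now int16 [[-5, 3]]; kernel is int16 [[1, -1]]
```

Doing the subtraction before the convolution (rather than folding the zero points into the accumulator afterwards) keeps the legalized graph target-independent, which matches the reviewer's suggestion to apply the transformation for all targets.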