[WIP] Support deploy MMRazor quantized model #1471
pppppM wants to merge 23 commits into open-mmlab:dev-1.x from pppppM:adapt_razor_quantize
Motivation
The related PR in MMRazor is open-mmlab/mmrazor#365. MMRazor is developing quantization algorithms, including PTQ and QAT.
This PR is draft code for deploying MMRazor quantized models in MMDeploy, mainly covering the following two points.
Export FX Graph
An MMRazor quantized model is an FX graph, and the current function rewriter cannot handle FX graphs correctly. The function rewriter has been fine-tuned in this PR so that it handles FX graphs correctly.
Export Quantized ONNX
Different backends use different ONNX formats for quantized models; the quantized ONNX exporters for TensorRT and OpenVINO are implemented in this PR.
Modification
Function Rewriter
The original function rewriter is a wrapper, and the first argument of the rewritten function is ctx. In order to process FX graphs, the wrapper is no longer used in this PR: the original function is directly replaced by the rewritten function, ctx is removed from the arguments, and ctx becomes a global variable (see the sketch below).
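A minimal sketch of the rewriter style this implies, assuming the global context is exposed through an accessor such as FUNCTION_REWRITER.get_context(); the relu rewrite and the accessor name are illustrative, not the exact code in this PR:

```python
from mmdeploy.core import FUNCTION_REWRITER


@FUNCTION_REWRITER.register_rewriter(
    func_name='torch.nn.functional.relu')
def relu__default(input, inplace=False):
    # The rewritten function keeps the original signature, so FX tracing can
    # record it like an ordinary call; the context is looked up globally
    # instead of being injected as the first argument.
    ctx = FUNCTION_REWRITER.get_context()
    return ctx.origin_func(input, inplace=inplace)
```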
Quantize ONNX Exporter
This PR adds a fake quant symbolic op, with which a temporary, non-runnable ONNX model can be exported, as sketched below.
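As an illustration of the idea (not this PR's exact implementation), a symbolic function could map aten::fake_quantize_per_tensor_affine to a placeholder node in a custom domain; the op name mmdeploy::FakeQuantize is a hypothetical choice:

```python
from torch.onnx import register_custom_op_symbolic


def fake_quantize_per_tensor_affine(g, x, scale, zero_point, quant_min,
                                    quant_max):
    # Emit a single placeholder node instead of lowering the fake-quantize
    # call to standard ONNX ops; scale and zero point stay as inputs so a
    # later backend-specific pass can read them back.
    return g.op('mmdeploy::FakeQuantize', x, scale, zero_point)


register_custom_op_symbolic(
    'aten::fake_quantize_per_tensor_affine',
    fake_quantize_per_tensor_affine,
    opset_version=13)
```

The model exported with this symbolic cannot be run directly, which is why it is only a temporary intermediate.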
Then, each backend's quantized ONNX exporter converts this temporary model into the final deployed ONNX; an illustrative conversion pass follows.
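For example, a TensorRT-style pass could rewrite each placeholder into a QuantizeLinear/DequantizeLinear pair. This sketch assumes the placeholder from the previous step, with op_type FakeQuantize and inputs (x, scale, zero_point); it is not the PR's actual exporter:

```python
import onnx
from onnx import helper


def convert_fake_quant_to_qdq(model: onnx.ModelProto) -> onnx.ModelProto:
    """Rewrite placeholder FakeQuantize nodes into explicit QDQ pairs."""
    graph = model.graph
    new_nodes = []
    for node in graph.node:
        if node.op_type != 'FakeQuantize':
            new_nodes.append(node)
            continue
        x, scale, zero_point = node.input
        quantized = node.output[0] + '_quantized'
        # QuantizeLinear maps float values to integers and DequantizeLinear
        # maps them back, producing the explicit QDQ pattern TensorRT consumes.
        new_nodes.append(
            helper.make_node('QuantizeLinear', [x, scale, zero_point],
                             [quantized]))
        new_nodes.append(
            helper.make_node('DequantizeLinear',
                             [quantized, scale, zero_point],
                             list(node.output)))
    del graph.node[:]
    graph.node.extend(new_nodes)
    return model
```

OpenVINO's exporter would follow the same structure but emit the format its runtime expects.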