
[Quantization speedup] Support TensorRT 8.0.0 #3866

Merged · 3 commits into microsoft:master on Jul 9, 2021

Conversation

linbinskn
Contributor

This PR aims to support the latest TensorRT version. The current quantization speedup tool is implemented on top of TensorRT 7.0. However, the TensorRT Python API changed in version 8.0: network definition and builder configuration are now separated, and all low-precision settings have been moved to IBuilderConfig, so our current implementation no longer works with it (the problem raised in issue #3857). This PR adds support for the new TensorRT version and its API.
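For readers unfamiliar with the change, here is a minimal sketch (not code from this PR) of what the API split looks like for an INT8 engine build. The ONNX parsing path, the `calibrator` argument, and the function name are assumptions made for illustration only.

```python
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

def build_int8_engine(onnx_path, calibrator, max_workspace_size=1 << 30):
    """Hypothetical helper: build an INT8 engine with the TensorRT >= 8.0 API."""
    builder = trt.Builder(TRT_LOGGER)
    # Network definition is created separately from the build configuration.
    network = builder.create_network(
        1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
    parser = trt.OnnxParser(network, TRT_LOGGER)
    with open(onnx_path, 'rb') as f:
        parser.parse(f.read())

    # TensorRT < 8.0 set precision on the builder itself, e.g.:
    #   builder.int8_mode = True
    #   builder.int8_calibrator = calibrator
    #   engine = builder.build_cuda_engine(network)
    # In TensorRT >= 8.0, these settings live on IBuilderConfig instead.
    config = builder.create_builder_config()
    config.max_workspace_size = max_workspace_size
    config.set_flag(trt.BuilderFlag.INT8)
    config.int8_calibrator = calibrator
    return builder.build_engine(network, config)
```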

@J-shang
Contributor

J-shang commented Jul 1, 2021

Does this mean we upgrade the TensorRT dependency to >= 8.0? Is this upgrade available for most users?
And please update the quantization speedup doc for this change.

@linbinskn
Contributor Author

> Does this mean we upgrade the TensorRT dependency to >= 8.0? Is this upgrade available for most users?
> And please update the quantization speedup doc for this change.

@J-shang Good point! I have updated the quantization speedup doc. I think separating network definition from configuration in the latest TensorRT version is reasonable, and supporting the latest version is necessary for us. I believe most people will use the latest version, especially those who want to try mixed precision in TensorRT.

J-shang merged commit a4760ce into microsoft:master on Jul 9, 2021