Quantization problem with int8 #10240
lingl-space started this conversation in General
I use the base model for precision=fp32 inference and the corresponding slim model for precision=int8 inference. However, when ocr.ocr() is run in main.cpp, the measured fp32 inference time is only half that of int8. This is the exact opposite of the speedup that int8 quantization is supposed to provide.
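In case it helps reproduce the comparison, below is a minimal timing sketch (a sketch only: `avg_ms`, `ocr_fp32`, `ocr_int8`, and `img_list` are illustrative placeholders, not the actual PPOCR API from main.cpp). It warms up before timing so that one-off costs such as engine initialization or int8 calibration loading are not counted against either precision:

```cpp
#include <chrono>
#include <iostream>

// Generic timing helper: run `warmup` untimed iterations first so one-off
// costs (engine build, allocation, int8 calibration loading) are excluded,
// then report the average over `iters` timed runs.
template <typename F>
double avg_ms(F&& run_once, int warmup = 3, int iters = 20) {
  for (int i = 0; i < warmup; ++i) run_once();
  auto t0 = std::chrono::steady_clock::now();
  for (int i = 0; i < iters; ++i) run_once();
  auto t1 = std::chrono::steady_clock::now();
  return std::chrono::duration<double, std::milli>(t1 - t0).count() / iters;
}

int main() {
  // Placeholders: substitute the real engine objects and ocr() calls
  // from main.cpp here.
  double fp32_ms = avg_ms([&] { /* ocr_fp32.ocr(img_list); */ });
  double int8_ms = avg_ms([&] { /* ocr_int8.ocr(img_list); */ });
  std::cout << "fp32: " << fp32_ms << " ms/run, int8: " << int8_ms
            << " ms/run\n";
  return 0;
}
```

Note that even with a steady-state measurement like this, int8 is only faster when the hardware and inference backend actually execute int8 kernels; otherwise the extra quantize/dequantize work can make it slower than fp32.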