use tensorrt to inference #4

Open

Yutong-gannis opened this issue Mar 18, 2023 · 1 comment

Comments

@Yutong-gannis

How do I use TensorRT to run inference?

@qinjian623
Contributor

To use TensorRT for inference with your PyTorch model, please follow these steps:

  1. Convert your PyTorch model to an ONNX file. If you haven't done this before, see the official PyTorch tutorial: https://pytorch.org/tutorials/advanced/super_resolution_with_onnxruntime.html. A minimal export sketch is shown after this list.
  2. Set up a TensorRT environment. The easiest way is to use NVIDIA's TensorRT Docker container from NGC: https://ngc.nvidia.com/catalog/containers/nvidia:tensorrt.
  3. With the environment ready, run the trtexec tool on the exported ONNX file. It builds an engine, runs inference on random inputs, and prints a detailed latency report; see the example invocation after the sketch below.
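A minimal sketch for step 1, assuming a torchvision ResNet-18 stands in for your model and that it takes a 1×3×224×224 input; adjust the dummy input shape, file name, and opset to match your own network:

```python
import torch
import torchvision

# Stand-in model for illustration; replace with your own trained network.
# (On torchvision < 0.13, use pretrained=False instead of weights=None.)
model = torchvision.models.resnet18(weights=None).eval()

# Dummy input matching the shape your model expects (1x3x224x224 assumed here).
dummy_input = torch.randn(1, 3, 224, 224)

torch.onnx.export(
    model,
    dummy_input,
    "model.onnx",              # output file that trtexec will consume
    input_names=["input"],
    output_names=["output"],
    opset_version=13,          # pick an opset your TensorRT version supports
)
```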

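For step 3, a typical invocation inside the container looks like `trtexec --onnx=model.onnx --saveEngine=model.plan --fp16` (the file names here are placeholders and `--fp16` is optional). trtexec builds a TensorRT engine from the ONNX file, runs it on random inputs, and prints throughput and latency statistics at the end of its log.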