How to profile cuDLA computation #31

angry-crab · 2024-03-21T04:14:34Z

Hi,
I tried to profile DLA according to this tutorial.
https://github.com/NVIDIA-AI-IOT/jetson_dla_tutorial

But I got
Error[1]: [runtime.cpp::parsePlan::314] Error Code 1: Serialization (Serialization assertion plan->header.magicTag == rt::kPLAN_MAGIC_TAG failed.)

It seems that TensorRT cannot serialized the loadable somehow. Some posts said this was because of mismatch of TensorRT versions, but I was using the same TensorRT for building and inferring.

Therefore, I was wondering if there is a way to profile cuDLA. Thanks.

The text was updated successfully, but these errors were encountered:

lynettez · 2024-03-25T03:45:31Z

Hi @angry-crab, TensorRT can only build the loadable, but is unable to load it. We should use cuDLA API to load and execute it,
cuDLA samples can be found in https://github.com/NVIDIA/cuda-samples/tree/master/Samples/4_CUDA_Libraries/cuDLAHybridMode and https://github.com/NVIDIA/cuda-samples/tree/master/Samples/4_CUDA_Libraries/cuDLAStandaloneMode

angry-crab · 2024-03-28T05:56:04Z

Hi @angry-crab, TensorRT can only build the loadable, but is unable to load it. We should use cuDLA API to load and execute it, cuDLA samples can be found in https://github.com/NVIDIA/cuda-samples/tree/master/Samples/4_CUDA_Libraries/cuDLAHybridMode and https://github.com/NVIDIA/cuda-samples/tree/master/Samples/4_CUDA_Libraries/cuDLAStandaloneMode

Hi,
thank you for the info. However, I would like to profile cuDLA internal computations, such matmul, conv, etc. Is there a way to do that?

lynettez · 2024-09-02T06:05:27Z

sorry for the late reply. @angry-crab here are the samples that used to provide layerwise statistics to the application.
https://github.com/NVIDIA/Deep-Learning-Accelerator-SW/tree/main/samples/cuDLA
Please check if cudlaExternalEtbl.hpp is available on your platform. Layer-wise profiling is a new feature that may not be supported on some older platforms.

lynettez added the question Further information is requested label Mar 25, 2024

lynettez self-assigned this Mar 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to profile cuDLA computation #31

How to profile cuDLA computation #31

angry-crab commented Mar 21, 2024

lynettez commented Mar 25, 2024

angry-crab commented Mar 28, 2024

lynettez commented Sep 2, 2024

How to profile cuDLA computation #31

How to profile cuDLA computation #31

Comments

angry-crab commented Mar 21, 2024

lynettez commented Mar 25, 2024

angry-crab commented Mar 28, 2024

lynettez commented Sep 2, 2024