lucienne999/DLA-Convert

DLA Converter

This repo converts ONNX models for cuDLA on Orin / Xavier devices; it requires the cudla library.

  • The conversion strictly prohibits GPU fallback, so the whole model must run on the DLA of the Orin/Xavier device.
  • The converted models can be efficiently executed using the cudla library.

Usage

  1. install

    mkdir build && cd build && cmake .. && make -j4 && cd ..
    
  2. convert

    ./build/dla_converter/dla_int8  ./tests/data/test.onnx  ./tests/data/test.calib
    ./build/dla_converter/dla_fp16  ./tests/data/test.onnx  
    
    • INT8 conversion requires a calibration file; FP16 conversion takes only the ONNX model. See TensorRT-Samples for more details.
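The choice between the two converters above can be sketched as a small shell helper. This is only an illustration: the function name `dla_convert_cmd` is hypothetical and not part of this repo, and the binary paths are assumed to be the ones shown in the convert step. It prints the command instead of running it, so the selection can be checked first.

```shell
# Hypothetical helper (not part of this repo): choose the converter binary
# depending on whether a calibration file is supplied.
# Assumes the binary paths shown in the convert step above.
dla_convert_cmd() {
    dcc_onnx="$1"
    dcc_calib="${2:-}"
    if [ -n "$dcc_calib" ]; then
        # A calibration file was given: use the INT8 converter.
        printf '%s %s %s\n' "./build/dla_converter/dla_int8" "$dcc_onnx" "$dcc_calib"
    else
        # No calibration file: use the FP16 converter.
        printf '%s %s\n' "./build/dla_converter/dla_fp16" "$dcc_onnx"
    fi
}

# Print both variants for the sample model from the usage section.
dla_convert_cmd ./tests/data/test.onnx ./tests/data/test.calib
dla_convert_cmd ./tests/data/test.onnx
```

To actually run the selected converter from the repo root, the printed line can be passed to the shell, for example with `sh -c "$(dla_convert_cmd model.onnx)"`.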

Known Issues

  1. If, after installation, you encounter the error [eglUtils.cpp::operator()::105] Error Code 2: Internal Error (Assertion (eglCreateStreamKHR) != nullptr failed.), the nvidia-l4t-3d-core package is likely missing. Install it with: apt install nvidia-l4t-3d-core -y

    For more information, please refer to the discussion: Internal Error Assertion Failed (eglCreateStreamKHR)

  2. DLA has limited support for certain operations. Please review the link to ensure the operations used in your ONNX models are supported. In case of conversion failure, it might be due to unsupported operations.
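For the first issue, a pre-flight check along these lines may help confirm whether the package is present before running the converter. It is a sketch that assumes a Debian-based L4T image where `dpkg-query` is available; the function names `is_installed_status` and `check_pkg` are hypothetical.

```shell
# Hypothetical pre-flight check (not part of this repo).
# Assumes a Debian-based L4T image where dpkg-query is available.

# Return 0 only when a dpkg status string denotes a fully installed package.
is_installed_status() {
    [ "$1" = "install ok installed" ]
}

# Query the status of a package by name; any query failure counts as "not installed".
check_pkg() {
    cp_status="$(dpkg-query -W -f='${Status}' "$1" 2>/dev/null || true)"
    is_installed_status "$cp_status"
}

if check_pkg nvidia-l4t-3d-core; then
    echo "nvidia-l4t-3d-core is installed"
else
    echo "nvidia-l4t-3d-core is missing; try: apt install nvidia-l4t-3d-core -y"
fi
```

Comparing against the full status string, rather than only checking that the package name is known to dpkg, avoids treating a removed package with leftover config files as installed.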

Some Engineering Problems

  1. When running two DLA models and one GPU model concurrently on the same Orin device, the GPU model was observed to become slower.

    This appears to be expected behavior. For further discussion, see: https://forums.developer.nvidia.com/t/run-pure-conv2d-node-on-dla-makes-gpu-get-slower/219770/8
