Skip to content
View Tingwei-Jen's full-sized avatar

Block or report Tingwei-Jen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. YOLOv8_TensorRT_CUDA_DeepSort YOLOv8_TensorRT_CUDA_DeepSort Public

    Object tracking implemented with YOLOv8, TensorRT, CUDA, DeepSort, and Pytorch.

    C++ 2 1

  2. Reduction_Optimization Reduction_Optimization Public

    Cuda

  3. SGEMM_Optimization SGEMM_Optimization Public

    Optimized Single-Precision General Matrix Multiplication (SGEMM) using CUDA, achieving 89% of cuBLAS performance.

    Cuda

  4. Nsight_Compute_Tutorial Nsight_Compute_Tutorial Public

  5. CUDA_Example CUDA_Example Public

    C++

  6. FlashAttention FlashAttention Public

    C++