Software Engineer | Algorithm Engineer
-
Gallopwave
- Taipei, Taiwan
- in/tingwei-jen-225bb911a
Pinned Loading
-
YOLOv8_TensorRT_CUDA_DeepSort
YOLOv8_TensorRT_CUDA_DeepSort PublicObject tracking implemented with YOLOv8, TensorRT, CUDA, DeepSort, and Pytorch.
-
-
SGEMM_Optimization
SGEMM_Optimization PublicOptimized Single-Precision General Matrix Multiplication (SGEMM) using CUDA, achieving 89% of cuBLAS performance.
Cuda
-
-
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.