Efficient Triton Kernels for LLM Training
-
Updated
Dec 18, 2024 - Python
Efficient Triton Kernels for LLM Training
FlagGems is an operator library for large language models implemented in Triton Language.
LLM notes, including model inference, transformer model structure, and llm framework code analysis notes
The llama model inference lite framework by triton.
KernelHeim – development ground of custom Triton and CUDA kernel functions designed to optimize and accelerate machine learning workloads on NVIDIA GPUs. Inspired by the mythical stronghold of the gods, KernelHeim is a forge where high-performance kernels are crafted to unlock the full potential of the hardware.
Add a description, image, and links to the triton-kernels topic page so that developers can more easily learn about it.
To associate your repository with the triton-kernels topic, visit your repo's landing page and select "manage topics."