Interested in Deep Learning (self-supervised learning & LLMs), Astrophysics (exoplanets), and Cosmology (CMB). I like to build things.
- New York, NY
- @vgoklani_ai
Pinned
- pytorch/pytorch: Tensors and Dynamic neural networks in Python with strong GPU acceleration
- NVIDIA/TransformerEngine: A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…
- RedisTimeSeries/RedisTimeSeries: Time Series data structure for Redis
- IST-DASLab/gptq: Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
- fpgaminer/GPTQ-triton: GPTQ inference Triton kernel
- Dao-AILab/flash-attention: Fast and memory-efficient exact attention