saeyoonoh

0 followers · 1 following

Popular repositories

TensorRT-LLM (Public)

Forked from NVIDIA/TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++

speculative-decoding (Public)

Forked from lucidrains/speculative-decoding

Explorations into some recent techniques surrounding speculative decoding

Python

gpt-fast (Public)

Forked from pytorch-labs/gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Python

.dotfiles (Public)

Forked from saeyoon17/.dotfiles

Vim Script

vllm (Public)

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python

EAGLE (Public)

Forked from SafeAILab/EAGLE

Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)

Python