dongxianzhe

Follow

Xianzhe Dong dongxianzhe

Follow

4 followers · 5 following

Achievements

Achievements

Popular repositories Loading

ScaleLLM ScaleLLM Public

Forked from vectorch-ai/ScaleLLM

A high-performance inference system for large language models, designed for production environments.

C++
BertWithPretrained BertWithPretrained Public

Forked from moon-hotel/BertWithPretrained

An implementation of the BERT model and its related downstream tasks based on the PyTorch framework

Python
flashinfer flashinfer Public

Forked from flashinfer-ai/flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda
sarathi-serve sarathi-serve Public

Forked from microsoft/sarathi-serve

A low-latency & high-throughput serving engine for LLMs

Python
DistServe DistServe Public

Forked from LLMServe/DistServe

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook
vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python