adapt vllm distributed module to sglang #2244
Conversation
QQ: why not use v0.6.4.post1?
I see pyproject.toml requires vllm>=0.6.3.post1.
This is to maintain compatibility: 0.6.3.post1 uses torch 2.4, and 0.6.4.post1 uses torch 2.5.1. The current main branch is compatible with both torch 2.4 and torch 2.5.1. For the distributed part, I suggest updating directly to v0.6.4.post1.
OK, it currently uses 0.6.4.post1 as the base.
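(Aside, not part of the PR: a minimal sketch of how the torch/vllm pairing discussed above could be surfaced at runtime. The version pairs come from the comment above; the check itself and its placement are assumptions.)

```python
# Hypothetical runtime guard for the compatibility constraint discussed above:
# vllm 0.6.3.post1 pairs with torch 2.4, vllm 0.6.4.post1 pairs with torch 2.5.1.
from importlib.metadata import version
from packaging.version import Version

torch_v = Version(version("torch"))
vllm_v = Version(version("vllm"))

if vllm_v >= Version("0.6.4.post1") and torch_v < Version("2.5"):
    raise RuntimeError(
        f"vllm {vllm_v} expects torch>=2.5.1, but torch {torch_v} is installed"
    )
```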
Overall LGTM, just left some comments. Thanks!
Review comment (resolved): python/sglang/srt/distributed/device_communicators/shm_broadcast.py
Review comment (resolved): python/sglang/srt/distributed/device_communicators/xpu_communicator.py
LGTM. I think we can merge it into remove-vllm-distributed and verify afterward. You can then create another PR from remove-vllm-distributed.
Review comment (resolved): python/sglang/srt/distributed/device_communicators/custom_all_reduce.py
I think the test failure is caused by PR #2266. @zhaochenyang20
LGTM. Since it is not currently in use, I think it's safe to merge. More detailed testing and verification can be conducted when integrating the custom all-reduce CUDA kernel into sgl-kernel. cc @yizhang2077
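(Aside, not part of the PR: a minimal sketch of the kind of verification mentioned above, comparing a custom all-reduce against the NCCL result. `custom_all_reduce` is a hypothetical entry point; the check assumes an already-initialized process group.)

```python
import torch
import torch.distributed as dist

def check_all_reduce(custom_all_reduce, rank: int) -> None:
    """Compare a custom all-reduce kernel against torch.distributed.all_reduce.

    Assumes dist.init_process_group(...) has already been called on every rank.
    `custom_all_reduce` is a placeholder for the future sgl-kernel entry point.
    """
    torch.cuda.set_device(rank)
    x = torch.randn(1024, device="cuda")
    ref = x.clone()
    dist.all_reduce(ref)            # NCCL reference result
    out = custom_all_reduce(x)      # hypothetical custom kernel
    torch.testing.assert_close(out, ref, rtol=1e-3, atol=1e-3)
```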
Motivation
Modifications
Move the vllm distributed module (v0.6.4.post1) into sglang; the current models are still using the vllm.distributed module.
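(Illustration only, not from the PR: what the adaptation amounts to for callers once models are switched over in a follow-up PR. The symbol name follows vllm's distributed API and is an assumption here.)

```python
# Before (models today): the distributed primitives come from vllm.
# from vllm.distributed import get_tensor_model_parallel_world_size

# After the follow-up PR: the same primitives are imported from the copy
# that this PR adds under python/sglang/srt/distributed/.
from sglang.srt.distributed import get_tensor_model_parallel_world_size

tp_size = get_tensor_model_parallel_world_size()
```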
Checklist
What to do Next