Just curious: are the following imports in model_runner.py also being considered for removal in later stages?
```python
from vllm.config import DeviceConfig, LoadConfig
from vllm.config import ModelConfig as VllmModelConfig
from vllm.distributed import (
    get_tp_group,
    init_distributed_environment,
    initialize_model_parallel,
    set_custom_all_reduce,
)
from vllm.distributed.parallel_state import in_the_same_node_as
from vllm.model_executor.model_loader import get_model
from vllm.model_executor.models import ModelRegistry
```
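As context for the last import above, replacing `ModelRegistry` would mostly mean keeping a local mapping from architecture names to model classes. A minimal sketch under assumed names (`register_model` and `resolve_model_cls` are illustrative, not the project's actual API):

```python
# Hypothetical local model registry that could stand in for
# vllm.model_executor.models.ModelRegistry. All names here are
# illustrative assumptions, not the actual replacement API.
_MODEL_REGISTRY: dict[str, type] = {}


def register_model(arch: str):
    """Class decorator mapping an architecture name to a model class."""
    def wrapper(cls: type) -> type:
        _MODEL_REGISTRY[arch] = cls
        return cls
    return wrapper


def resolve_model_cls(arch: str) -> type:
    """Look up the model class for a checkpoint's architecture string."""
    try:
        return _MODEL_REGISTRY[arch]
    except KeyError:
        raise ValueError(f"Unsupported architecture: {arch}") from None
```

With this pattern, each model file registers itself at import time instead of being enumerated inside vLLM.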
Motivation
This is a tracker for removing vLLM dependencies from general model code (not considering quantization). These are our current imports from vLLM, and we want to remove all of them.
Tracker
- `CacheConfig`: [1/N] Remove CacheConfig import in all model files #1658
- `get_tensor_model_parallel_world_size`
- `ParallelLMHead`: Update vocab embedding deps and add TP switch #1856
- `VocabParallelEmbedding`: Update vocab embedding deps and add TP switch #1856
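For the `get_tensor_model_parallel_world_size` item, one possible shape for the replacement is a small local parallel-state module that model code imports instead of `vllm.distributed`. A minimal sketch, with assumed names that mirror the vLLM functions being removed (not the actual in-tree implementation):

```python
# Hypothetical local parallel-state shim replacing the vLLM import.
# The module-level state and function names are illustrative only.
_TP_WORLD_SIZE = 1  # default: no tensor parallelism


def initialize_model_parallel(tensor_parallel_size: int = 1) -> None:
    """Record the TP degree once at startup (e.g. from server args)."""
    global _TP_WORLD_SIZE
    _TP_WORLD_SIZE = tensor_parallel_size


def get_tensor_model_parallel_world_size() -> int:
    """Drop-in local replacement for the vLLM helper of the same name."""
    return _TP_WORLD_SIZE
```

Model files would then switch only their import path, keeping call sites such as the TP switch in the vocab embedding unchanged.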