Did we get a request for extra sampling parameters from someone?
Just to clarify why we are not using the sampling params model from vLLM (vllm.sampling_params.SamplingParams):
vLLM uses msgspec for SamplingParams, not Pydantic, so it will not work well as a parameter to the endpoints.
We want to support the same SamplingParams across different text generation deployments. Right now we support vLLM and HuggingFace Transformers, and HQQ text generation models are coming soon, so we don't want to depend on vLLM.
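To illustrate the point about staying independent of vLLM, here is a minimal sketch of a backend-neutral parameter object with one adapter per backend. It is an assumption, not the project's actual model: the class, its fields, and the helpers to_vllm_kwargs and to_hf_kwargs are hypothetical names, and a stdlib dataclass stands in for the Pydantic model so the sketch has no dependencies. The only backend names it relies on are the keywords temperature, top_p, top_k and max_tokens of vLLM's SamplingParams, and do_sample, temperature, top_p, top_k and max_new_tokens of Transformers' generate().

```python
from dataclasses import asdict, dataclass
from typing import Any, Dict


@dataclass
class SamplingParams:
    """Hypothetical backend-neutral sampling parameters (illustration only)."""

    temperature: float = 1.0
    top_p: float = 1.0
    top_k: int = -1  # in this sketch, -1 means "do not restrict to top-k"
    max_tokens: int = 256


def to_vllm_kwargs(params: SamplingParams) -> Dict[str, Any]:
    """Keyword arguments intended for vllm.sampling_params.SamplingParams(**kwargs)."""
    # Field names were chosen to match vLLM's keywords, so a plain dict suffices.
    return asdict(params)


def to_hf_kwargs(params: SamplingParams) -> Dict[str, Any]:
    """Keyword arguments intended for a Transformers model.generate(**kwargs) call."""
    kwargs: Dict[str, Any] = {"max_new_tokens": params.max_tokens}
    if params.temperature > 0:
        # Sampling mode: pass the sampling knobs through.
        kwargs.update(
            do_sample=True,
            temperature=params.temperature,
            top_p=params.top_p,
        )
        if params.top_k > 0:
            kwargs["top_k"] = params.top_k
    else:
        # Temperature 0 is treated here as greedy decoding.
        kwargs["do_sample"] = False
    return kwargs
```

With this shape, the endpoint only ever sees the neutral object, and each deployment translates it at the last moment, so adding another backend would mean adding one more adapter function rather than changing the endpoint schema.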
Feature Summary
Justification/Rationale
Proposed Implementation (if any)
Use the vllm.sampling_params model as input directly instead of declaring the SamplingParam in the SDK (it can be too many arguments for the user).