[python] Fix new logprobs computation in vllm_utils #2146

sindhuvahinis · 2024-07-04T01:24:21Z

Description

This bug is only in master, not in 0.28.0. This fixes the LMI no-code low code CI failures.

Even if we set logprobs=1, sometimes, vLLM sends more than one log probabilities. Here for new log probs, we add all the log probabilities that are return by vLLM to new_logprobs dict.

But when we determine whether it is last token or not, i == (len(new_logprobs) -1) and this fails, because now it has more than one new probs, this case will never be true. So last_token never occurred, so it returned broken json without any details. Hence the CI failed.

Will add unit test cases for this use-cases as well in the next PR.

[python] Fix new logprobs computation in vllm_utils

f6ca674

sindhuvahinis requested review from zachgk, frankfliu and a team as code owners July 4, 2024 01:24

lanking520 approved these changes Jul 4, 2024

View reviewed changes

sindhuvahinis merged commit 46e05cb into deepjavalibrary:master Jul 4, 2024
9 checks passed

sindhuvahinis deleted the ci branch July 10, 2024 19:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[python] Fix new logprobs computation in vllm_utils #2146

[python] Fix new logprobs computation in vllm_utils #2146

sindhuvahinis commented Jul 4, 2024

[python] Fix new logprobs computation in vllm_utils #2146

[python] Fix new logprobs computation in vllm_utils #2146

Conversation

sindhuvahinis commented Jul 4, 2024

Description