
[Bug]: stream_chunk_builder fails to include tool_calls preceded by content #2716

Open · stephenfreund opened this issue Mar 27, 2024 · 6 comments
Labels: bug (Something isn't working)

@stephenfreund

What happened?

If the stream produced by a call to litellm.completion(..., stream=True) contains content deltas before tool-call deltas, the result of calling stream_chunk_builder on the list of all chunks from the stream omits the tool calls. The test case below demonstrates the issue on runs in which gpt-4 responds first with content and then with a call to get_current_weather. The output of one such run is attached.

import litellm

prompt = """
Tell me word that starts with "aar".
Then tell me the weather in San Francisco.
"""

messages = [{"role": "user", "content": prompt}]

tools = [
    {
        "type": "function",
        "function": {
            "name": "get_current_weather",
            "description": "Get the current weather in a given location",
            "parameters": {
                "type": "object",
                "properties": {
                    "location": {
                        "type": "string",
                        "description": "The city and state, e.g. San Francisco, CA",
                }
                },
                "required": ["location"],
            },
        },
    }
]

stream = litellm.completion(
    model="gpt-4",
    messages=messages,
    tools=tools,
    stream=True
)

chunks = []
for chunk in stream:
    print(chunk.choices[0].delta)
    chunks.append(chunk)

response = litellm.stream_chunk_builder(chunks, messages=messages)

print("\n-----\n")
print("stream_chunk_builder result:\n", response)

Relevant log output

Delta(content='The', role='assistant', function_call=None, tool_calls=None)
Delta(content=' word', role=None, function_call=None, tool_calls=None)
Delta(content=' that', role=None, function_call=None, tool_calls=None)
Delta(content=' starts', role=None, function_call=None, tool_calls=None)
Delta(content=' with', role=None, function_call=None, tool_calls=None)
Delta(content=' "', role=None, function_call=None, tool_calls=None)
Delta(content='aar', role=None, function_call=None, tool_calls=None)
Delta(content='"', role=None, function_call=None, tool_calls=None)
Delta(content=' is', role=None, function_call=None, tool_calls=None)
Delta(content=' "', role=None, function_call=None, tool_calls=None)
Delta(content='a', role=None, function_call=None, tool_calls=None)
Delta(content='ard', role=None, function_call=None, tool_calls=None)
Delta(content='v', role=None, function_call=None, tool_calls=None)
Delta(content='ark', role=None, function_call=None, tool_calls=None)
Delta(content='".\n\n', role=None, function_call=None, tool_calls=None)
Delta(content='Let', role=None, function_call=None, tool_calls=None)
Delta(content=' me', role=None, function_call=None, tool_calls=None)
Delta(content=' check', role=None, function_call=None, tool_calls=None)
Delta(content=' the', role=None, function_call=None, tool_calls=None)
Delta(content=' weather', role=None, function_call=None, tool_calls=None)
Delta(content=' in', role=None, function_call=None, tool_calls=None)
Delta(content=' San', role=None, function_call=None, tool_calls=None)
Delta(content=' Francisco', role=None, function_call=None, tool_calls=None)
Delta(content=' for', role=None, function_call=None, tool_calls=None)
Delta(content=' you', role=None, function_call=None, tool_calls=None)
Delta(content='.', role=None, function_call=None, tool_calls=None)
Delta(content=None, role=None, function_call=None, tool_calls=[ChatCompletionDeltaToolCall(id='call_kT6n9k2oyBaNtko9oqluSjvY', function=Function(arguments='', name='get_current_weather'), type='function', index=0)])
Delta(content=None, role=None, function_call=None, tool_calls=[ChatCompletionDeltaToolCall(id=None, function=Function(arguments='{\n', name=None), type=None, index=0)])
Delta(content=None, role=None, function_call=None, tool_calls=[ChatCompletionDeltaToolCall(id=None, function=Function(arguments=' ', name=None), type=None, index=0)])
Delta(content=None, role=None, function_call=None, tool_calls=[ChatCompletionDeltaToolCall(id=None, function=Function(arguments=' "', name=None), type=None, index=0)])
Delta(content=None, role=None, function_call=None, tool_calls=[ChatCompletionDeltaToolCall(id=None, function=Function(arguments='location', name=None), type=None, index=0)])
Delta(content=None, role=None, function_call=None, tool_calls=[ChatCompletionDeltaToolCall(id=None, function=Function(arguments='":', name=None), type=None, index=0)])
Delta(content=None, role=None, function_call=None, tool_calls=[ChatCompletionDeltaToolCall(id=None, function=Function(arguments=' "', name=None), type=None, index=0)])
Delta(content=None, role=None, function_call=None, tool_calls=[ChatCompletionDeltaToolCall(id=None, function=Function(arguments='San', name=None), type=None, index=0)])
Delta(content=None, role=None, function_call=None, tool_calls=[ChatCompletionDeltaToolCall(id=None, function=Function(arguments=' Francisco', name=None), type=None, index=0)])
Delta(content=None, role=None, function_call=None, tool_calls=[ChatCompletionDeltaToolCall(id=None, function=Function(arguments=',', name=None), type=None, index=0)])
Delta(content=None, role=None, function_call=None, tool_calls=[ChatCompletionDeltaToolCall(id=None, function=Function(arguments=' CA', name=None), type=None, index=0)])
Delta(content=None, role=None, function_call=None, tool_calls=[ChatCompletionDeltaToolCall(id=None, function=Function(arguments='"\n', name=None), type=None, index=0)])
Delta(content=None, role=None, function_call=None, tool_calls=[ChatCompletionDeltaToolCall(id=None, function=Function(arguments='}', name=None), type=None, index=0)])
Delta(content=None, role=None, function_call=None, tool_calls=None)

-----

stream_chunk_builder result:
 ModelResponse(id='chatcmpl-97QlEIdc7JfojkM4G6uwSc4GZT7Y8', choices=[Choices(finish_reason='tool_calls', index=0, message=Message(content='The word that starts with "aar" is "aardvark".\n\nLet me check the weather in San Francisco for you.', role='assistant'))], created=1711558192, model='gpt-4', object='chat.completion', system_fingerprint=None, usage=Usage(completion_tokens=26, prompt_tokens=26, total_tokens=52))
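
For reference, until the builder handles this case, the chunks can be aggregated by hand. The sketch below is not a litellm API; merge_chunks is a hypothetical helper that relies only on the Delta and ChatCompletionDeltaToolCall shapes visible in the log above:

def merge_chunks(chunks):
    """Merge streamed deltas into (content, tool_calls) by hand."""
    content_parts = []
    tool_calls = {}  # tool-call index -> accumulated id/name/arguments
    for chunk in chunks:
        delta = chunk.choices[0].delta
        if delta.content:
            content_parts.append(delta.content)
        for tc in delta.tool_calls or []:
            acc = tool_calls.setdefault(
                tc.index, {"id": None, "name": None, "arguments": ""}
            )
            if tc.id:
                acc["id"] = tc.id
            if tc.function.name:
                acc["name"] = tc.function.name
            if tc.function.arguments:
                acc["arguments"] += tc.function.arguments
    return "".join(content_parts), [tool_calls[i] for i in sorted(tool_calls)]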


@stephenfreund stephenfreund added the bug Something isn't working label Mar 27, 2024
@krrishdholakia
Contributor

Hey @stephenfreund, can you share what the expected response here should be?

I can fix it accordingly and add it to our CI/CD!

Thanks for this issue

@stephenfreund
Author

Hi,

Absolutely. Here's an example of what I would expect.

When I create a completion without streaming:

response = litellm.completion(
    model="gpt-4",
    messages=messages,
    tools=tools,
)

I get responses like this:

ModelResponse(
    id="chatcmpl-97lEc2asXEk3w7yHiVH6nUqqIx7FD",
    choices=[
        Choices(
            finish_reason="tool_calls",
            index=0,
            message=Message(
                content='The word "Aardvark" starts with "aar".\n\nLet me check the weather in San Francisco for you.',
                role="assistant",
                tool_calls=[
                    ChatCompletionMessageToolCall(
                        function=Function(
                            arguments='{\n  "location": "San Francisco, CA"\n}',
                            name="get_current_weather",
                        ),
                        id="call_SMaGOY7OUR47n4phOeAyjHhB",
                        type="function",
                    )
                ],
            ),
        )
    ],
    created=1711636894,
    model="gpt-4-0613",
    object="chat.completion",
    system_fingerprint=None,
    usage=Usage(completion_tokens=44, prompt_tokens=82, total_tokens=126),
)

where the response Message has both the content and the tool calls. I would expect the response built with stream_chunk_builder to have this same form whenever a model responds with both content and a tool call in the streamed version.
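
Concretely, a regression check along these lines (reusing the messages and tools from the repro above) would capture that expectation; the tool_calls assertions are the ones that currently fail:

chunks = list(litellm.completion(
    model="gpt-4", messages=messages, tools=tools, stream=True
))
rebuilt = litellm.stream_chunk_builder(chunks, messages=messages)
message = rebuilt.choices[0].message

assert message.content  # the text produced before the tool call
assert message.tool_calls  # currently missing -- this is the bug
assert message.tool_calls[0].function.name == "get_current_weather"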

Happy to provide any other details that would be useful. Thanks!

Steve.

@aantn

aantn commented Nov 10, 2024

Does this still occur?


github-actions bot commented Feb 9, 2025

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs.

@github-actions github-actions bot added the stale label Feb 9, 2025
@krrishdholakia
Contributor

Not stale - still needs to be investigated. Adding to the Feb 2025 roadmap.

@github-actions github-actions bot removed the stale label Feb 10, 2025
@krrishdholakia
Contributor

cc: @vibhavbhat
