ollama[patch]: permit streaming for tool calls #28654

ccurme · 2024-12-10T17:52:37Z

Resolves #28543

Ollama recently released support for streaming tool calls. Previously we would override the stream parameter if tools were passed in.

Covered in standard tests here:

langchain/libs/standard-tests/langchain_tests/integration_tests/chat_models.py

Lines 893 to 897 in c1d348e

    
           full: Optional[BaseMessageChunk] = None 
        
           for chunk in model_with_tools.stream(query): 
        
               full = chunk if full is None else full + chunk  # type: ignore 
        
           assert isinstance(full, AIMessage) 
        
           _validate_tool_call_message(full)

Before, the test generates one message chunk:

[
    AIMessageChunk(
        content='',
        additional_kwargs={},
        response_metadata={
            'model': 'llama3.1',
            'created_at': '2024-12-10T17:49:04.468487Z',
            'done': True,
            'done_reason': 'stop',
            'total_duration': 525471208,
            'load_duration': 19701000,
            'prompt_eval_count': 170,
            'prompt_eval_duration': 31000000,
            'eval_count': 17,
            'eval_duration': 473000000,
            'message': Message(
                role='assistant',
                content='',
                images=None,
                tool_calls=[
                    ToolCall(
                        function=Function(name='magic_function', arguments={'input': 3})
                    )
                ]
            )
        },
        id='run-552bbe0f-8fb2-4105-ada1-fa38c1db444d',
        tool_calls=[
            {
                'name': 'magic_function',
                'args': {'input': 3},
                'id': 'b0a4dc07-7d7a-487b-bd7b-ad062c2363a2',
                'type': 'tool_call',
            },
        ],
        usage_metadata={
            'input_tokens': 170, 'output_tokens': 17, 'total_tokens': 187
        },
        tool_call_chunks=[
            {
                'name': 'magic_function',
                'args': '{"input": 3}',
                'id': 'b0a4dc07-7d7a-487b-bd7b-ad062c2363a2',
                'index': None,
                'type': 'tool_call_chunk',
            }
        ]
    )
]

After, it generates two (tool call in one, response metadata in another):

[
    AIMessageChunk(
        content='',
        additional_kwargs={},
        response_metadata={},
        id='run-9a3f0860-baa1-4bae-9562-13a61702de70',
        tool_calls=[
            {
                'name': 'magic_function',
                'args': {'input': 3},
                'id': '5bbaee2d-c335-4709-8d67-0783c74bd2e0',
                'type': 'tool_call',
            },
        ],
        tool_call_chunks=[
            {
                'name': 'magic_function',
                'args': '{"input": 3}',
                'id': '5bbaee2d-c335-4709-8d67-0783c74bd2e0',
                'index': None,
                'type': 'tool_call_chunk',
            },
        ],
    ),
    AIMessageChunk(
        content='',
        additional_kwargs={},
        response_metadata={
            'model': 'llama3.1',
            'created_at': '2024-12-10T17:46:43.278436Z',
            'done': True,
            'done_reason': 'stop',
            'total_duration': 514282750,
            'load_duration': 16894458,
            'prompt_eval_count': 170,
            'prompt_eval_duration': 31000000,
            'eval_count': 17,
            'eval_duration': 464000000,
            'message': Message(
                role='assistant', content='', images=None, tool_calls=None
            ),
        },
        id='run-9a3f0860-baa1-4bae-9562-13a61702de70',
        usage_metadata={
            'input_tokens': 170, 'output_tokens': 17, 'total_tokens': 187
        }
    ),
]

vercel · 2024-12-10T17:52:41Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

1 Skipped Deployment

Name	Status	Preview	Comments	Updated (UTC)
langchain	⬜️ Ignored (Inspect)			Dec 10, 2024 5:52pm

permit streaming tool calls

aecaee9

dosubot bot added the size:XS This PR changes 0-9 lines, ignoring generated files. label Dec 10, 2024

ccurme mentioned this pull request Dec 10, 2024

ollama: support streaming tool calls #28543

Closed

1 task

ccurme merged commit bc4dc7f into master Dec 10, 2024
20 checks passed

ccurme deleted the cc/ollama branch December 10, 2024 17:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ollama[patch]: permit streaming for tool calls #28654

ollama[patch]: permit streaming for tool calls #28654

ccurme commented Dec 10, 2024 •

edited

Loading

vercel bot commented Dec 10, 2024

	full: Optional[BaseMessageChunk] = None
	for chunk in model_with_tools.stream(query):
	full = chunk if full is None else full + chunk # type: ignore
	assert isinstance(full, AIMessage)
	_validate_tool_call_message(full)

ollama[patch]: permit streaming for tool calls #28654

ollama[patch]: permit streaming for tool calls #28654

Conversation

ccurme commented Dec 10, 2024 • edited Loading

vercel bot commented Dec 10, 2024

ccurme commented Dec 10, 2024 •

edited

Loading