
Added support for streaming tool calls #1184

Conversation

@davorrunje (Collaborator) commented Jan 9, 2024

This closes an issue raised in the discussion of PR #1118:

> choice.delta.function_call is deprecated and we should use choice.delta.tool_calls instead (https://platform.openai.com/docs/api-reference/chat/streaming). Since tests are passing, we are probably not triggering tool_calls but a deprecated function_call.

In addition, some type hints were fixed so that two more files pass mypy type checks. The mypy configuration itself is not part of this PR, and no new type-checking step is added here.
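The switch described above, from choice.delta.function_call to choice.delta.tool_calls, means tool-call fragments arrive spread across chunks and keyed by index, so they must be accumulated client-side. A minimal sketch of that accumulation, using plain dicts in place of the OpenAI ChoiceDelta objects (the function name and chunk shape are illustrative, not the PR's actual code):

```python
def accumulate_tool_calls(chunks):
    """Merge per-chunk tool_call deltas into complete tool calls, keyed by index.

    Each chunk mirrors a streamed choice.delta: tool_calls is a list of
    fragments carrying an index, optionally an id, and partial function
    name/arguments strings that must be concatenated in arrival order.
    """
    calls = {}
    for chunk in chunks:
        for tc in chunk.get("tool_calls") or []:
            entry = calls.setdefault(
                tc["index"], {"id": None, "name": "", "arguments": ""}
            )
            if tc.get("id"):
                entry["id"] = tc["id"]
            fn = tc.get("function") or {}
            if fn.get("name"):
                entry["name"] += fn["name"]
            if fn.get("arguments"):
                entry["arguments"] += fn["arguments"]
    return [calls[i] for i in sorted(calls)]
```

A deprecated function_call delta, by contrast, carries a single unindexed name/arguments pair, which is why code written against it never exercises this per-index merging.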

Why are these changes needed?

Related issue number

Closes #1178

Checks

@codecov-commenter commented Jan 9, 2024

Codecov Report

Attention: 68 lines in your changes are missing coverage. Please review.

Comparison: base (b548e55) 30.38% → head (1b99532) 40.47%.

Files                   Patch %   Lines
autogen/oai/client.py   35.23%    67 Missing and 1 partial ⚠️
Additional details and impacted files
@@             Coverage Diff             @@
##             main    #1184       +/-   ##
===========================================
+ Coverage   30.38%   40.47%   +10.09%     
===========================================
  Files          32       32               
  Lines        4302     4353       +51     
  Branches      994     1065       +71     
===========================================
+ Hits         1307     1762      +455     
+ Misses       2901     2464      -437     
- Partials       94      127       +33     
Flag        Coverage Δ
unittests   40.40% <36.44%> (+10.07%) ⬆️

Flags with carried-forward coverage won't be shown.


@davorrunje davorrunje linked an issue Jan 9, 2024 that may be closed by this pull request
@tyler-suard-parker (Contributor)

@davorrunje It is streaming now, thank you!

@tyler-suard-parker (Contributor) commented Jan 9, 2024

Ok this is streaming output to the terminal, which is great. However, I have AutoGen behind an API, and I want to stream the output back to the requestor. How would I do that using your repo?

@davorrunje (Collaborator, Author)

> Ok this is streaming output to the terminal, which is great. However, I have AutoGen behind an API, and I want to stream the output back to the requestor. How would I do that using your repo?

We could easily add a callback to be called; would that work for you?

@bitnom (Contributor) commented Jan 9, 2024

> > Ok this is streaming output to the terminal, which is great. However, I have AutoGen behind an API, and I want to stream the output back to the requestor. How would I do that using your repo?
>
> We could easily add a callback to be called, would that work for you?

I've been doing a lot of work in this area but need to merge it with the latest main and this PR. Some important things to consider about callbacks:

  • When a callback is invoked, it should receive all available data about the request/response, preferably all as keyword arguments.
  • The callback should be invoked for every chunk and every completed message, regardless of whether streaming is enabled.
  • Callbacks can be either sync or async.
  • The user can specify a request_id: UUID when sending messages via the user proxy, which is then passed to the callback for subsequent agent responses.
  • Each agent should have a unique agent_id: UUID, which is also passed to the callback.

At least that's how I spec it. My implementation is pretty rough at this point: it works, except that for some reason my request_id doesn't change yet when I supply a new one. I'm working on it.
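The bullet points above can be sketched as a small dispatcher that accepts either a sync or an async callback and forwards the full context as keyword arguments. All names here (dispatch_callback and its parameters) are hypothetical illustrations of the spec, not part of any merged AutoGen API:

```python
import asyncio
import inspect
import uuid


async def dispatch_callback(callback, *, request_id, agent_id, chunk=None, message=None):
    """Invoke a user-supplied callback with full request/response context.

    The callback may be a plain function or a coroutine function; in the
    latter case its result is awaited. It is called for every streamed
    chunk (chunk=...) and for every completed message (message=...).
    """
    result = callback(
        request_id=request_id, agent_id=agent_id, chunk=chunk, message=message
    )
    if inspect.isawaitable(result):
        await result
```

Passing the context as keyword-only arguments lets callbacks declare only what they need via `**kwargs`, which keeps the contract extensible as more fields are added.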

@davorrunje (Collaborator, Author)

> @davorrunje @bitnom As it is right now, I instantiate a User Proxy Agent and an Assistant Agent, then do self.user_proxy.initiate_chat. The conversation goes well, and the final answer streams to the terminal, but I need to send the stream back to a requestor. You mentioned a callback, that could be an option, or maybe make a new method, initiate_streaming_chat, which returns a generator immediately, and that generator can be used to get chunks as the conversation continues.

@tyler-suard-parker Can you please open an issue for that? We'll take it from there. I like the proposal, but it is outside the scope of this PR, which is a cleanup of an already-merged PR.

@bitnom (Contributor) commented Jan 10, 2024

> @davorrunje @bitnom As it is right now, I instantiate a User Proxy Agent and an Assistant Agent, then do self.user_proxy.initiate_chat. The conversation goes well, and the final answer streams to the terminal, but I need to send the stream back to a requestor. You mentioned a callback, that could be an option, or maybe make a new method, initiate_streaming_chat, which returns a generator immediately, and that generator can be used to get chunks as the conversation continues.

You'll be able to use a generator if you like. An arbitrary dict can be specified alongside the callback, which is always passed back to it along with the OpenAI responses. The issue I was having is resolved. Now to pay back a bit of technical debt.
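One way the generator idea discussed above could work: run the chat in a background thread, push each chunk into a queue, and hand the caller a generator immediately. Everything here (initiate_streaming_chat, run_chat, the sentinel protocol) is a hypothetical sketch, not AutoGen's actual API:

```python
import queue
import threading

# Sentinel object signalling that the conversation has finished.
_DONE = object()


def initiate_streaming_chat(run_chat):
    """Return a generator yielding chunks while run_chat drives the conversation.

    run_chat(emit) is expected to call emit(chunk) for every produced chunk;
    it runs on a daemon thread so the caller gets the generator right away.
    """
    q = queue.Queue()

    def worker():
        try:
            run_chat(q.put)
        finally:
            q.put(_DONE)  # always unblock the consumer, even on error

    threading.Thread(target=worker, daemon=True).start()

    def stream():
        while (item := q.get()) is not _DONE:
            yield item

    return stream()
```

An API server could then iterate this generator and forward each chunk to the requestor (e.g. as server-sent events), which is the "stream back to the requestor" scenario raised above.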

@sonichi sonichi added this pull request to the merge queue Jan 11, 2024
Merged via the queue into main with commit 56aed2d Jan 11, 2024
79 of 87 checks passed
@sonichi sonichi deleted the 1178-feature-request-add-support-for-tools-for-streamed-function-calls branch January 11, 2024 06:54
joshkyh pushed a commit that referenced this pull request Jan 17, 2024
* added support for streaming tool calls

* bug fix: removed tmp assert

---------

Co-authored-by: Chi Wang <wang.chi@microsoft.com>
whiskyboy pushed a commit to whiskyboy/autogen that referenced this pull request Apr 17, 2024
* added support for streaming tool calls

* bug fix: removed tmp assert

---------

Co-authored-by: Chi Wang <wang.chi@microsoft.com>
Successfully merging this pull request may close these issues.

[Feature Request]: add support for tools for streamed function calls
5 participants