Even Azure models that support streaming won't stream; the entire response is always returned in one chunk.
I have found no way to enable streaming using configuration, and from the code it doesn't seem possible. The problem appears to be with the `can_stream` property of the `llm.Model` class. Even if you define it using a `config.yaml`, it is ignored by the llm-azure plugin. `AzureChat` extends the OpenAI `Chat`, which in turn extends `llm.Model`. In `Chat`, `can_stream` is `True` by default, but this doesn't take effect because `AzureChat` doesn't call `super().__init__()`, so it becomes effectively `False` for all Azure models.
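A simplified sketch of that inheritance chain (the class bodies here are illustrative stand-ins, not the actual llm or llm-azure source):

```python
# Simplified illustration of the diagnosis above.
class Model:
    can_stream = False          # base default in llm.Model

class Chat(Model):
    def __init__(self):
        self.can_stream = True  # the OpenAI Chat model enables streaming here

class AzureChat(Chat):
    def __init__(self, model_id):
        # super().__init__() is never called, so the instance keeps the
        # base default and streaming stays off for every Azure model.
        self.model_id = model_id

print(AzureChat("example-deployment").can_stream)  # False
```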
I propose to check `config.yaml` for a `can_stream` key, use it if present, and assume `True` otherwise. I will submit a PR shortly.
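For illustration, a minimal sketch of the proposed behaviour, assuming the plugin iterates over model entries loaded from `config.yaml` (every key except `can_stream` is hypothetical, not necessarily the plugin's actual schema):

```python
import yaml

def load_model_configs(config_path):
    """Read config.yaml entries, honouring an optional can_stream key."""
    with open(config_path) as f:
        entries = yaml.safe_load(f) or []
    for entry in entries:
        # Use can_stream if the user set it; otherwise assume True so
        # models that support streaming actually stream.
        entry["can_stream"] = entry.get("can_stream", True)
    return entries
```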