fix: Add support for output messages for sync/async #1188
Conversation
Possibly also resolves #1189
The stream tests I added work in the sense that the stream parameter works with the litellm client and the updated handling. However, the result the litellm client returns when using a mock_response is unfortunately not representative of how streaming works in a normal setting: in a real call, the streamed chunks won't be populated automatically for the response. So I will need to remove the handling I added, as it only works for litellm client mocked responses. I will open a separate PR for streaming, and will perhaps drop a note to the litellm maintainers to add some clarity around the client's behavior when mock_response is used with streaming.
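For anyone following along, here is a minimal sketch of the kind of call under discussion. The model name, message, and mock text are made up for illustration, and exactly what chunks mock_response yields under stream=True depends on the litellm version; this is not the test code from this PR.

```python
import litellm

# Minimal sketch: litellm's mock_response short-circuits the provider call,
# which is handy for tests, but the chunks it yields under stream=True are
# not necessarily representative of a real provider stream.
response = litellm.completion(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "Hello"}],
    mock_response="mocked answer",  # test-only shortcut, no network call
    stream=True,
)

for chunk in response:
    # In a real streaming call, content arrives incrementally across deltas;
    # the mocked stream's chunk shape can differ, which is why handling that
    # passes against mocked responses may not hold up against real streams.
    delta = chunk.choices[0].delta
    print(getattr(delta, "content", None))
```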
Pinging @anticorrelator here to take a look. cc @nate-mar
GitHub pushes are down at the moment, so I can't push my latest changes. FYI.
@@ -31,6 +31,7 @@ dependencies = [
   "openinference-instrumentation>=0.1.17",
   "openinference-semantic-conventions>=0.1.9",
   "wrapt",
+  "setuptools",
Fixes a build error.
@@ -39,6 +40,7 @@ test = [
   "opentelemetry-sdk",
   "opentelemetry-instrumentation-httpx",
   "tenacity",
+  "tokenizers==0.20.3; python_version == '3.8'"
Fixes a build error related to a bad version of tokenizers that needed to be yanked. For now, force an earlier version that predates the bad release; related to huggingface/tokenizers#1691.
Yep, should resolve this too.
Hi @nate-mar, if we want to handle streaming it looks like we need to add another handler for litellm's streaming return type. All of their streaming logic is there, and we can hook into what they're already doing to instrument the stream (such as compiling the final streamed message and attaching it to our span). Currently we don't handle this streaming return type and only handle the non-streaming response.
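For context, a rough sketch of what hooking into the stream could look like. This is not the implementation in this PR: the flattened attribute keys below are written out by hand as an assumption rather than taken from this repo's semantic convention constants, and the chunk structure assumes OpenAI-style deltas.

```python
def wrap_stream(stream, span):
    """Yield chunks unchanged while accumulating the assistant's content,
    then record the compiled output message on the span when the stream ends."""
    parts = []
    try:
        for chunk in stream:
            delta = chunk.choices[0].delta
            content = getattr(delta, "content", None)
            if content:
                parts.append(content)
            yield chunk
    finally:
        # Hypothetical flattened OpenInference-style keys; real instrumentation
        # should use its own semantic convention constants instead.
        span.set_attribute("llm.output_messages.0.message.role", "assistant")
        span.set_attribute("llm.output_messages.0.message.content", "".join(parts))
        span.end()
```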
Sounds good @anticorrelator! For now, if you wouldn't mind giving this one a review so we can get it across the line first, that'd be great. Thanks!
lgtm!
resolves #1060