Make LlamaStackLibraryClient work correctly #581
Conversation
T = TypeVar("T")

def stream_across_asyncio_run_boundary(
This is the most crucial part of this PR. Without it, you cannot make the "non-async" generators (a necessary mode for our client-sdk) work properly: you must be able to do `for chunk in inference.chat_completion()`, since that's what synchronous generators are about. To bridge a sync generator to the async generators we have in our server-side code, we need to intermediate via a thread pool.
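Only the function's name appears in this diff excerpt, so the body below is not the PR's actual implementation, just a minimal sketch of the general technique the comment describes: run the async generator on its own event loop in a worker thread, and hand items back to a plain sync generator through a thread-safe queue. The sentinel and error-forwarding details are assumptions, and the sketch omits backpressure and early-cancellation handling.

```python
import asyncio
import queue
import threading
from typing import Any, AsyncGenerator, Callable, Generator

_SENTINEL = object()  # marks end-of-stream on the queue


def stream_across_asyncio_run_boundary(
    async_gen_maker: Callable[[], AsyncGenerator[Any, None]],
) -> Generator[Any, None, None]:
    """Consume an async generator from synchronous code."""
    q: queue.Queue = queue.Queue()

    async def pump() -> None:
        # Runs inside the worker thread's event loop and feeds the queue.
        async for item in async_gen_maker():
            q.put(item)

    def worker() -> None:
        try:
            asyncio.run(pump())
        except BaseException as exc:
            q.put(exc)  # forward errors to the sync consumer
        finally:
            q.put(_SENTINEL)

    threading.Thread(target=worker, daemon=True).start()

    # Synchronous side: block on the queue until the stream ends.
    while (item := q.get()) is not _SENTINEL:
        if isinstance(item, BaseException):
            raise item
        yield item
```

A toy consumer then reads the stream with an ordinary `for` loop:

```python
async def counter() -> AsyncGenerator[int, None]:
    for i in range(3):
        await asyncio.sleep(0.01)
        yield i

for chunk in stream_across_asyncio_run_boundary(counter):
    print(chunk)  # 0, 1, 2
```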
def main(config_path: str):
    client = LlamaStackAsLibraryClient(config_path)
https://llama-stack.readthedocs.io/en/latest/distributions/importing_as_library.html still references LlamaStackDirectClient. Does it also need to be updated?
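For context, here is a hedged sketch of what library-mode usage might look like, extrapolated from the diff fragment above. The import path, the `initialize()` step, and the `chat_completion` parameters are assumptions, not confirmed by this excerpt.

```python
# Hypothetical usage sketch; the import path and parameters are assumptions.
from llama_stack.distribution.library_client import LlamaStackAsLibraryClient


def main(config_path: str):
    client = LlamaStackAsLibraryClient(config_path)
    client.initialize()  # assumed setup step

    # Stream a chat completion with a plain synchronous for-loop,
    # as the review comment above describes.
    for chunk in client.inference.chat_completion(
        model_id="...",  # elided; depends on the distribution config
        messages=[{"role": "user", "content": "Hello"}],
        stream=True,
    ):
        print(chunk)
```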
This PR does a few things:
- `LlamaStackLibraryClient`

In many ways, this PR makes things finally "work".
Test Plan
See the `library_client_test.py` I added. This isn't really quite a test yet, but it demonstrates that this mode now works. Here's the invocation and the response: