Docker compose vllm #1821

Merged
merged 2 commits into from
Oct 8, 2024

hitpoint6
Contributor

Please describe the purpose of this pull request.
Is it to add a new feature? Is it to fix a bug?
Enable running Letta with a vLLM-powered model using a single command:
export LETTA_LLM_MODEL=teknium/OpenHermes-2-Mistral-7B; docker compose -f docker-compose-vllm.yaml up
Replace teknium/OpenHermes-2-Mistral-7B with your own model ID.
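
For repeated use, the one-liner above could be wrapped in a small shell helper. This is only a sketch: the function names `letta_model_id` and `letta_vllm_up` are hypothetical and not part of this PR, and it assumes the compose file reads LETTA_LLM_MODEL from the environment, as the command in the description implies.

```shell
# Hypothetical helpers, not part of this PR.

# Print the model ID to use: the caller's LETTA_LLM_MODEL if set and
# non-empty, otherwise the example model from the PR description.
letta_model_id() {
  printf '%s\n' "${LETTA_LLM_MODEL:-teknium/OpenHermes-2-Mistral-7B}"
}

# Start the stack with the resolved model ID. Extra arguments are passed
# through to `docker compose up` (for example -d to run detached).
letta_vllm_up() {
  LETTA_LLM_MODEL="$(letta_model_id)" \
    docker compose -f docker-compose-vllm.yaml up "$@"
}
```

Calling `letta_vllm_up` with no variable set would use the example model; `LETTA_LLM_MODEL=your/model-id letta_vllm_up` would use your own.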

How to test
How can we test your PR during review? What commands should we run? What outcomes should we expect?

  1. export LETTA_LLM_MODEL=teknium/OpenHermes-2-Mistral-7B; docker compose -f docker-compose-vllm.yaml up
  2. Open http://localhost:8083/
  3. Create an agent and chat with it. Note that this is a 7B model, so it is much less capable than gpt-4o.
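
Before opening the browser in step 2, it may help to wait until the server actually responds, since the containers can take a while to start. The helper below is a rough sketch: `wait_for_url` is a hypothetical name, the attempt count and delay are arbitrary defaults, and it treats any HTTP response from the URL as "ready", which is an assumption about how the server behaves.

```shell
# Hypothetical helper, not part of this PR.
# Poll a URL until it returns any HTTP response, or give up.
#   $1: URL to poll
#   $2: maximum number of attempts (default 30)
#   $3: seconds to sleep between attempts (default 2)
wait_for_url() {
  url=$1
  attempts=${2:-30}
  delay=${3:-2}
  i=1
  while [ "$i" -le "$attempts" ]; do
    # -s silences progress output; --max-time bounds each attempt.
    if curl -s -o /dev/null --max-time 5 "$url" 2>/dev/null; then
      return 0
    fi
    i=$((i + 1))
    sleep "$delay"
  done
  return 1
}
```

For example, `wait_for_url http://localhost:8083/ && echo "server is reachable"` would poll the address from step 2 for up to about a minute of sleep time, plus however long the individual requests take.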

Have you tested this PR?
Have you tested the latest commit on the PR? If so please provide outputs from your tests.

Yes. I was able to create an agent, update my name, and ask the agent about my name.

Related issues or PRs
Please link any related GitHub issues or PRs.

Is your PR over 500 lines of code?
If so, please break up your PR into multiple smaller PRs so that we can review them quickly, or provide justification for its length.

Additional context
Add any other context or screenshots about the PR here.

@sarahwooders sarahwooders self-requested a review October 8, 2024 00:20
@sarahwooders sarahwooders merged commit 7a0c97a into letta-ai:main Oct 8, 2024
2 of 7 checks passed
mattzh72 pushed a commit that referenced this pull request Oct 9, 2024
2 participants