[CLI] simplify docker run #159

Merged
merged 12 commits into main from simplify_docker_2 on Sep 30, 2024
Conversation

yanxi0830 (Contributor) commented Sep 30, 2024

Changes

  • Motivation: Users should not need to install the llama CLI and run llama stack configure / llama stack run outside of docker containers; downloading the docker image should be sufficient to start the Llama Stack server.
  • Clean up CLI output messages.
  • [RFC] New developer flow for interacting with the docker image.

Developer Flow

  1. Download the docker image from Docker Hub:
docker image pull llamastack/llamastack-local-gpu
  2. [New] Run with the built-in default config:
docker run -it -p 5000:5000 -v ~/.llama:/root/.llama --gpus=all llamastack/llamastack-local-gpu
  3. (Advanced Option) Run with a custom config:
docker run -it \
  -p 5000:5000 \
  -v path/to/run.yaml:/app/run.yaml \
  -v ~/.llama:/root/.llama \
  --gpus=all \
  llamastack-d1 \
  /app/run.yaml \
  --port 5000

where path/to/run.yaml is the absolute path to the config file outside the container, and /app/run.yaml is the path the config is mounted to inside the container. The trailing /app/run.yaml and --port 5000 arguments are passed through to the server.
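For orientation, a minimal run.yaml could look like the sketch below. This is illustrative only: the provider name and the inference values come from the build transcript later in this PR, but the top-level field names are assumptions rather than the authoritative schema.

# Illustrative run.yaml sketch; top-level field names are assumptions,
# not the authoritative schema.
apis_to_serve:            # hypothetical field listing the APIs to expose
  - inference
  - safety
  - agents
  - memory
  - telemetry
providers:                # hypothetical field mapping each API to a provider
  inference:
    provider_type: meta-reference
    config:
      model: Llama3.1-8B-Instruct   # defaults taken from the build transcript below
      max_seq_len: 4096
      max_batch_size: 1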

3.5 (Easier configuration) Add example build.yaml / run.yaml configs (a build.yaml sketch follows step 4 below).

  4. The old llama configure/run flow outside the docker container still works:
$ llama stack build
> Enter a name for your Llama Stack (e.g. my-local-stack): d1
...

$ llama stack configure llamastack-d1

$ llama stack run d1
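As a companion to item 3.5, here is what an example build.yaml might look like. The fields mirror the questions asked in the interactive build transcript below; the exact schema (in particular the distribution_spec grouping) is an assumption.

# Illustrative build.yaml sketch; fields mirror the interactive build
# prompts below, but the exact schema is an assumption.
name: d1
image_type: docker              # docker or conda, per the build prompt
distribution_spec:              # hypothetical grouping
  description: Example local Llama Stack distribution
  providers:
    inference: meta-reference
    safety: meta-reference
    agents: meta-reference
    memory: meta-reference
    telemetry: meta-reference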

Distribution Owner: Building the Docker Image

$ llama stack build

> Enter a name for your Llama Stack (e.g. my-local-stack): d7
> Enter the image type you want your Llama Stack to be built as (docker or conda): docker

 Llama Stack is composed of several APIs working together. Let's configure the providers (implementations) you want to use for these APIs.
> Enter provider for the inference API: (default=meta-reference): meta-reference
> Enter provider for the safety API: (default=meta-reference): meta-reference
> Enter provider for the agents API: (default=meta-reference): meta-reference
> Enter provider for the memory API: (default=meta-reference): meta-reference
> Enter provider for the telemetry API: (default=meta-reference): meta-reference
 
 > (Optional) Enter a short description for your Llama Stack:
Build spec configuration saved at /data/users/xiyan/llama-stack/tmp/configs/d7-build.yaml
Configuring API `inference`...
=== Configuring provider `meta-reference` for API inference...
Enter value for model (default: Llama3.1-8B-Instruct) (required): 
Do you want to configure quantization? (y/n): n
Enter value for torch_seed (optional): 
Enter value for max_seq_len (default: 4096) (required): 
Enter value for max_batch_size (default: 1) (required): 

Configuring API `safety`...
=== Configuring provider `meta-reference` for API safety...
Do you want to configure llama_guard_shield? (y/n): n
Do you want to configure prompt_guard_shield? (y/n): n

Configuring API `agents`...
=== Configuring provider `meta-reference` for API agents...
Enter `type` for persistence_store (options: redis, sqlite, postgres) (default: sqlite): 

Configuring SqliteKVStoreConfig:
Enter value for namespace (optional): 
Enter value for db_path (default: /home/xiyan/.llama/runtime/kvstore.db) (required): 

Configuring API `memory`...
=== Configuring provider `meta-reference` for API memory...
> Please enter the supported memory bank type your provider has for memory: vector

Configuring API `telemetry`...
=== Configuring provider `meta-reference` for API telemetry...

> YAML configuration has been written to `/data/users/xiyan/llama-stack/tmp/configs/d7-run.yaml`.
Dockerfile created successfully in /tmp/tmp.4Mfy6zpfb2/Dockerfile

FROM python:3.10-slim
WORKDIR /app
...

...
Success! You can run it with: podman run -p 8000:8000 llamastack-d7
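Once built, the llamastack-d7 image can be launched like the prebuilt image in step 3 above. A sketch reusing only flags already shown in this PR (the CLI's success message suggests podman and port 8000; this mirrors the docker flags and port from steps 2–3 instead):

docker run -it \
  -p 5000:5000 \
  -v ~/.llama:/root/.llama \
  --gpus=all \
  llamastack-d7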

@facebook-github-bot added the CLA Signed label on Sep 30, 2024
@yanxi0830 marked this pull request as ready for review on September 30, 2024 16:07
ashwinb (Contributor) left a comment:

lgtm

llama_stack/distribution/build.py (review thread resolved)
llama_stack/distribution/configure_container.sh (outdated; review thread resolved)
@yanxi0830 merged commit d28c3df into main on Sep 30, 2024
3 checks passed
@yanxi0830 deleted the simplify_docker_2 branch on September 30, 2024 22:30
Labels: CLA Signed