
Commit f2309d0

docs: update building distro to use container commands over llama stack run
1 parent 0e78afd commit f2309d0

File tree: 1 file changed (+27, -1)

docs/source/distributions/building_distro.md

Lines changed: 27 additions & 1 deletion
@@ -260,7 +260,33 @@ Containerfile created successfully in /tmp/tmp.viA3a3Rdsg/ContainerfileFROM pyth
 You can now edit ~/meta-llama/llama-stack/tmp/configs/ollama-run.yaml and run `llama stack run ~/meta-llama/llama-stack/tmp/configs/ollama-run.yaml`
 ```
 
-After this step is successful, you should be able to find the built container image and test it with `llama stack run <path/to/run.yaml>`.
+After this step is successful, you should be able to find the built container image and test it with:
+```
+# This docker command will run the Llama Stack server based on the image built before
+# The following is a list of docker flags and their use in running the server:
+# -p $LLAMA_STACK_PORT:$LLAMA_STACK_PORT: Maps the container port to the host port for accessing the server
+# -v ~/.llama:/root/.llama: Mounts the local .llama directory to persist configurations and data
+# -v <path/to/run.yaml>:/app/run.yaml: Mounts the run configuration file into the container
+# --entrypoint python: Specifies the entry point to run Python directly
+# localhost/distribution-ollama:dev: The name and tag of the container image to run
+# -m llama_stack.distribution.server.server: The Python module to execute
+# --config /app/run.yaml: Path to the configuration file inside the container
+# --port $LLAMA_STACK_PORT: Port number for the server to listen on
+# --env INFERENCE_MODEL=$INFERENCE_MODEL: Sets the model to use for inference
+# --env OLLAMA_URL=http://host.docker.internal:11434: Configures the URL for the Ollama service
+
+docker run -it \
+  -p $LLAMA_STACK_PORT:$LLAMA_STACK_PORT \
+  -v ~/.llama:/root/.llama \
+  -v <path/to/run.yaml>:/app/run.yaml \
+  --entrypoint python \
+  localhost/distribution-ollama:dev \
+  -m llama_stack.distribution.server.server \
+  --config /app/run.yaml \
+  --port $LLAMA_STACK_PORT \
+  --env INFERENCE_MODEL=$INFERENCE_MODEL \
+  --env OLLAMA_URL=http://host.docker.internal:11434
+```
 :::
 
 ::::
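Note: the added command expects `LLAMA_STACK_PORT` and `INFERENCE_MODEL` to already be set in the shell, and `<path/to/run.yaml>` to be replaced with the run config produced by the build step. A minimal usage sketch follows; the port and model values are illustrative assumptions, not taken from the commit.

```
# Illustrative values only (assumptions, not from this commit)
export LLAMA_STACK_PORT=8321
export INFERENCE_MODEL="meta-llama/Llama-3.2-3B-Instruct"

# host.docker.internal resolves automatically on Docker Desktop (macOS/Windows).
# On Linux it is typically not resolvable by default; one common workaround is to
# map it to the host gateway when starting the container, e.g.:
#   docker run --add-host=host.docker.internal:host-gateway ... (rest of the command as in the diff)
```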
