Skip to content

Commit 607647a

Browse files
Update doc 04-GCP-GKE-lmcache-local-disk.md (#727)
Signed-off-by: dannawang <[email protected]> Co-authored-by: Yuhan Liu <[email protected]>
1 parent ddca3ef commit 607647a

File tree

1 file changed

+4
-3
lines changed

1 file changed

+4
-3
lines changed

tutorials/cloud_deployments/04-GCP-GKE-lmcache-local-disk.md

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -16,18 +16,19 @@ Before running this setup, ensure you have:
1616

1717
Disclaimer: This script requires cloud resources and will incur costs. Please make sure all resources are shut down properly.
1818

19-
Set the environment variables. Check that they meet the minimum requirements for your LLM model. Default values will be used for any unset variables. In this example, we use an A3 mega machine type to run the meta-llama/Llama-3.3-70B-Instruct model and use local SSD as the local disk backend through Kubernetes ephemeral storage(emptyDir).
19+
Set the environment variables. Check that they meet the minimum requirements for your LLM model. Default values will be used for any unset variables. In this example, we use an [A3 mega machine type](https://cloud.google.com/compute/docs/gpus#gpu-models) to run the meta-llama/Llama-3.3-70B-Instruct model and use local SSD as the local disk backend through Kubernetes ephemeral storage(emptyDir).
2020

2121
```bash
2222
export CLUSTER_NAME="my-vllm-cluster"
23-
export CLUSTER_VERSION="1.32.3-gke.1440000"
23+
export CLUSTER_VERSION="1.33.4-gke.1172000"
2424
export ZONE="us-central1-c"
2525
export ACCELERATOR_TYPE="nvidia-h100-mega-80gb"
2626
```
2727

28-
To run the service, go to "deployment_on_cloud/gcp" and run:
28+
To run the service, go to "[deployment_on_cloud/gcp](https://github.com/vllm-project/production-stack/tree/main/deployment_on_cloud/gcp)" and run:
2929

3030
```bash
31+
cd ../../deployment_on_cloud/gcp/
3132
bash entry_point_basic.sh ../../tutorials/assets/gke-example/values-01-offload-kv-cache-local-disk.yaml
3233
```
3334

0 commit comments

Comments
 (0)