Skip to content

Commit 501ff19

Browse files
authored
Apply suggestion from @mathetake
Signed-off-by: Takeshi Yoneda <[email protected]>
1 parent 8e9301c commit 501ff19

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

site/docs/capabilities/inference/httproute-inferencepool.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -40,7 +40,7 @@ kubectl wait --timeout=2m -n envoy-gateway-system deployment/envoy-gateway --for
4040
Deploy a sample inference backend that will serve as your inference endpoints:
4141

4242
```bash
43-
kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/main/config/manifests/vllm/sim-deployment.yaml
43+
kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/v1.0.1/config/manifests/vllm/sim-deployment.yaml
4444
```
4545

4646
This creates a simulated vLLM deployment with multiple replicas that can handle inference requests.

0 commit comments

Comments
 (0)