1 parent 8e9301c, commit 501ff19
site/docs/capabilities/inference/httproute-inferencepool.md
@@ -40,7 +40,7 @@ kubectl wait --timeout=2m -n envoy-gateway-system deployment/envoy-gateway --for
 Deploy a sample inference backend that will serve as your inference endpoints:
 
 ```bash
-kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/main/config/manifests/vllm/sim-deployment.yaml
+kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/v1.0.1/config/manifests/vllm/sim-deployment.yaml
 ```
 
 This creates a simulated vLLM deployment with multiple replicas that can handle inference requests.