File tree Expand file tree Collapse file tree 3 files changed +8
-12
lines changed
deploy/environments/dev/kubernetes-kgateway Expand file tree Collapse file tree 3 files changed +8
-12
lines changed Original file line number Diff line number Diff line change @@ -205,7 +205,7 @@ This will deploy the entire stack to whatever namespace you chose.
205205You can test by exposing the inference ` Gateway ` via port-forward:
206206
207207``` bash
208- kubectl port-forward service/inference-gateway 8080:80
208+ kubectl port-forward service/inference-gateway 8080:80 -n " ${NAMESPACE} "
209209```
210210
211211And making requests with ` curl ` :
@@ -260,22 +260,18 @@ the `git` SHA:
260260export EPP_TAG=$( git rev-parse HEAD)
261261```
262262
263- Build the image:
263+ Build the image and tag the image for your private registry :
264264
265265``` bash
266- DEV_VERSION=$EPP_TAG make image-build
266+ DEV_VERSION=$EPP_TAG IMAGE_REGISTRY=quay.io/my-id make image-build
267267```
268268
269- Tag the image for your private registry and push it:
269+ and push it:
270270
271271``` bash
272- $CONTAINER_RUNTIME tag quay.io/llm-d/llm-d-gateway-api-inference-extension/epp:$TAG \
273- < MY_REGISTRY> /< MY_IMAGE> :$EPP_TAG
274- $CONTAINER_RUNTIME push < MY_REGISTRY> /< MY_IMAGE> :$EPP_TAG
275- ```
272+ IMAGE_REGISTRY=quay.io/my-id make image-push
276273
277- > ** NOTE** : ` $CONTAINER_RUNTIME ` can be configured or replaced with whatever your
278- > environment's standard container runtime is (e.g. ` podman ` , ` docker ` ).
274+ ```
279275
280276Then you can re-deploy the environment with the new changes (don't forget all
281277the required env vars):
Original file line number Diff line number Diff line change @@ -9,7 +9,7 @@ resources:
99- gateway-parameters.yaml
1010
1111images :
12- - name : quay .io/llm-d/gateway-api-inference-extension
12+ - name : ghcr .io/llm-d/gateway-api-inference-extension
1313 newName : ${EPP_IMAGE}
1414 newTag : ${EPP_TAG}
1515
Original file line number Diff line number Diff line change 156156# Run Helm upgrade/install vllm
157157echo " INFO: Deploying vLLM Environment in namespace ${NAMESPACE} , ${POOL_NAME} "
158158helm upgrade --install " $VLLM_HELM_RELEASE_NAME " " $VLLM_CHART_DIR " \
159- --namespace c3 \
159+ --namespace= " $NAMESPACE " \
160160 --set secret.create=true \
161161 --set secret.hfTokenValue=" $HF_TOKEN2 " \
162162 --set vllm.poolLabelValue=" $POOL_NAME " \
You can’t perform that action at this time.
0 commit comments