Skip to content

Commit a16f025

Browse files
committed
fix: add EPP RBAC for InferenceModelRewrite (required by v0.7.0)
EPP v0.7.0 (built on gateway-api-inference-extension v1.3.0) watches InferenceModelRewrite resources on startup. The inferencepool Helm chart v1.0.1 (shipped with llm-d v0.3.0) does not include this permission, causing the EPP to crash-loop with a forbidden error when its image is patched to v0.7.0. Add a supplemental Role and RoleBinding granting the EPP service account read-only access to inferencemodelrewrites, applied alongside the existing image and ConfigMap patches in deploy/lib/infra_llmd.sh. Made-with: Cursor
1 parent 5b6f0a0 commit a16f025

1 file changed

Lines changed: 29 additions & 0 deletions

File tree

deploy/lib/infra_llmd.sh

Lines changed: 29 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -338,6 +338,35 @@ deploy_llm_d_infrastructure() {
338338
else
339339
log_warning "ConfigMap $LLM_D_EPP_NAME not found in $LLMD_NS"
340340
fi
341+
342+
# Ensure EPP has RBAC for InferenceModelRewrite (required by EPP v0.7.0+
343+
# which watches this CRD, but older inferencepool Helm charts don't include it).
344+
log_info "Ensuring EPP RBAC includes inferencemodelrewrites permission"
345+
kubectl apply -f - <<RBAC_EOF
346+
apiVersion: rbac.authorization.k8s.io/v1
347+
kind: Role
348+
metadata:
349+
name: ${LLM_D_EPP_NAME}-model-rewrite
350+
namespace: ${LLMD_NS}
351+
rules:
352+
- apiGroups: ["inference.networking.x-k8s.io"]
353+
resources: ["inferencemodelrewrites"]
354+
verbs: ["get", "list", "watch"]
355+
---
356+
apiVersion: rbac.authorization.k8s.io/v1
357+
kind: RoleBinding
358+
metadata:
359+
name: ${LLM_D_EPP_NAME}-model-rewrite
360+
namespace: ${LLMD_NS}
361+
subjects:
362+
- kind: ServiceAccount
363+
name: ${LLM_D_EPP_NAME}
364+
namespace: ${LLMD_NS}
365+
roleRef:
366+
apiGroup: rbac.authorization.k8s.io
367+
kind: Role
368+
name: ${LLM_D_EPP_NAME}-model-rewrite
369+
RBAC_EOF
341370
else
342371
log_warning "Skipping inference-scheduler patch: Deployment $LLM_D_EPP_NAME not found in $LLMD_NS"
343372
fi

0 commit comments

Comments
 (0)