Commit 2d8e765
committed
fix: change LLM_D_RELEASE default to main for GA InferencePool API
The previous attempt set LLM_D_RELEASE in ci-benchmark.yaml, but
issue_comment-triggered workflows always use the workflow YAML from the
default branch — not the PR branch. The env var never took effect.
Change the default in deploy/install.sh instead (v0.3.0 → main), which
IS executed from the PR checkout. llm-d main uses inferencepool chart
v1.4.0 which creates InferencePool with the GA API group
(inference.networking.k8s.io/v1) that Istio 1.29+ expects.
The old v0.3.0 default used chart v1.0.1 with the alpha API group
(inference.networking.x-k8s.io/v1alpha2), which Istio ignores —
causing Gateway HTTP 500 errors.
Made-with: Cursor1 parent 1fbe6e1 commit 2d8e765
2 files changed
Lines changed: 1 addition & 4 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
545 | 545 | | |
546 | 546 | | |
547 | 547 | | |
548 | | - | |
549 | | - | |
550 | | - | |
551 | 548 | | |
552 | 549 | | |
553 | 550 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
47 | 47 | | |
48 | 48 | | |
49 | 49 | | |
50 | | - | |
| 50 | + | |
51 | 51 | | |
52 | 52 | | |
53 | 53 | | |
| |||
0 commit comments