Commit 7196434
authored
π Enable scale-from-zero E2E on CKS and OCP with KEDA support (#865)
* π Enable scale-from-zero on CKS and OCP with KEDA support
- Remove environment skip in scale_from_zero_test.go β test now runs on
all platforms (KEDA must be pre-installed on the cluster)
- Add retry logic to detect_inference_pool_api_group() to handle the race
where InferencePool instances haven't been created yet after helmfile deploy
- Make deploy_keda() skip helm install when KEDA CRD already exists
(pre-installed on OCP via CMA operator, on CKS via helm)
- Remove environment guard on SCALER_BACKEND=keda β supported everywhere
Signed-off-by: Andy Anderson <andy@clubanderson.com>
Signed-off-by: Andrew Anderson <andy@clubanderson.com>
* π Increase deploy wait timeout from 60s to 600s for model loading
The kubectl wait --timeout=60s for all deployments in the llm-d
namespace was too short for model-serving pods (vLLM) that need to
download and load large models (e.g. Meta-Llama-3.1-8B) into GPU
memory. This caused both OCP and CKS nightly E2E to fail at the
"Deploy guide via WVA install.sh" step.
Default is now 600s (10 min), overridable via DEPLOY_WAIT_TIMEOUT
env var. The vLLM startupProbe already allows up to 30 minutes.
Signed-off-by: Andrew Anderson <andy@clubanderson.com>
* π Address Copilot review feedback on KEDA and scale-from-zero
- deploy_keda(): Check operator pods + APIService, not just CRD, to
avoid false skip when stale CRD remains after prior uninstall
- detect_inference_pool_api_group(): Implement actual namespace-first
then cluster-wide fallback (comment said fallback but code didn't)
- Pin KEDA chart version (KEDA_CHART_VERSION, default 2.19.0) for
reproducible installs
- Fix ENABLE_SCALE_TO_ZERO default inconsistency in helm --set
- Add Skip guard in scale-from-zero test for non-KEDA environments
where HPA rejects minReplicas=0
- Fix misleading comment that said scale-from-zero requires KEDA
- Document per-environment KEDA_NAMESPACE values in suite_test.go
Signed-off-by: Andrew Anderson <andy@clubanderson.com>
---------
Signed-off-by: Andy Anderson <andy@clubanderson.com>
Signed-off-by: Andrew Anderson <andy@clubanderson.com>1 parent 072ec8b commit 7196434
3 files changed
Lines changed: 69 additions & 20 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
118 | 118 | | |
119 | 119 | | |
120 | 120 | | |
| 121 | + | |
| 122 | + | |
121 | 123 | | |
122 | 124 | | |
123 | 125 | | |
| |||
505 | 507 | | |
506 | 508 | | |
507 | 509 | | |
| 510 | + | |
| 511 | + | |
508 | 512 | | |
509 | 513 | | |
510 | | - | |
511 | | - | |
512 | | - | |
513 | | - | |
514 | | - | |
| 514 | + | |
| 515 | + | |
| 516 | + | |
| 517 | + | |
| 518 | + | |
| 519 | + | |
| 520 | + | |
| 521 | + | |
| 522 | + | |
| 523 | + | |
| 524 | + | |
| 525 | + | |
| 526 | + | |
| 527 | + | |
| 528 | + | |
| 529 | + | |
| 530 | + | |
| 531 | + | |
| 532 | + | |
| 533 | + | |
| 534 | + | |
| 535 | + | |
| 536 | + | |
| 537 | + | |
| 538 | + | |
| 539 | + | |
| 540 | + | |
| 541 | + | |
| 542 | + | |
515 | 543 | | |
516 | 544 | | |
517 | 545 | | |
| |||
555 | 583 | | |
556 | 584 | | |
557 | 585 | | |
| 586 | + | |
558 | 587 | | |
559 | 588 | | |
560 | 589 | | |
| |||
1041 | 1070 | | |
1042 | 1071 | | |
1043 | 1072 | | |
1044 | | - | |
1045 | | - | |
| 1073 | + | |
| 1074 | + | |
| 1075 | + | |
| 1076 | + | |
| 1077 | + | |
| 1078 | + | |
1046 | 1079 | | |
1047 | 1080 | | |
1048 | 1081 | | |
| |||
1074 | 1107 | | |
1075 | 1108 | | |
1076 | 1109 | | |
| 1110 | + | |
| 1111 | + | |
| 1112 | + | |
| 1113 | + | |
| 1114 | + | |
| 1115 | + | |
| 1116 | + | |
| 1117 | + | |
| 1118 | + | |
| 1119 | + | |
| 1120 | + | |
| 1121 | + | |
| 1122 | + | |
1077 | 1123 | | |
1078 | 1124 | | |
1079 | 1125 | | |
1080 | 1126 | | |
1081 | 1127 | | |
1082 | 1128 | | |
| 1129 | + | |
1083 | 1130 | | |
1084 | 1131 | | |
1085 | 1132 | | |
| |||
1689 | 1736 | | |
1690 | 1737 | | |
1691 | 1738 | | |
1692 | | - | |
| 1739 | + | |
| 1740 | + | |
1693 | 1741 | | |
1694 | | - | |
1695 | | - | |
1696 | | - | |
1697 | | - | |
1698 | 1742 | | |
1699 | 1743 | | |
1700 | 1744 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
25 | 25 | | |
26 | 26 | | |
27 | 27 | | |
28 | | - | |
| 28 | + | |
| 29 | + | |
29 | 30 | | |
30 | 31 | | |
31 | 32 | | |
| |||
35 | 36 | | |
36 | 37 | | |
37 | 38 | | |
38 | | - | |
39 | | - | |
40 | | - | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
41 | 46 | | |
42 | 47 | | |
43 | 48 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
53 | 53 | | |
54 | 54 | | |
55 | 55 | | |
56 | | - | |
57 | | - | |
58 | | - | |
59 | | - | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
60 | 60 | | |
61 | 61 | | |
62 | 62 | | |
| |||
0 commit comments