Fix flaky failure e2e test: increase training duration for reliable polling

sutaakar · claude · abhijeet-dhumal · commit 916e0dffe43b · 2026-04-13T15:54:33.000+05:30
The failing-test-runtime training ran for only 3 seconds (15 steps × 0.2s),
which was too short for the controller's 2s poll interval to reliably
capture progress > 0 before the job crashed. Increased per-step sleep to
0.5s (~8s total) so the controller has 3-4 poll cycles to capture progress.
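
The arithmetic behind the new value can be checked with a rough sketch.
It assumes the controller polls at a fixed 2s interval with an unknown
phase relative to the start of training, and it ignores the 1s server
warm-up and any request latency. The helper name min_polls_in_window is
hypothetical and is not part of the test or the controller.

```python
import math


def min_polls_in_window(steps: int, step_sleep_s: float, poll_interval_s: float) -> int:
    """Lower bound on the poll ticks that land inside the training window.

    For a window of length W and a fixed poll interval P with arbitrary
    phase, at least floor(W / P) ticks fall inside the window.
    """
    window_s = steps * step_sleep_s
    return math.floor(window_s / poll_interval_s)


FAIL_AT_STEP = 15      # steps executed before the simulated crash
POLL_INTERVAL_S = 2.0  # poll interval stated in this commit message

for step_sleep_s in (0.2, 0.5):
    window_s = FAIL_AT_STEP * step_sleep_s
    polls = min_polls_in_window(FAIL_AT_STEP, step_sleep_s, POLL_INTERVAL_S)
    print(f"{step_sleep_s}s/step: {window_s:.1f}s window, at least {polls} poll(s)")
```

Under these assumptions the old timing guarantees only a single poll, so
one late or early tick could miss progress entirely, while the new timing
guarantees at least three.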

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
diff --git a/test/e2e/rhai/resources/failing-test-runtime.yaml b/test/e2e/rhai/resources/failing-test-runtime.yaml
@@ -66,15 +66,17 @@ spec:
                           # Wait briefly for server to be ready
                           time.sleep(1)
 
-                          # Fast training that will fail at 50% (3 seconds total)
+                          # Training that will fail at 50% (~8 seconds total)
+                          # Must run long enough for controller to poll progress > 0
+                          # with a 2s poll interval (at least 3-4 poll cycles).
                           print("Starting training that will fail...")
                           total_steps = 30
                           fail_at_step = 15  # Fail at 50%
 
                           for step in range(fail_at_step):
-                              time.sleep(0.2)  # 0.2s per step
+                              time.sleep(0.5)  # 0.5s per step
                               progress = int((step / total_steps) * 100)
-                              remaining = int((total_steps - step) * 0.2)
+                              remaining = int((total_steps - step) * 0.5)
 
                               MetricsHandler.progress_data = {
                                   "progressPercentage": progress,