Commit 948615e

fix: make tpu_v5e_4 non-preemptible in test config
The executor heuristic auto-tags small CLI jobs as non-preemptible. With all test workers preemptible, these jobs stayed PENDING forever. Make the always-on tpu_v5e_4 group non-preemptible so executor-style jobs can schedule. Also revert the --cpu 2 workaround back to --cpu 1.
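The failure mode described above can be sketched in a few lines of Python. This is an illustrative model only, not the Iris scheduler: `JobRequest`, `ScaleGroup`, `requires_non_preemptible`, and `eligible_groups` are hypothetical names, the 1-CPU threshold is taken from the test comment removed in this commit, and the rule that untagged jobs may run on any group is an assumption.

```python
from dataclasses import dataclass

# Assumed threshold, based on the removed test comment ("<=1 CPU -> non-preemptible").
EXECUTOR_CPU_THRESHOLD = 1.0


@dataclass(frozen=True)
class JobRequest:
    """Hypothetical stand-in for a CLI job submission."""
    name: str
    cpu: float


@dataclass(frozen=True)
class ScaleGroup:
    """Hypothetical stand-in for one entry under scale_groups."""
    name: str
    preemptible: bool


def requires_non_preemptible(job: JobRequest) -> bool:
    """Model of the executor heuristic: small jobs are tagged non-preemptible."""
    return job.cpu <= EXECUTOR_CPU_THRESHOLD


def eligible_groups(job: JobRequest, groups: list[ScaleGroup]) -> list[str]:
    """Return the names of groups the job could be placed on.

    Assumption: a job tagged non-preemptible can only land on a
    non-preemptible group; any other job can land on any group.
    """
    if requires_non_preemptible(job):
        return [g.name for g in groups if not g.preemptible]
    return [g.name for g in groups]


# Test config before and after this commit (only the relevant field is modeled).
before = [ScaleGroup("tpu_v5e_4", preemptible=True)]
after = [ScaleGroup("tpu_v5e_4", preemptible=False)]

small_job = JobRequest("itest-cli-ok", cpu=1)       # what the tests submit now
workaround_job = JobRequest("itest-cli-ok", cpu=2)  # the reverted workaround

print(eligible_groups(small_job, before))       # no candidates: job would stay PENDING
print(eligible_groups(workaround_job, before))  # workaround sidesteps the heuristic
print(eligible_groups(small_job, after))        # fixed config gives the job a home
```

Under these assumptions, the 1-CPU job has no candidate group in the old config, which matches the PENDING behavior the commit message describes. Raising the request to 2 CPUs only avoided the tag, whereas flipping `preemptible` on `tpu_v5e_4` gives tagged jobs somewhere to run.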
1 parent 57baa95 commit 948615e

2 files changed

Lines changed: 4 additions & 6 deletions

lib/iris/examples/test.yaml

Lines changed: 2 additions & 2 deletions
@@ -33,7 +33,7 @@ controller:
 port: 10000

 scale_groups:
-# Active scale group with capacity
+# Active scale group with capacity (non-preemptible so executor-style jobs land here)
 tpu_v5e_4:
 num_vms: 1
 resources:
@@ -43,7 +43,7 @@ scale_groups:
 device_type: tpu
 device_variant: v5litepod-4
 device_count: 4
-preemptible: true
+preemptible: false
 min_slices: 2
 max_slices: 10
 slice_template:

tests/integration/iris/test_cli_dispatch.py

Lines changed: 2 additions & 4 deletions
@@ -55,8 +55,6 @@ def _wait_for_job_state(

 def test_cli_submit_and_succeed(integration_cluster):
 """Submit a simple command via 'iris job run' CLI and verify it succeeds."""
-# Use --cpu 2 to avoid the executor heuristic (≤1 CPU → non-preemptible),
-# since test cluster workers are all preemptible.
 result = _run_iris_cli(
 integration_cluster.url,
 "job",
@@ -65,7 +63,7 @@ def test_cli_submit_and_succeed(integration_cluster):
 "--job-name",
 "itest-cli-ok",
 "--cpu",
-"2",
+"1",
 "--memory",
 "1g",
 "--",
@@ -91,7 +89,7 @@ def test_cli_submit_failing_command(integration_cluster):
 "--job-name",
 "itest-cli-fail",
 "--cpu",
-"2",
+"1",
 "--memory",
 "1g",
 "--",
