Skip to content

[Bug] SidecarMode: submitter not wait RayCluster Running (Ready) #4199

@sunran1203

Description

@sunran1203

Search before asking

  • I searched the issues and found no similar issues.

KubeRay Component

ray-operator

What happened + What you expected to happen

NAME             JOB STATUS   DEPLOYMENT STATUS   RAY CLUSTER NAME       START TIME             END TIME   AGE
job-43n6os1dxr                Initializing        job-43n6os1dxr-2tzsh   2025-11-17T13:53:34Z              10s
job-43n6os1dxr-2tzsh-head-nbhr9            2/2     Running            0             3m58s
job-43n6os1dxr-2tzsh-worker-worker-w5qqz   0/1     Running            1 (14s ago)   3m58s

KubeRay Operator:v1.5.0

The Worker was unable to access the Head due to a domain name resolution issue, but the submitter had already submitted the task, and the task ran in the Head. After the task finished, RayJob did not change to SUCCEEDED.

Reproduction script

Perhaps we could try modifying the DNS resolution file mounted by the Worker.

Anything else

No response

Are you willing to submit a PR?

  • Yes I am willing to submit a PR!

Metadata

Metadata

Assignees

Labels

1.6.0bugSomething isn't working

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions