Search before asking
- I had searched in the issues and found no similar feature requirement.
Description
It may be beneficial to support creating multiple Ray workers on the same Pod created with KubeRay.
There is more context in this PR: ai-on-gke/kuberay-tpu-webhook#19, which supports multiple TPU containers with KubeRay, and specifically this comment: ai-on-gke/kuberay-tpu-webhook#19 (comment).
I was able to create a RayCluster with 2 workers, each running 2 Ray containers (so 4 Ray nodes total), and run a workload on it. However, to avoid port conflicts and to pass the correct Ray resources, it's currently necessary to manually construct a `ray start` command for the second container. KubeRay could reduce the manual intervention required of the user by updating its resource detection and port assignment logic to handle multiple Ray containers automatically.
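As a rough illustration of the manual workaround described above, the second container's `ray start` command has to override the default ports and declare its own resources by hand. The sketch below is hypothetical: the container names, image tag, port numbers, resource values, and the `$RAY_HEAD_IP` variable are illustrative assumptions, not part of any existing KubeRay API.

```yaml
# Hypothetical worker Pod template fragment with two Ray containers.
# KubeRay injects the default ray start command into the first container;
# the second container must be configured manually today.
spec:
  containers:
    - name: ray-worker-0
      image: rayproject/ray:2.9.0
      # Default KubeRay-managed container: standard ports (e.g. 6380/6381),
      # resources auto-detected from the container spec.
    - name: ray-worker-1
      image: rayproject/ray:2.9.0
      command: ["/bin/bash", "-c", "--"]
      args:
        # Manually shift every port the first container already uses,
        # and pass this container's share of the resources explicitly.
        - >
          ray start --block
          --address=$RAY_HEAD_IP:6379
          --node-manager-port=6390
          --object-manager-port=6391
          --metrics-export-port=8081
          --min-worker-port=11000
          --max-worker-port=11999
          --resources='{"TPU": 2}'
```

If KubeRay handled this automatically, the operator could compute non-conflicting port offsets and split the detected resources across containers, so the user would only declare the containers themselves.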
Use case
TPU v7x introduces a dual-chiplet architecture in which a standard 4-chip VM spans two distinct NUMA nodes. To optimize memory bandwidth and avoid cross-NUMA latency, v7x workloads can now run as multiple NUMA-aligned containers within a single Pod. This would entail multiple Ray containers in the same Pod created by KubeRay.
More context on the new accelerator: https://docs.cloud.google.com/tpu/docs/tpu7x
Related issues
N/A
Are you willing to submit a PR?
- Yes I am willing to submit a PR!