Open
Description
Kubernetes has started adding ways to add queueing capabilities into Kubernetes. The entrypoint for enabling queueing can be by implementing the suspend field.
The BatchJob API contains these field in kubernetes upstream but custom CRDs need to implement suspend schematics for queueing.
There is some work in Kueue for adding suspend capabilities to RayJob and I imagine it would be similar for this project.
Relevant PR for RayJob: ray-project/kuberay#926
Kueue PR to incorporate RayJob: kubernetes-sigs/kueue#667
Documentation for suspend in jobs: https://kubernetes.io/docs/concepts/workloads/controllers/job/#suspending-a-job
I think it would make sense to add it as a DaskJob but there could be a reason to implement queueing in other areas also?