Skip to content

Adding a suspend field to the dask operator #701

Open
@kannon92

Description

@kannon92

Kubernetes has started adding ways to add queueing capabilities into Kubernetes. The entrypoint for enabling queueing can be by implementing the suspend field.

The BatchJob API contains these field in kubernetes upstream but custom CRDs need to implement suspend schematics for queueing.

There is some work in Kueue for adding suspend capabilities to RayJob and I imagine it would be similar for this project.

Relevant PR for RayJob: ray-project/kuberay#926
Kueue PR to incorporate RayJob: kubernetes-sigs/kueue#667

Documentation for suspend in jobs: https://kubernetes.io/docs/concepts/workloads/controllers/job/#suspending-a-job

I think it would make sense to add it as a DaskJob but there could be a reason to implement queueing in other areas also?

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions