Open
Description
The currently implemented logic for deletion can be summarized into:
- The operator asks the scheduler to retire n of the workers
- The scheduler retires them (process exits) and returns to the operator names of retired workers
- Operator deletes worker deployment sequentially
However, between 2 and 3 the Kubernetes API may interfere and restart the worker deployment so a new pod will be created and join the cluster for some time before the operator deletes the deployment effectively interpreting the pod mid-run.