Describe the bug
Currently, if we have a Pipeline Failure during the original drain, we stop and immediately switch to force drain. Should we handle transient failures, however, and give it an opportunity to recover?
Message from the maintainers:
Impacted by this bug? Give it a 👍. We often sort issues this way to know what to prioritize.