Skip to content

Expose metric with volumes stuck detaching #2255

Open
@jsafrane

Description

@jsafrane

Is your feature request related to a problem? Please describe.
Sometimes a volume gets stuck at detaching state, most usually because the host where it is attached is unhealthy. A careful, manually initiated force detach and/or node shutdown may be needed it that case.

The CSI driver should expose a metric about what volumes (with PV/PVC as a label?) are waiting for detach longer than X seconds (minutes?), so a cluster admin can set up an alert and manually investigate what is going on.

Describe alternatives you've considered
A similar metric in the kube-controller-manager or the external-attacher could be a good alternative, still, we've seen such an issue only with AWS EBS.

Additional context
See #1302 for an example of such issue.

Metadata

Metadata

Assignees

Labels

lifecycle/staleDenotes an issue or PR has remained open with no activity and has become stale.priority/important-longtermImportant over the long term, but may not be staffed and/or may need multiple releases to complete.

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions