We can start small, but should be able to point people somewhere.
Loose ideas:
- Workload doesn't start -- now what?
- Inspect status of a ComputeDomain
- Robust kubectl-based commands for fetching 'all' current DRA driver state (we can over time work towards debug-bundle creation).
- Common problems
- How to wipe DRA driver state from a cluster (node-local state, API server state)
We can start small, but should be able to point people somewhere.
Loose ideas: