In this session, we'll work together to debug and fix a Kubernetes cluster and a basic service deployed on it. The goal is to see how you approach problems in real time, from diagnosing errors to explaining your reasoning as you go. The archive you received contains an SSH key and config for accessing the three nodes, as well as a kubeconfig file that lets kubectl authenticate to the Kubernetes API; you can also run kubectl from node-1 if you prefer.
The first step is to debug the cluster itself. It has several problems that you will need to fix before the app can be scheduled on it.
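One possible starting point for the cluster check is to list the nodes and see which ones do not report a Ready condition. The sketch below is only an illustration, not a prescribed approach: the function names are hypothetical, and it assumes kubectl is on your PATH with a working kubeconfig.

```python
import json
import subprocess


def not_ready_nodes(node_list: dict) -> list[str]:
    """Return the names of nodes whose Ready condition is not "True".

    `node_list` is the parsed output of `kubectl get nodes -o json`.
    A node with no Ready condition at all is also reported.
    """
    names = []
    for node in node_list.get("items", []):
        conditions = node.get("status", {}).get("conditions", [])
        ready = next((c for c in conditions if c.get("type") == "Ready"), None)
        if ready is None or ready.get("status") != "True":
            names.append(node["metadata"]["name"])
    return names


def fetch_nodes() -> dict:
    """Run kubectl and parse its JSON output (assumes a reachable cluster)."""
    result = subprocess.run(
        ["kubectl", "get", "nodes", "-o", "json"],
        check=True,
        capture_output=True,
        text=True,
    )
    return json.loads(result.stdout)
```

Calling `not_ready_nodes(fetch_nodes())` gives a list of node names to inspect further, for example with `kubectl describe node` or by checking the kubelet over SSH.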
The current app and Postgres deployments have issues that prevent them from working properly. The manifests in the k8s
directory of this repo match exactly what is deployed on the cluster. Your job is to change the deployed resources as
needed to get the app fully operational.
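Once the cluster itself is healthy, a quick way to see which workloads are still failing is to scan pod status. The following is a minimal sketch under stated assumptions: the function name is hypothetical, the input is the parsed output of `kubectl get pods -o json`, and it only catches pods that are not running or have a container stuck in a waiting state. A pod that is Running but failing its readiness probe would not be flagged.

```python
def unhealthy_pods(pod_list: dict) -> list[tuple[str, str]]:
    """Return (pod name, reason) pairs for pods that look unhealthy.

    The reason is the first waiting-container reason if there is one
    (for example CrashLoopBackOff or ImagePullBackOff); otherwise it is
    the pod phase when that phase is neither Running nor Succeeded.
    """
    problems = []
    for pod in pod_list.get("items", []):
        name = pod["metadata"]["name"]
        status = pod.get("status", {})
        phase = status.get("phase", "Unknown")

        # A waiting container usually carries the most specific reason.
        reason = None
        for container in status.get("containerStatuses", []):
            waiting = container.get("state", {}).get("waiting")
            if waiting:
                reason = waiting.get("reason", "Waiting")
                break

        # Fall back to the pod phase for pods that never started.
        if reason is None and phase not in ("Running", "Succeeded"):
            reason = phase

        if reason is not None:
            problems.append((name, reason))
    return problems
```

Each reported pod can then be examined with `kubectl describe pod` and `kubectl logs`, and the findings compared against the manifests in the k8s directory.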