test

mitchross · mitchross · commit 76b7a8817be2 · 2026-02-23T08:49:15.000-05:00
diff --git a/infrastructure/controllers/kyverno/kustomization.yaml b/infrastructure/controllers/kyverno/kustomization.yaml
@@ -7,6 +7,7 @@ resources:
 - policies/volsync-pvc-backup-restore.yaml
 - policies/volsync-nfs-inject.yaml
 - policies/volsync-orphan-cleanup.yaml
+- policies/vpa-auto-create.yaml
 helmCharts:
 - name: kyverno
   repo: https://kyverno.github.io/kyverno
diff --git a/infrastructure/controllers/kyverno/policies/vpa-auto-create.yaml b/infrastructure/controllers/kyverno/policies/vpa-auto-create.yaml
@@ -0,0 +1,75 @@
+---
+apiVersion: kyverno.io/v1
+kind: ClusterPolicy
+metadata:
+  name: vpa-auto-create
+  annotations:
+    policies.kyverno.io/title: Auto-create VPA for Deployments and StatefulSets
+    policies.kyverno.io/description: >-
+      Automatically generates a VerticalPodAutoscaler in Off mode for every
+      Deployment and StatefulSet. VPA recommendations can be read via
+      kubectl get vpa -A to right-size resource requests.
+spec:
+  mutateExistingOnPolicyUpdate: false
+  background: true
+  rules:
+    - name: generate-vpa-for-deployment
+      match:
+        any:
+          - resources:
+              kinds:
+                - Deployment
+              operations:
+                - CREATE
+                - UPDATE
+      exclude:
+        any:
+          - resources:
+              namespaces:
+                - kube-system
+                - kyverno
+                - vertical-pod-autoscaler
+      generate:
+        synchronize: true
+        apiVersion: autoscaling.k8s.io/v1
+        kind: VerticalPodAutoscaler
+        name: "{{request.object.metadata.name}}"
+        namespace: "{{request.object.metadata.namespace}}"
+        data:
+          spec:
+            targetRef:
+              apiVersion: apps/v1
+              kind: Deployment
+              name: "{{request.object.metadata.name}}"
+            updatePolicy:
+              updateMode: "Off"
+    - name: generate-vpa-for-statefulset
+      match:
+        any:
+          - resources:
+              kinds:
+                - StatefulSet
+              operations:
+                - CREATE
+                - UPDATE
+      exclude:
+        any:
+          - resources:
+              namespaces:
+                - kube-system
+                - kyverno
+                - vertical-pod-autoscaler
+      generate:
+        synchronize: true
+        apiVersion: autoscaling.k8s.io/v1
+        kind: VerticalPodAutoscaler
+        name: "{{request.object.metadata.name}}"
+        namespace: "{{request.object.metadata.namespace}}"
+        data:
+          spec:
+            targetRef:
+              apiVersion: apps/v1
+              kind: StatefulSet
+              name: "{{request.object.metadata.name}}"
+            updatePolicy:
+              updateMode: "Off"
diff --git a/infrastructure/controllers/vertical-pod-autoscaler/README.md b/infrastructure/controllers/vertical-pod-autoscaler/README.md
@@ -0,0 +1,49 @@
+# Vertical Pod Autoscaler (VPA)
+
+VPA monitors actual CPU/memory usage and recommends optimal resource requests for pods.
+
+## How It Works
+
+VPA is deployed in **Off mode** — it generates recommendations but does not apply them. A Kyverno ClusterPolicy (`vpa-auto-create`) automatically creates a VPA resource for every Deployment and StatefulSet in the cluster (excluding system namespaces).
+
+When you're ready to let VPA auto-tune, change the `updateMode` to `InPlaceOrRecreate` (K8s 1.35 GA feature — resizes pods without restarting them).
+
+## Reading Recommendations
+
+```bash
+# Quick summary of all VPA recommendations
+kubectl get vpa -A -o custom-columns=\
+NAMESPACE:.metadata.namespace,\
+NAME:.metadata.name,\
+CPU:.status.recommendation.containerRecommendations[0].target.cpu,\
+MEM:.status.recommendation.containerRecommendations[0].target.memory
+
+# Full detail for a specific app
+kubectl describe vpa <name> -n <namespace>
+```
+
+Recommendations include four values per container:
+- **target** — what VPA thinks you should set
+- **lowerBound** — minimum safe value
+- **upperBound** — max it would recommend
+- **uncappedTarget** — ideal ignoring any min/max constraints
+
+## Components
+
+| Component | Purpose |
+|-----------|---------|
+| **Recommender** | Analyzes metrics, generates recommendations |
+| **Updater** | Applies changes when mode is not Off (evicts or in-place resizes) |
+| **Admission Controller** | Sets resources on new pods when mode is not Off |
+
+## Dependencies
+
+- **metrics-server** (`infrastructure/controllers/metrics-server/`) — provides the `metrics.k8s.io` API that VPA reads from
+- **Kyverno** — auto-generates VPA resources via `vpa-auto-create` ClusterPolicy
+
+## Notes
+
+- VPA only tracks CPU and memory — GPU (`nvidia.com/gpu`) and ephemeral-storage are not managed
+- Recommendations need a few hours of pod runtime to stabilize
+- Upper bounds will be very wide initially and tighten over days
+- GPU workloads will show low CPU/memory recommendations since compute happens on GPU VRAM