- What work did the SIG do this year that should be highlighted?
SIG Node continues its streak as the record holding SIG for most KEPs each release. 2025 actually exceeded the velocity of 2024, with 1.33 merging 33 KEPs, 1.34 merging 32 and 1.35 merging 35! Many major initiatives across many different areas were advanced. Particularly, many long-standing KEPs were moved to stable, which represents an overarching initative to close out KEPs and stabalize the project.
The sheer number of KEPs makes summarizing the progress almost tedious, but look below for the full list of work done. The SIG focused largely on DRA, as well as other initatives that advance inference serving use-cases, but also made advancements in the cpu and memory manager.
The KEP Wrangler process, started in 2024, has continued, and some of the SIG's success and velocity can be attributed to it. It's also been an opportunity for contributors to the SIG to step up as leaders. A special shout out goes to Sreeram Venkitesh who stepped into the role of Wrangler lead.
The SIG is happy to sponsor a new sub-project: node readiness controller, which was spun out of an effort to have a clearer signal for foundational node workloads to signal they are ready to run end-user workloads, as well as two new working groups: WG node lifecycle and WG checkpoint restore.
-
Are there any areas and/or subprojects that your group needs help with (e.g. fewer than 2 active OWNERS)?
-
Did you have community-wide updates in 2025 (e.g. KubeCon talks)?
- Kubecon EU 2025 maintainers track
- Kubecon NA 2025 maintainers track
- KEP work in 2025 (v1.33, v1.34, v1.35):
-
Alpha
- 4188 - New kubelet gRPC API with endpoint returning local pods information - v1.35
- 4960 - Container Stop Signals - v1.33
- 5328 - Node Declared Features - v1.35
- 5394 - PSI based Node Conditions - v1.34
- 5419 - In-Place Pod-Level Resources Resize - v1.35
- 5526 - Pod-Level Resource Managers - v1.35
- 5532 - Restart All Containers on Container Exits - v1.35
- 5607 - Allow HostNetwork Pods to Use User Namespaces - v1.35
-
Beta
- 127 - Support User Namespaces - v1.35
- 2033 - Rootless mode - v1.35
- 2371 - cAdvisor-less, CRI-full Container and Pod Stats - v1.35
- 2535 - Ensure Secret Pulled Images - v1.35
- 2837 - KEP Template - v1.34
- 2862 - Fine grained Kubelet API authorization - v1.33
- 3695 - Extend the PodResources API to include resources allocated by DRA - v1.34
- 3721 - Support for env files. - v1.35
- 4205 - Expose PSI Metrics - v1.34
- 4265 - Add ProcMount option - v1.33
- 4680 - Add Resource Health Status to the Pod Status for Device Plugin and DRA - v1.35
- 4742 - Node Topologies via Downward API - v1.35
- 4800 - Split UnCoreCache Toplogy Awareness in CPU Manager - v1.34
- 5307 - Container Restart Policy - v1.35
- 5573 - Remove cgroup v1 support - v1.35
- 5593 - Configure the max CrashLoopBackOff delay - v1.35
-
Stable
- 1287 - In-place Update of Pod Resources - v1.35
- 2008 - Forensic Container Checkpointing - v1.33
- 2400 - Node system swap support - v1.34
- 2625 - SMT aware cpumanager policy - v1.33
- 2902 - CPUManager Policy Option to Distribute CPUs Across NUMA Nodes Instead of Packing Them - v1.35
- 3288 - Split Stdout and Stderr Log Stream of Container - v1.34
- 3619 - Fine grained SupplementalGroups control - v1.35
- 3673 - Kubelet limit of Parallel Image Pulls - v1.35
- 3857 - Recursive read-only mounts - v1.33
- 3960 - Pod lifecycle sleep action - v1.34
- 3983 - Add support for a kubelet drop-in configuration directory - v1.35
- 4033 - Discover cgroup driver from CRI - v1.34
- 4176 - New CPUManager Static Policy which spread hyperthreads across physical CPUs to better utilize CPU Cache - v1.33
- 4210 - ImageMaximumGCAge in Kubelet - v1.35
- 4216 - Image pull per runtime class - v1.35
- 4369 - Allow special characters environment variable - v1.34
- 4381 - DRA Structured Parameters - v1.34
- 4438 - Restarting sidecar containers during Pod termination - v1.35
- 4540 - Add CPUManager policy option to restrict reservedSystemCPUs to system daemons and interrupt processing - v1.35
- 4622 - New TopologyManager Policy which configure the value of maxAllowableNUMANodes - v1.35
- 4639 - OCI images as VolumeSource - v1.35
- 4817 - Resource Claim Status With Possible Standardized Network Interface Data - v1.35
- 4818 - Allow zero value for Sleep Action of PreStop Hook - v1.34
- 5067 - Pod Generation - v1.35
- 753 - Sidecar Containers - v1.33
New in 2025:
- node-readiness-controller Continuing:
- ci-testing
- cri-api
- cri-client
- cri-tools
- kernel-module-management
- kubelet
- node-api
- node-feature-discovery
- node-problem-detector
- resource-management
- security-profiles-operator
New in 2025:
- Checkpoint Restore
- Node Lifecycle Retired in 2025:
- Policy Continuing:
- Batch
- Device Management
- Serving
- Structured Logging
Operational tasks in sig-governance.md:
- README.md reviewed for accuracy and updated if needed
- CONTRIBUTING.md reviewed for accuracy and updated if needed
- Other contributing docs (e.g. in devel dir or contributor guide) reviewed for accuracy and updated if needed
- Subprojects list and linked OWNERS files in sigs.yaml reviewed for accuracy and updated if needed
- SIG leaders (chairs, tech leads, and subproject leads) in sigs.yaml are accurate and active, and updated if needed
- Meeting notes and recordings for 2025 are linked from README.md and updated/uploaded if needed