Commit 3003b33

Add a brief outline comment in the dual-pods controller source
Signed-off-by: Mike Spreitzer <mspreitz@us.ibm.com>
1 parent 9dffa27 commit 3003b33

1 file changed

Lines changed: 16 additions & 0 deletions

pkg/controller/dual-pods/controller.go

@@ -39,6 +39,22 @@ import (
 	genctlr "github.com/llm-d-incubation/llm-d-fast-model-actuation/pkg/controller/generic"
 )
 
+// This package implements the dual-pods controller.
+
+// The controller works in the context of one Kubernetes API namespace.
+
+// A Pod is a server-requesting Pod if it has the server patch annotation.
+// A Pod is a server-running Pod if it has a controlling OwnerReference to
+// a Pod (the server-requesting Pod).
+
+// There are two types of item in the controller's work queue.
+// One is a reference to the gpu-map ConfigMap.
+
+// The other type of queue item is a reference to an inference server.
+// This reference carries the inference server's UID and the name
+// of the server-requesting Pod.
+// An inference server's UID is the UID of the server-requesting Pod.
+
 const ControllerName = "dual-pods-controller"
 
 // GPUMapName is the name of the ConfigMap(s) parsed to discover the mapping from GPU UUID to location.
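
The outline comment added above can be illustrated with a minimal, self-contained Go sketch. It is not the controller's actual code. The types Pod, OwnerReference, ConfigMapRef and InferenceServerRef, the helper functions, and the annotation key are all hypothetical stand-ins invented for illustration; the diff only says "the server patch annotation" without giving its key, and a real controller would presumably use the Kubernetes API types rather than these simplified structs.

```go
package main

import "fmt"

// serverPatchAnnotation is a HYPOTHETICAL placeholder key. The commit only
// refers to "the server patch annotation" and does not show its real name.
const serverPatchAnnotation = "example.com/server-patch"

// OwnerReference is a simplified stand-in for the Kubernetes type.
type OwnerReference struct {
	Kind       string
	Name       string
	UID        string
	Controller bool
}

// Pod is a simplified stand-in carrying only the fields the outline mentions.
type Pod struct {
	Name            string
	UID             string
	Annotations     map[string]string
	OwnerReferences []OwnerReference
}

// IsServerRequesting reports whether the Pod has the server patch annotation.
func IsServerRequesting(p Pod) bool {
	_, ok := p.Annotations[serverPatchAnnotation]
	return ok
}

// IsServerRunning reports whether the Pod has a controlling OwnerReference
// to a Pod (which, per the outline, is the server-requesting Pod).
func IsServerRunning(p Pod) bool {
	for _, ref := range p.OwnerReferences {
		if ref.Controller && ref.Kind == "Pod" {
			return true
		}
	}
	return false
}

// QueueItem is one of the two kinds of work queue item in the outline.
type QueueItem interface{ isQueueItem() }

// ConfigMapRef refers to the gpu-map ConfigMap.
type ConfigMapRef struct{ Name string }

// InferenceServerRef refers to an inference server. Its UID is the UID of
// the server-requesting Pod; RequesterName is that Pod's name.
type InferenceServerRef struct {
	UID           string
	RequesterName string
}

func (ConfigMapRef) isQueueItem()       {}
func (InferenceServerRef) isQueueItem() {}

// ItemForRequester builds the queue item for a server-requesting Pod.
func ItemForRequester(p Pod) InferenceServerRef {
	return InferenceServerRef{UID: p.UID, RequesterName: p.Name}
}

// Describe shows how a worker might dispatch on the item type.
func Describe(item QueueItem) string {
	switch it := item.(type) {
	case ConfigMapRef:
		return "configmap " + it.Name
	case InferenceServerRef:
		return "inference server " + it.UID + " requested by " + it.RequesterName
	default:
		return "unknown item"
	}
}

func main() {
	requester := Pod{
		Name:        "requester-0",
		UID:         "uid-123",
		Annotations: map[string]string{serverPatchAnnotation: "{}"},
	}
	runner := Pod{
		Name: "runner-0",
		UID:  "uid-456",
		OwnerReferences: []OwnerReference{
			{Kind: "Pod", Name: requester.Name, UID: requester.UID, Controller: true},
		},
	}
	fmt.Println(IsServerRequesting(requester), IsServerRunning(runner))
	fmt.Println(Describe(ItemForRequester(requester)))
	fmt.Println(Describe(ConfigMapRef{Name: "gpu-map"}))
}
```

Both item types in this sketch are plain comparable structs, so equal references compare equal; that property is an assumption chosen for the sketch, since the commit does not show how the real queue items are represented.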
