File tree Expand file tree Collapse file tree
Expand file tree Collapse file tree Original file line number Diff line number Diff line change @@ -39,6 +39,22 @@ import (
3939 genctlr "github.com/llm-d-incubation/llm-d-fast-model-actuation/pkg/controller/generic"
4040)
4141
42+ // This package implements the dual-pods controller.
43+
44+ // The controller works in the context of one Kubernetes API namespace.
45+
46+ // A Pod is a server-requesting Pod if it has the server patch annotation.
47+ // A Pod is a server-running Pod if has a controlling OwnerReference to
48+ // a Pod (the server-requesting Pod).
49+
50+ // There are two types of item in the controller's work queue.
51+ // One is a reference to the gpu-map ConfigMap.
52+
53+ // The other type of queue item is a reference to an inference server.
54+ // This reference carries the inference server's UID and the name
55+ // of the server-requesting Pod.
56+ // An inference server's UID is the UID of the server-requesting Pod.
57+
4258const ControllerName = "dual-pods-controller"
4359
4460// GPUMapName is the name of the ConfigMap(s) parsed to discover the mapping from GPU UUID to location.
You can’t perform that action at this time.
0 commit comments