Skip to content

Commit 93b468c

Browse files
jihyeonRyujryu6965
andauthored
feat: add nvidia-platform components for NVIDIA Dynamo LLM inference (#102)
* add k8s platform setting for debuging Signed-off-by: jihyeonRyu <jryu@nvidia.com> * gpu operator installation on EKS Signed-off-by: jihyeonRyu <jryu@nvidia.com> * Install dynamo platform Signed-off-by: jihyeonRyu <jryu@nvidia.com> * Install dynamo platform Signed-off-by: jihyeonRyu <jryu@nvidia.com> * fix bug for installing dynamo-platform Signed-off-by: jihyeonRyu <jryu@nvidia.com> * vllm aggregated serving mode Signed-off-by: jihyeonRyu <jryu@nvidia.com> * fix blocking issue after deployment Signed-off-by: jihyeonRyu <jryu@nvidia.com> * fix to local pvc -> nfs Signed-off-by: jihyeonRyu <jryu@nvidia.com> * remove stale rock file automatically Signed-off-by: jihyeonRyu <jryu@nvidia.com> * dynamo kv router and kv cache offloading Signed-off-by: jihyeonRyu <jryu@nvidia.com> * fix deployment name bug, add EP setting Signed-off-by: jihyeonRyu <jryu@nvidia.com> * add readme Signed-off-by: jihyeonRyu <jryu@nvidia.com> * udate dynamo version and fix monitoring error * add dra mode * fix bug * update dynamo version and fix bugs Signed-off-by: jihyeonRyu <jryu@nvidia.com> * add monitoring filter option: model name Signed-off-by: jihyeonRyu <jryu@nvidia.com> * fix v0.9.0 dynamo crd bug: missing component Signed-off-by: jihyeonRyu <jryu@nvidia.com> * add aiperf benchmark and benchmark monitoring dashboard Signed-off-by: jihyeonRyu <jryu@nvidia.com> * fix dashboard bug Signed-off-by: jihyeonRyu <jryu@nvidia.com> * change discovery backend to etcd Signed-off-by: jihyeonRyu <jryu@nvidia.com> * aiconfigurator setup Signed-off-by: jihyeonRyu <jryu@nvidia.com> * fix benchmark bug Signed-off-by: jihyeonRyu <jryu@nvidia.com> * fix benchmark dashboard bug Signed-off-by: jihyeonRyu <jryu@nvidia.com> * fix dashboard bug Signed-off-by: jihyeonRyu <jryu@nvidia.com> * minor change Signed-off-by: jihyeonRyu <jryu@nvidia.com> * minor change of benchmark Signed-off-by: jihyeonRyu <jryu@nvidia.com> * update readme and dashboard lang Signed-off-by: jihyeonRyu <jryu@nvidia.com> * delete ingress for vllm Signed-off-by: jihyeonRyu <jryu@nvidia.com> --------- Signed-off-by: jihyeonRyu <jryu@nvidia.com> Co-authored-by: jihyeonRyu <jryu@nvidia.com>
1 parent 8c8bc04 commit 93b468c

26 files changed

+8702
-2
lines changed

β€Ž.envβ€Ž

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,7 @@
1+
# Platform type: eks | k8s
2+
PLATFORM=k8s
3+
4+
# EKS Settings (required when PLATFORM=eks)
15
REGION=us-west-2
26
EKS_CLUSTER_NAME=genai-on-eks
37
EKS_MODE=auto

β€Žcliβ€Ž

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -35,7 +35,10 @@ const configPath = path.join(DIR, "config.json");
3535
let config = JSON.parse(fs.readFileSync(configPath, "utf8"));
3636
mergeConfig("config.workshop.json");
3737
mergeConfig("config.local.json");
38-
process.env.KUBECONFIG = path.join(DIR, ".kubeconfig");
38+
// Only set custom kubeconfig for EKS mode; K8s mode uses existing kubeconfig
39+
if (process.env.PLATFORM !== "k8s") {
40+
process.env.KUBECONFIG = path.join(DIR, ".kubeconfig");
41+
}
3942
const TERRAFORM_DIR = path.join(DIR, "terraform");
4043
const COMPONENTS_DIR = path.join(DIR, "components");
4144
const EXAMPLES_DIR = path.join(DIR, "examples");

β€Žcli-menu.jsonβ€Ž

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,17 @@
11
{
22
"componentCategories": [
3+
{
4+
"dir": "nvidia-platform",
5+
"name": "NVIDIA Platform",
6+
"components": [
7+
{ "dir": "gpu-operator", "name": "GPU Operator" },
8+
{ "dir": "monitoring", "name": "Monitoring (Prometheus + Grafana)" },
9+
{ "dir": "dynamo-platform", "name": "Dynamo Platform" },
10+
{ "dir": "dynamo-vllm", "name": "Dynamo vLLM Serving" },
11+
{ "dir": "benchmark", "name": "AIPerf Benchmark" },
12+
{ "dir": "aiconfigurator", "name": "AIConfigurator (Auto Config + SLA Deploy)" }
13+
]
14+
},
315
{
416
"dir": "ai-gateway",
517
"name": "AI Gateway",

0 commit comments

Comments
Β (0)