Commit 93b468c
feat: add nvidia-platform components for NVIDIA Dynamo LLM inference (#102)
* add k8s platform setting for debuging
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* gpu operator installation on EKS
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* Install dynamo platform
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* Install dynamo platform
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* fix bug for installing dynamo-platform
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* vllm aggregated serving mode
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* fix blocking issue after deployment
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* fix to local pvc -> nfs
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* remove stale rock file automatically
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* dynamo kv router and kv cache offloading
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* fix deployment name bug, add EP setting
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* add readme
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* udate dynamo version and fix monitoring error
* add dra mode
* fix bug
* update dynamo version and fix bugs
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* add monitoring filter option: model name
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* fix v0.9.0 dynamo crd bug: missing component
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* add aiperf benchmark and benchmark monitoring dashboard
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* fix dashboard bug
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* change discovery backend to etcd
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* aiconfigurator setup
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* fix benchmark bug
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* fix benchmark dashboard bug
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* fix dashboard bug
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* minor change
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* minor change of benchmark
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* update readme and dashboard lang
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
* delete ingress for vllm
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
---------
Signed-off-by: jihyeonRyu <jryu@nvidia.com>
Co-authored-by: jihyeonRyu <jryu@nvidia.com>1 parent 8c8bc04 commit 93b468c
File tree
26 files changed
+8702
-2
lines changed- components/nvidia-platform
- aiconfigurator
- benchmark
- dynamo-platform
- crds
- dynamo-vllm
- gpu-operator
- monitoring
- dashboards
26 files changed
+8702
-2
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
1 | 5 | | |
2 | 6 | | |
3 | 7 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
35 | 35 | | |
36 | 36 | | |
37 | 37 | | |
38 | | - | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
39 | 42 | | |
40 | 43 | | |
41 | 44 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
2 | 2 | | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
3 | 15 | | |
4 | 16 | | |
5 | 17 | | |
| |||
0 commit comments