You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
update(GHA): bump action versions and simplify lint (#197)
* update(GHA): bump action versions and simplify lint
- only run if changes found on charts
- uplift action version to use new one
- remove download chart from GIE and GW
- move lychee to use action and config move to .github
Signed-off-by: Wen Zhou <wenzhou@redhat.com>
* fix: markdown lint
Signed-off-by: Wen Zhou <wenzhou@redhat.com>
---------
Signed-off-by: Wen Zhou <wenzhou@redhat.com>
Copy file name to clipboardExpand all lines: README.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -44,7 +44,7 @@ Note that the field `llm-d.ai/role` will be addition in addition to the labels s
44
44
45
45
### HTTPRoute
46
46
47
-
In addition to deploying the GAIE chart, an `HTTPRoute` is typically required to connect the `Gateway` to the `InferencePool`. Creating an HTTPRoute is not part of either chart. Some examples are provided [here](https://github.com/llm-d-incubation/llm-d-modelservice/blob/main/examples/README.md#httproute).
47
+
In addition to deploying the GAIE chart, an `HTTPRoute` is typically required to connect the `Gateway` to the `InferencePool`. Creating an HTTPRoute is not part of either chart. Some examples are provided [here](examples/README.md#httproute-examples).
Copy file name to clipboardExpand all lines: examples/README.md
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -14,11 +14,11 @@ helm repo update
14
14
|---------|-------------|----------------------|
15
15
|[`values-cpu.yaml`](#1-cpu-only)| CPU-only inference example | Single node, no GPU required |
16
16
|[`values-pd.yaml`](#2-pd-disaggregation)| Prefill/decode disaggregation example | Multi-GPU, demonstrates P/D splitting |
17
-
|[`values-xpu.yaml`](#5-intel-xpu-examples)| Intel XPU single-node example | Intel Data Center GPU Max |
18
-
|[`pvc/`](#4-loading-a-model-from-a-pvc)| Persistent volume examples | Shows different storage options |
17
+
|[`values-xpu.yaml`](#4-intel-xpu-examples)| Intel XPU single-node example | Intel Data Center GPU Max |
18
+
|[`pvc/`](#3-loading-a-model-from-a-pvc)| Persistent volume examples | Shows different storage options |
19
19
|[`dra/`](#6-dynamic-resource-allocation)| Dynamic Resource Allocation (DRA) examples | Shows different DRA use cases |
20
20
21
-
All the examples assume a `Gateway` and GAIE configuration have been deployed. See the [llm-d guides](https://github.com/llm-d/llm-d/tree/main/guides) for examples. Further, an `HTTPRoute` must be deployed. Some examples of `HTTPRoute` is provided [below](https://github.com/llm-d-incubation/llm-d-modelservice/blob/main/examples/README.md#httproute).
21
+
All the examples assume a `Gateway` and GAIE configuration have been deployed. See the [llm-d guides](https://github.com/llm-d/llm-d/tree/main/guides) for examples. Further, an `HTTPRoute` must be deployed. Some examples of `HTTPRoute` is provided [below](#httproute-examples).
0 commit comments