Commit 720df67

Fix demo links (#15)
Signed-off-by: Brent Salisbury <bsalisbu@redhat.com>
1 parent 710973b commit 720df67

1 file changed: +4 -3 lines changed

deployment/kuadrant-openshift/README-token-rate-limiting-openshift.md

Lines changed: 4 additions & 3 deletions
@@ -8,19 +8,21 @@ These demos require a Kuadrant wasm-shim fork with metric support `oci://ghcr.io
 
 ### Demo: Token Rate Limiting User Quotas
 
+> ⚠ If the videos do not render for you, please the [comments in PR-7](https://github.com/redhat-et/maas-billing/pull/7#issuecomment-3179989695) to view them.
+
 1. Generate requests across various users
 2. Hit token caps on each of the user's policies based on their group
 3. View token metrics in Prometheus
 
-<video src="https://github.com/user-attachments/assets/2c205809-b7fa-466f-b74a-e0c6c9c26569" controls></video>
+https://github.com/user-attachments/assets/4147912c-bd2b-4926-a937-fed99051163d
 
 ### Demo: Token Rate Policy and Charge Metering
 
 1. Increase `TokenRateLimitPolicy` quota
 2. Generate continous user request load across users in the background
 3. View token metrics and chargeback metering in Prometheus (Premium group: $0.008 per/token | Freemium group: $0.005 per/token)
 
-<video src="https://github.com/user-attachments/assets/c08d3e39-3d70-49ff-a67e-1a062eb339b0" controls></video>
+https://github.com/user-attachments/assets/3bf87c3d-0956-4104-80d7-676177c5a79a
 
 ## Architecture Overview
 
@@ -671,4 +673,3 @@ If token rate limiting stops working with the custom image:
 kubectl -n llm delete wasmplugin kuadrant-inference-gateway
 # The operator will recreate it automatically
 ```
-
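
Note on the chargeback step touched by the first hunk: the README quotes per-token rates of $0.008 for the Premium group and $0.005 for the Freemium group. As a rough illustration of the arithmetic only, the sketch below multiplies a token count by those rates. The group keys, the `charge_for` helper, and the use of `Decimal` are assumptions made for this example and are not taken from the repository; the actual metering in the demo is done with Prometheus metrics.

```python
from decimal import Decimal

# Per-token rates as quoted in the README step.
# The dictionary keys are hypothetical labels chosen for this sketch.
RATE_PER_TOKEN = {
    "premium": Decimal("0.008"),
    "freemium": Decimal("0.005"),
}


def charge_for(group: str, tokens: int) -> Decimal:
    """Return the dollar charge for `tokens` tokens consumed by `group`."""
    if tokens < 0:
        raise ValueError("tokens must be non-negative")
    # Decimal avoids binary floating-point rounding on currency values.
    return RATE_PER_TOKEN[group] * tokens


if __name__ == "__main__":
    for group in ("premium", "freemium"):
        print(group, charge_for(group, 1000))
```

Under these assumptions, 1,000 tokens would cost $8 for a Premium user and $5 for a Freemium user.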
