Skip to content

Add README for queueing model analyzer#893

Merged
mamy-CS merged 6 commits intollm-d:mainfrom
vishakha-ramani:model-based
Apr 2, 2026
Merged

Add README for queueing model analyzer#893
mamy-CS merged 6 commits intollm-d:mainfrom
vishakha-ramani:model-based

Conversation

@vishakha-ramani
Copy link
Copy Markdown
Contributor

Adds a comprehensive README for the QueueingModelAnalyzer at internal/engines/analyzers/queueingmodel/README.md.

Structured as a hybrid document: operational content (activation, ConfigMap reference, SLO targeting, how it works) first, with a theoretical background appendix (service time model, steady-state analysis via Little's Law, TTFT/ITL derivation, cold start bootstrap) at the end.

@lionelvillard
Copy link
Copy Markdown
Collaborator

@vishakha-ramani documentation is placed under docs. Can you move it there?

@lionelvillard lionelvillard enabled auto-merge (squash) April 1, 2026 18:37
@lionelvillard
Copy link
Copy Markdown
Collaborator

/ok-to-test

lionelvillard
lionelvillard previously approved these changes Apr 1, 2026
@github-actions
Copy link
Copy Markdown
Contributor

github-actions bot commented Apr 1, 2026

🚀 Kind E2E (full) triggered by /ok-to-test

View the Kind E2E workflow run

@github-actions
Copy link
Copy Markdown
Contributor

github-actions bot commented Apr 1, 2026

🚀 OpenShift E2E — approve and run (/ok-to-test)

View the OpenShift E2E workflow run

@github-actions
Copy link
Copy Markdown
Contributor

github-actions bot commented Apr 1, 2026

GPU Pre-flight Check ✅

GPUs are available for e2e-openshift tests. Proceeding with deployment.

Resource Total Allocated Available
GPUs 50 28 22
Cluster Value
Nodes 16 (7 with GPUs)
Total CPU 993 cores
Total Memory 10383 Gi
GPUs required 4 (min) / 6 (recommended)

@mamy-CS
Copy link
Copy Markdown
Collaborator

mamy-CS commented Apr 2, 2026

Please rebase @vishakha-ramani

auto-merge was automatically disabled April 2, 2026 15:06

Head branch was pushed to by a user without write access

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@mamy-CS
Copy link
Copy Markdown
Collaborator

mamy-CS commented Apr 2, 2026

/ok-to-test

@github-actions
Copy link
Copy Markdown
Contributor

github-actions bot commented Apr 2, 2026

🚀 Kind E2E (full) triggered by /ok-to-test

View the Kind E2E workflow run

@github-actions
Copy link
Copy Markdown
Contributor

github-actions bot commented Apr 2, 2026

🚀 OpenShift E2E — approve and run (/ok-to-test)

View the OpenShift E2E workflow run

@github-actions
Copy link
Copy Markdown
Contributor

github-actions bot commented Apr 2, 2026

GPU Pre-flight Check ✅

GPUs are available for e2e-openshift tests. Proceeding with deployment.

Resource Total Allocated Available
GPUs 50 32 18
Cluster Value
Nodes 16 (7 with GPUs)
Total CPU 993 cores
Total Memory 10383 Gi
GPUs required 4 (min) / 6 (recommended)

@mamy-CS mamy-CS merged commit f097ac8 into llm-d:main Apr 2, 2026
16 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants