Add baseline configuration for performance comparison of llm-d by oshfeder · Pull Request #8 · llm-d-incubation/llm-d-skills

oshfeder · 2026-04-16T14:00:46Z

Baseline configuration:
uses a new service endpoint (instead of the llm-d scheduling gateway)
runs on the same model pods
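
For illustration only, a baseline Service of this kind might look roughly like the sketch below. It is not the manifest added in this PR (that lives in `skills/compare-llm-d-configurations/llm-d-baseline-model-server-svc.yaml`). The Service name and the port numbers are assumptions; the selector label is the one referenced in the skill text under review.

```yaml
# Hypothetical sketch of a baseline Service that sends traffic straight to
# the model server pods, bypassing the llm-d scheduling gateway.
apiVersion: v1
kind: Service
metadata:
  name: llm-d-baseline-model-server    # assumed name, not taken from the PR
spec:
  type: ClusterIP
  selector:
    llm-d.ai/inference-serving: "true" # pod label quoted in the review thread
  ports:
    - name: http
      protocol: TCP
      port: 8000                       # assumed service port
      targetPort: 8000                 # assumed model server container port
```

Because the selector matches the same pods the gateway fronts, both runs would exercise identical model servers and differ only in how requests are routed to them.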

Signed-off-by: Oshrit Feder <oshritf@il.ibm.com>

rachelt44 · 2026-04-16T18:39:05Z

+   kubectl apply -f skills/compare-llm-d-configurations/llm-d-baseline-model-server-svc.yaml -n $NAMESPACE
+   ```
+
+   Or if that file is not available locally, create it inline:


The file is always available locally because it is part of the skill, so there is no need to repeat it here.

rachelt44 · 2026-04-16T18:40:33Z


+### 0.5 Check for Baseline Configuration
+
+If either run is labeled as "baseline" (doesnt use llm-d scheduling) or the user indicates one configuration is a baseline run, perform the following checks:


This assumes that the llm-d stack is already up and running, but we may want to deploy it using the skill first and then create the service.

rachelt44 · 2026-04-16T18:40:51Z

+
+   If no endpoints exist, inform the user that pods with the label `llm-d.ai/inference-serving=true` must be running for the baseline service to work. Check the running pod labels, list them to the user and suggest which label to use for the sellector (update the baseline service accordingly).
+
+4. **Set baseline base_url**: When running the benchmark for the baseline configuration (as Run A or as Run B), ensure the Run's config.yaml uses:


This part should be moved to the step that runs the benchmark.

rachelt44 · 2026-04-16T18:41:35Z

@@ -0,0 +1,13 @@
+apiVersion: v1


Please move this YAML file to a subdirectory called `scripts` (or `resources`).

Add baseline configuration for performance comparison of llm-d

a1bf77e

Signed-off-by: Oshrit Feder <oshritf@il.ibm.com>

rachelt44 requested changes Apr 16, 2026

oshfeder wants to merge 1 commit into llm-d-incubation:main from oshfeder:baseline_config



