Removed allocation struct used in modelbased scaling by asm582 · Pull Request #553 · llm-d/llm-d-workload-variant-autoscaler

asm582 · 2026-01-09T14:48:31Z

The allocation struct has ITL, TTFT, and load-average fields. These were handled by model-based scaling, which saturation-based scaling does not require, so the struct is no longer part of the API.

lionelvillard · 2026-01-09T14:55:42Z

+// including the rate of incoming requests (ArrivalRate) and the average
+// length of each request (AvgLength). Both fields are specified as strings
+// to allow flexible input formats.
+type LoadProfile struct {


Why do we still need this?

Backward compatibility from an algorithm perspective. I assume we will soon need features for model-based work, either to handle wobble or to go beyond it. If this is absolutely not needed, we can remove it later.
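For readers following the thread, here is a minimal sketch of how string-typed fields like the ones in the hunk above could be consumed. Only the type name `LoadProfile` and the field names `ArrivalRate` and `AvgLength` come from the diff comment; the JSON tags, the helper `parseLoadProfile`, and the sample values are assumptions made for illustration and are not taken from the repository.

```go
package main

import (
	"fmt"
	"strconv"
)

// LoadProfile mirrors the two fields named in the diff comment.
// The JSON tags are assumed; the quoted hunk does not show them.
type LoadProfile struct {
	ArrivalRate string `json:"arrivalRate"`
	AvgLength   string `json:"avgLength"`
}

// parseLoadProfile is a hypothetical helper that converts the
// string-typed fields to float64 and reports which field failed.
func parseLoadProfile(p LoadProfile) (rate, avgLen float64, err error) {
	rate, err = strconv.ParseFloat(p.ArrivalRate, 64)
	if err != nil {
		return 0, 0, fmt.Errorf("arrivalRate %q: %w", p.ArrivalRate, err)
	}
	avgLen, err = strconv.ParseFloat(p.AvgLength, 64)
	if err != nil {
		return 0, 0, fmt.Errorf("avgLength %q: %w", p.AvgLength, err)
	}
	return rate, avgLen, nil
}

func main() {
	// Sample values are made up for the example.
	rate, avgLen, err := parseLoadProfile(LoadProfile{ArrivalRate: "12.5", AvgLength: "256"})
	if err != nil {
		fmt.Println("invalid load profile:", err)
		return
	}
	fmt.Println(rate, avgLen)
}
```

Keeping the fields as strings means validation happens at parse time, so any consumer has to handle the error path shown here.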

lionelvillard · 2026-01-09T14:55:57Z

+	MaxBatch int `json:"maxBatch"`
+
+	// ITLAverage is the average inter token latency for the current allocation.
+	ITLAverage string `json:"itlAverage"`


Is this still needed?

Same comment as above. Until the wobble issue is fixed, I would like to keep it.

lionelvillard · 2026-01-09T14:56:21Z

+	ITLAverage string `json:"itlAverage"`
+
+	// TTFTAverage is the average time to first token for the current allocation
+	TTFTAverage string `json:"ttftAverage"`


Is this still needed?

I kept it as a safeguard in case we ever want to bring back model-based features. The user-facing API does not have these fields for now.
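As a rough illustration of the fields discussed in the two threads above, the sketch below shows how the three quoted fields would serialize. The field names and JSON tags (`maxBatch`, `itlAverage`, `ttftAverage`) are copied from the hunks; the struct name `Allocation` is taken from the PR description, while the helper `encodeAllocation` and the sample values are hypothetical. The struct in the repository may contain further fields that the hunks do not show.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Allocation holds only the three fields visible in the quoted hunks.
// Any other fields of the real struct are omitted here.
type Allocation struct {
	MaxBatch int `json:"maxBatch"`
	// ITLAverage is the average inter token latency for the current allocation.
	ITLAverage string `json:"itlAverage"`
	// TTFTAverage is the average time to first token for the current allocation.
	TTFTAverage string `json:"ttftAverage"`
}

// encodeAllocation is a hypothetical helper that returns the JSON form.
func encodeAllocation(a Allocation) (string, error) {
	b, err := json.Marshal(a)
	if err != nil {
		return "", err
	}
	return string(b), nil
}

func main() {
	// Sample values are invented for the example.
	out, err := encodeAllocation(Allocation{MaxBatch: 8, ITLAverage: "20.5", TTFTAverage: "150.0"})
	if err != nil {
		fmt.Println("encode failed:", err)
		return
	}
	fmt.Println(out)
}
```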

Remove documentation for CurrentAlloc, Allocation, and LoadProfile types that were removed in PR #553 as part of the model-based scaling cleanup.

Changes:
- Remove Allocation and LoadProfile type definitions from CRD reference
- Remove currentAlloc field from VariantAutoscalingStatus documentation
- Update status examples to use desiredOptimizedAlloc instead
- Update VariantAutoscalingStatus description to reflect current state

These types were part of the model-based scaling implementation and are no longer needed for saturation-based scaling.

removed allocation struct used in modelbased scaling

db784e3

lionelvillard reviewed Jan 9, 2026

asm582 requested review from atantawi and clubanderson January 9, 2026 15:12

asm582 marked this pull request as ready for review January 9, 2026 15:12

lionelvillard approved these changes Jan 9, 2026

asm582 merged commit deeb63f into llm-d:main Jan 9, 2026
7 checks passed

This was referenced Jan 9, 2026

docs: Remove references to deprecated Allocation API types #554

Merged

docs: Complete removal of deprecated Allocation API references #557

Closed

ev-shindin pushed a commit to ev-shindin/workload-variant-autoscaler that referenced this pull request Jan 14, 2026

Merge pull request llm-d#553 from asm582/rem_allocation

9c1f149

Removed allocation struct used in modelbased scaling

asm582 merged 1 commit into llm-d:main from asm582:rem_allocation
