Skip to content
Merged
Show file tree
Hide file tree
Changes from 8 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
5 changes: 3 additions & 2 deletions api/v1alpha1/variantautoscaling_types.go
Original file line number Diff line number Diff line change
Expand Up @@ -72,8 +72,9 @@ type OptimizedAlloc struct {
LastRunTime metav1.Time `json:"lastRunTime,omitempty"`

// Accelerator is the type of accelerator for the optimized allocation.
// +kubebuilder:validation:MinLength=2
Accelerator string `json:"accelerator"`
// Deprecated: This field is deprecated and will be removed in a future version. Use node selector or node affinity from scale target instead.
// +optional
Accelerator string `json:"accelerator,omitempty"`

// NumReplicas is the number of replicas for the optimized allocation.
// nil means no optimization decision has been made yet.
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -201,9 +201,9 @@ spec:
allocation based on autoscaling logic.
properties:
accelerator:
description: Accelerator is the type of accelerator for the optimized
allocation.
minLength: 2
description: |-
Accelerator is the type of accelerator for the optimized allocation.
Deprecated: This field is deprecated and will be removed in a future version. Use node selector or node affinity from scale target instead.
type: string
lastRunTime:
description: LastRunTime is the timestamp of the last optimization
Expand All @@ -217,8 +217,6 @@ spec:
format: int32
minimum: 0
type: integer
required:
- accelerator
type: object
type: object
type: object
Expand Down
8 changes: 3 additions & 5 deletions config/crd/bases/llmd.ai_variantautoscalings.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -201,9 +201,9 @@ spec:
allocation based on autoscaling logic.
properties:
accelerator:
description: Accelerator is the type of accelerator for the optimized
allocation.
minLength: 2
description: |-
Accelerator is the type of accelerator for the optimized allocation.
Deprecated: This field is deprecated and will be removed in a future version. Use node selector or node affinity from scale target instead.
type: string
lastRunTime:
description: LastRunTime is the timestamp of the last optimization
Expand All @@ -217,8 +217,6 @@ spec:
format: int32
minimum: 0
type: integer
required:
- accelerator
type: object
type: object
type: object
Expand Down
2 changes: 1 addition & 1 deletion docs/user-guide/crd-reference.md
Original file line number Diff line number Diff line change
Expand Up @@ -44,7 +44,7 @@ _Appears in:_
| Field | Description | Default | Validation |
| --- | --- | --- | --- |
| `lastRunTime` _[Time](https://kubernetes.io/docs/reference/generated/kubernetes-api/v1.32/#time-v1-meta)_ | LastRunTime is the timestamp of the last optimization run. | | |
| `accelerator` _string_ | Accelerator is the type of accelerator for the optimized allocation. | | MinLength: 2 <br /> |
| `accelerator` _string_ | Accelerator is the type of accelerator for the optimized allocation. This field is deprecated and will be removed in a future version. Use node selector or node affinity from scale target instead. | | MinLength: 2 <br /> |
| `numReplicas` _integer_ | NumReplicas is the number of replicas for the optimized allocation.<br />nil means no optimization decision has been made yet. | | Minimum: 0 <br /> |


Expand Down
Loading