Skip to content

Commit 3698807

Browse files
committed
Update memory resource limits in qwen.yaml for improved performance
- Increase memory limit from 6Gi to 12Gi for the container's resource limits. - Adjust memory request from 12Gi to 6Gi to optimize resource allocation.
1 parent 4defb27 commit 3698807

File tree

1 file changed

+2
-2
lines changed
  • clusters/k3s-stpetersburg/apps/ai/kserve

1 file changed

+2
-2
lines changed

clusters/k3s-stpetersburg/apps/ai/kserve/qwen.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -15,11 +15,11 @@ spec:
1515
resources:
1616
limits:
1717
cpu: "2"
18-
memory: 6Gi
18+
memory: 12Gi
1919
nvidia.com/gpu: "1"
2020
requests:
2121
cpu: "1"
22-
memory: 12Gi
22+
memory: 6Gi
2323
nvidia.com/gpu: "1"
2424
---
2525
apiVersion: v1

0 commit comments

Comments
 (0)