Commit c093640
Increase limits_cpu for the short_running_workers (#615)
Increase limits_cpu for the short_running_workers
I am monitoring the hard time limits exceptions since a while, seems like that when we have spikes both in the short running workers and in the long running we sometimes get an hard time limit exception for the celery tasks.
We have spike above the requested limits (as you can see in the today picture) mainly for the short running workers.
We have exceptions for the spikes even when the single worker cpu usage is below the limits_cpu and I am wondering if this limit applies to the sum of all our replicas or just for the single replica?
I would say it is the limit for all the replicas together and in this case we need to increase much more the limits_cpu both for the long running and the short running pods (I would put there 1G of limits_cpu for both kind of workers in this case).
One of the highest cpu requests (for just a single short running worker) I registered:
Reviewed-by: Matej Focko2 files changed
+2
-2
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
250 | 250 | | |
251 | 251 | | |
252 | 252 | | |
253 | | - | |
| 253 | + | |
254 | 254 | | |
255 | 255 | | |
256 | 256 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
206 | 206 | | |
207 | 207 | | |
208 | 208 | | |
209 | | - | |
| 209 | + | |
210 | 210 | | |
211 | 211 | | |
212 | 212 | | |
| |||
0 commit comments