Hi,
I'm running LocalAI 3.4.0 on my Kubernetes cluster. Everything looks fine from the pod's point of view.
I have persistent volumes for models and backends.
I'm only running on CPU for the moment, with a maximum of 4 CPUs and 10 GB of RAM, and 4 threads set through an environment variable.
I'm trying to use gemma-3-4b-it-qat on the llama.cpp backend.
If I say Hello from the web UI, I can see the beginning of an answer, then it stops.
I saw that I get an rpc error:
DBG Sending chunk failed: connection closed
Error rpc error: code = Canceled desc = context canceled
How can I solve this? My resources are fine: I checked in Grafana and I'm not using all of them.
Please help me use this wonderful project.
I don't think I'm the only one with this problem.