llama.cpp server docker not spinning up #422
Unanswered
gianlucagilardi
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello everyone,
I feel completely dumb, but apaprelntly i cannot find a way to make this work....
I have a pretty simple config. yaml to test the system:
I spin up llama-swap with:
docker run -it --rm -p 9292:8080 -v /home/user/models:/models -v /home/user/llama-swap/config.yaml:/app/config.yaml ghcr.io/mostlygeek/llama-swap:vulkanllama swap start ccorrectly; i can access the web interface at :9292 but when i try to load the model (also, via llama swap gui) i get this error:
if i just try to run
docker run --name docker-oss --init --rm --device=/dev/dri --device=/dev/kfd --group-add video -e AMD_VULKAN_ICD=RADV -v /usr/share/vulkan:/usr/share/vulkan:ro -p 10001:8080 -v /home/user/models:/models:ro ghcr.io/ggml-org/llama.cpp:server-vulkan --model /models/oss-120b/gpt-oss-120b.gguf -ngl 999 -c 4096 -b 512 --no-mmap -fa 1 -n -1llama.cpp server runs and answers correctly.
I am sure I am missing something but - for the life of me - I cannot understand what. I suspect the ", env:" part is messing up but have no idea how to get rid of it.
Any idea?
Beta Was this translation helpful? Give feedback.
All reactions