-
Notifications
You must be signed in to change notification settings - Fork 169
Description
hi... i cleaned my ollama directory , re-installed it
took your ollama.exe and placed in the original ollama directory
, i understand i need to always use setvars. i downloaded the llama 3.2 model in r1 , turned it into ollama model but when i run it (via open web ui on the first reply i get an error in the logs:
time=2025-06-12T15:46:24.312+03:00 level=INFO source=runner.go:403 msg="The model had been load by GenAI, ov_model_path: \tmp\llama-3.2-3b-instruct-int4-ov-npu_latest\llama-3.2-3b-instruct-int4-ov-npu , GPU"
time=2025-06-12T15:46:24.453+03:00 level=INFO source=genaiserver.go:397 msg="llama runner started in 20.31 seconds"
time=2025-06-12T15:46:24.454+03:00 level=INFO source=runner.go:240 msg="Request info :\nPOST /completion HTTP/1.1\r\nHost: 127.0.0.1:32628\r\nAccept-Encoding: gzip\r\nContent-Length: 1106\r\nContent-Type: application/json\r\nUser-Agent: Go-http-client/1.1\r\n\r\n{"cache_prompt":true,"frequency_penalty":0,"image_data":null,"main_gpu":0,"max_new_token":2048,"min_p":0,"mirostat":0,"mirostat_eta":0.1,"mirostat_tau":5,"n_keep":4,"n_predict":-1,"presence_penalty":0,"prompt":"### Task:\nGenerate 1-3 broad tags categorizing the main themes of the chat history, along with 1-3 more specific subtopic tags.\n\n### Guidelines:\n- Start with high-level domains (e.g. Science, Technology, Philosophy, Arts, Politics, Business, Health, Sports, Entertainment, Education)\n- Consider including relevant subfields/subdomains if they are strongly represented throughout the conversation\n- If content is too short (less than 3 messages) or too diverse, use only [\"General\"]\n- Use the chat's primary language; default to English if multilingual\n- Prioritize accuracy over specificity\n\n### Output:\nJSON format: { \"tags\": [\"tag1\", \"tag2\", \"tag3\"] }\n\n### Chat History:\n<chat_history>\nUSER: hi.\nASSISTANT: \n</chat_history>","repeat_last_n":64,"repeat_penalty":1,"seed":-1,"stop":null,"stop_id":null,"stream":true,"temperature":1,"top_k":40,"top_p":1,"typical_p":1}"
time=2025-06-12T15:46:24.454+03:00 level=INFO source=genai.go:161 msg="Sampling Parameters - Temperature: 1.00, TopP: 1.00, TopK: 40, RepeatPenalty: 1.00"
panic: runtime error: cgo argument has Go pointer to unpinned Go pointer
goroutine 52 [running]:
github.com/ollama/ollama/genai.GenerateTextWithMetrics.func3(...)
C:/hongbo/test/ollama_ov/genai/genai.go:221
github.com/ollama/ollama/genai.GenerateTextWithMetrics(0x0, {0xc0001c6600?, 0x7ff6e8183320?}, 0xc0004a24e0, 0xc0002108f0)
C:/hongbo/test/ollama_ov/genai/genai.go:221 +0x1d2
github.com/ollama/ollama/genai/runner.(*Server).processBatch(0xc0002f3880)
C:/hongbo/test/ollama_ov/genai/runner/runner.go:150 +0x1eb
github.com/ollama/ollama/genai/runner.(*Server).run(0xc0002f3880, {0x7ff6e78dd5b0, 0xc0004a59a0})
C:/hongbo/test/ollama_ov/genai/runner/runner.go:230 +0x31
created by github.com/ollama/ollama/genai/runner.Execute in goroutine 1
C:/hongbo/test/ollama_ov/genai/runner/runner.go:454 +0x6eb
[GIN] 2025/06/12 - 15:46:24 | 500 | 20.5961542s | 127.0.0.1 | POST "/api/chat"
time=2025-06-12T15:46:24.462+03:00 level=ERROR source=genaiserver.go:255 msg="llama runner terminated" error="exit status 2"
[GIN] 2025/06/12 - 15:51:22 | 200 | 3.4991ms | 127.0.0.1 | GET "/api/tags"
[GIN] 2025/06/12 - 15:51:22 | 200 | 0s | 127.0.0.1 | GET "/api/ps"