You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
docs: fix LM Studio model ID format and add setup guidance
Use full model ID (openai/qwen/qwen3.5-35b-a3b) matching what LM Studio
reports via /v1/models. Add curl step to find exact model ID and
context_window config to match LM Studio's context length.
[LM Studio](https://lmstudio.ai/) runs open-weight models locally and exposes an OpenAI-compatible API. No API keys, no cloud — everything stays on your machine.
319
319
320
320
1. Download and install [LM Studio](https://lmstudio.ai/).
321
-
2. In LM Studio, download a model (e.g., Qwen 3.5, Llama 4, Gemma 3, Phi-4) and start the local server. Note the model name shown in the server tab.
322
-
3. Edit `~/.config/localgpt/config.toml`:
321
+
2. In LM Studio, download a model (e.g., Qwen 3.5, Llama 4, Gemma 3, Phi-4) and start the local server.
322
+
3. Find the exact model ID — run `curl http://127.0.0.1:1234/v1/models` and look for the `"id"` field (e.g., `qwen/qwen3.5-35b-a3b`). The model name must match exactly.
323
+
4. Edit `~/.config/localgpt/config.toml`:
323
324
324
325
```toml
325
326
[agent]
326
-
default_model = "openai/qwen3.5-35b-a3b"# use the model name from LM Studio
327
+
default_model = "openai/qwen/qwen3.5-35b-a3b"# use the exact model ID from step 3
328
+
context_window = 32768# match the context length set in LM Studio
327
329
328
330
[providers.openai]
329
331
api_key = "lm-studio"
330
332
base_url = "http://127.0.0.1:1234/v1"
331
333
```
332
334
333
-
4. Run `localgpt chat` — all requests go to LM Studio on your machine.
335
+
5. Run `localgpt chat` — all requests go to LM Studio on your machine.
334
336
335
-
LM Studio's default port is `1234`. If you changed it, update the `base_url` accordingly. Tool calling works with models that support it (e.g., Qwen 3.5, Llama 4).
337
+
LM Studio's default port is `1234`. If you changed it, update the `base_url` accordingly. Set the context length in LM Studio's model load settings to match `context_window`. Tool calling works with models that support it (e.g., Qwen 3.5, Llama 4).
0 commit comments