You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: playbooks/core/lmstudio-rocm-llms/README.md
+2-10Lines changed: 2 additions & 10 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -18,21 +18,13 @@ LM Studio is a powerful GUI-based wrapper for [llama.cpp](https://github.com/ggm
18
18
<!-- @setup:memory-config -->
19
19
20
20
21
-
## Downloading models
21
+
## Downloading Models
22
22
23
-
Your LM Studio instance in the STX Halo™ comes with OpenAI GPT-OSS 120B pre-downloaded.
24
-
25
-
26
-
Should you wish to download additional models - you can do so by pressing "Ctrl" + "Shift" + "M" on your keyboard or clicking on the "Discover" tab (Magnifying Glass) and searching for additional models.
27
-
28
-
LM Studio will automatically download and put the model in the correct directory.
23
+
<!-- @require:lmstudio-models-gpt-oss-120b -->
29
24
30
25
## Chatting with an LLM
31
26
Learn how to start chatting with a ChatGPT-grade LLM completely locally.
32
27
33
-
34
-
> This example uses OpenAI GPT-OSS 120B in its original MXFP4 precision. Please make sure to have completed the [System Setup](#system-setup) section above for optimal performance.
35
-
36
28
1. Press "Ctrl" + "L" or select the central drop down menu at the top. Select "manually chose model load parameters" and click on "OpenAI GPT-OSS 120B"
37
29
2. Make sure "show advanced settings" is checked.
38
30
3. Change context size to "128,000". Make sure "Flash Attention" is On and "GPU offload layers" is set to maximum.
0 commit comments