You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* Adding a tip to use Python 3.12
* pip install beautifulsoup4 added as a general setup instruction
* Updating images, instructions (to align with Lemonade v10.2.0), and model used
* Pip installing in a single line
Copy file name to clipboardExpand all lines: playbooks/supplemental/open-webui-chat/README.md
+53-47Lines changed: 53 additions & 47 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -55,7 +55,7 @@ Open WebUI is built around standard OpenAI-style endpoints, like:
55
55
- Image generation: `/images/generations`
56
56
- Audio transcription: `/audio/transcriptions`
57
57
58
-
Lemonade exposes these under `http://localhost:8000/api/v1/...`
58
+
Lemonade exposes these under `http://localhost:13305/api/v1/...`
59
59
60
60
If a backend supports those endpoints, Open WebUI can talk to it with minimal setup. That’s why we can switch backends without changing our workflow.
61
61
@@ -73,23 +73,23 @@ This section establishes a stable local environment: Lemonade running, Open WebU
73
73
<!-- @os:linux -->
74
74
- Install Lemonade (App + Server) by following the Linux distribution-specific package manager instructions on the [official documentation page](https://lemonade-server.ai/install_options.html).
75
75
<!-- @os:end -->
76
-
- Start the Lemonade server:
77
-
- Open Powershell
78
-
- Run the command: `lemonade-server serve`
79
-
- Verify server status:
80
-
-In the same Powershell terminal, run: `lemonade-server status`
81
-
-Expect to see `Server is runningon port 8000`
76
+
After installation:
77
+
- The lemonade CLI is added to your system PATH automatically
78
+
- Lemonade server is expected to run in the background automatically
79
+
- Verify Lemonade installation and server status:
80
+
-Open a terminal and run: `lemonade --version`. You should see something like `lemonade version x.y.z`.
81
+
-In the same terminal, run `lemonade status`. You should see the output showing that the server is running, typically on port `13305`.
82
82
- Open the Lemonade Server app and download required models from the `Model Manager` tab
83
83
84
84
<palign="center">
85
85
<imgsrc="assets/lemonade_model_manager.png"alt="Lemonade Server App"width="600"/>
86
86
</p>
87
87
88
88
- Confirm the API is reachable:
89
-
- Open `http://localhost:8000/api/v1/models` in your web browser.
89
+
- Open `http://localhost:13305/api/v1/models` in your web browser.
90
90
- You should see a JSON list of models downloaded in Lemonade
91
91
92
-
> If you don’t see your models in `http://localhost:8000/api/v1/models`, Open WebUI won’t be able to select them later.
92
+
> If you don’t see your models in `http://localhost:13305/api/v1/models`, Open WebUI won’t be able to select them later.
@@ -448,6 +446,7 @@ echo "OK: open-webui installed in venv"
448
446
<!-- @test:end -->
449
447
<!-- @os:end -->
450
448
449
+
> **Tip (Python version):** Install Open WebUI using **Python 3.12**. The `open-webui` PyPI package may not install on Python 3.13+ (you’ll see “No matching distribution found”).
451
450
> Note: Open WebUI also provides a variety of other installation options, such as Docker, on their GitHub.
452
451
453
452
### 3) Start Open WebUI Server
@@ -559,35 +558,39 @@ echo "OK: Open WebUI is responding on /health"
559
558
560
559
In Open WebUI:
561
560
562
-
1. Go to **Admin Settings → Connections**
561
+
1. Go to **Admin Settings → Connections** (http://localhost:8080/admin/settings/connections):
<imgsrc="assets/connection_settings.png"alt="Navigating to the connection settings"width="69%"/>
569
566
</p>
570
567
571
568
2. Under **OpenAI API**, add a new connection:
572
-
-**Base URL:**`http://localhost:8000/api/v1`
569
+
-**Base URL:**`http://localhost:13305/api/v1`
573
570
-**API Key:**`-` (a single dash works for local)
574
571
<palign="center">
575
-
<imgsrc="assets/connection_form.png"alt="Connection details for Lemonade server"width="500"/>
572
+
<imgsrc="assets/connection_form.png"alt="Connection details for Lemonade server"width="400"/>
573
+
</p>
574
+
575
+
3. In http://localhost:8080/admin/settings/connections, esnure that under __"Manage OpenAI API Connections"__, only `http://localhost:13305/api/v1` is enabled.
576
+
<palign="center">
577
+
<imgsrc="assets/connection.png"alt="Admin settings connections page showing 'Manage OpenAI API Connections' with only http://localhost:13305/api/v1 enabled."width="600"/>
576
578
</p>
577
579
578
-
3. Save
579
-
4. Apply the following suggested settings. These help Open WebUI to be more responsive with local LLMs.
580
+
4. Save
581
+
582
+
5. Apply the following suggested settings. These help Open WebUI to be more responsive with local LLMs.
580
583
- Click the user profile button again, and choose "Admin Settings".
581
584
- Click the "Settings" tab at the top, then "Interface" (which will be on the top or the left, depending on your window size), then disable the following:
5. Click the **"Save"** button in the bottom right of the page, then return to `http://localhost:8080`.
590
-
6. Click the model dropdown and expect to see all the models that you have downloaded from Lemonade!
592
+
6. Click the **"Save"** button in the bottom right of the page, then return to `http://localhost:8080`.
593
+
7. Click the model dropdown and expect to see all the models that you have downloaded from Lemonade!
591
594
592
595
---
593
596
@@ -599,22 +602,22 @@ Now you’re all set up. Let's look at three interesting things to do.
599
602
600
603
### Activity 1: Chat with a Local LLM
601
604
602
-
1. Click the dropdown menu in the top-left of the interface. This will display all of the Lemonade models you have installed. Select one to proceed. (example: `Llama-3.2-1B-Instruct-Hybrid`).
605
+
1. Click the dropdown menu in the top-left of the interface. This will display all of the Lemonade models you have installed. Select one to proceed. (example: `Qwen3-4B-Hybrid`).
2. Enter a message to the LLM and click send (or hit Enter). The LLM will take a few seconds to load into memory and then you will see the response stream in.
608
611
<palign="center">
609
-
<imgsrc="assets/sending_a_message.png"alt="Sending a message"width="53%"/>
4. At this time, open `Task Manager` on your system. You will see **high GPU/NPU utilization** based on whether the model you selected is **Hybrid** or **NPU** respectively. That clearly shows you’re running locally.
4. The model answers based on the image content, not generic text.
@@ -649,38 +652,40 @@ Stable Diffusion models don't support text generation, they only generate images
649
652
650
653
#### Step 1: Configure Image Generation in Open WebUI
651
654
652
-
1. Go to **Admin Settings → Images**
655
+
1. Go to **Admin Settings → Images** (http://localhost:8080/admin/settings/images)
653
656
2. Set:
654
657
-**Image Generation:** ON
655
658
-**Image Generation Engine:** Default (OpenAI)
656
-
-**OpenAI API Base URL:**`http://localhost:8000/api/v1`
659
+
-**OpenAI API Base URL:**`http://localhost:13305/api/v1`
657
660
-**OpenAI API Key:**`-`
658
-
-**Model:**`SD-Turbo` (fast) or `SDXL-Base-1.0` (higher quality)
661
+
-**Model:**`SDXL-Turbo` (fast) or `SDXL-Base-1.0` (higher quality)
659
662
3. If you want to add more parameters, add them to the text field as JSON. For example: `{ "steps": 4, "cfg_scale": 1 }`. See available parameters at [Image Generation (Stable Diffusion CPP)](https://lemonade-server.ai/models.html).
#### Step 3: Generate an image from the chat screen
676
680
677
681
1. Go back to chat at `http://localhost:8080`.
678
-
2. Select a **Text Generation LLM** in the model dropdown (example: DeepSeek, CodeLlama). **Do not select a Stable Diffusion model** as this is a chat model selector.
682
+
2. Select a **Text Generation LLM** in the model dropdown (example: Qwen, Llama).**Do not select a Stable Diffusion model** as this is a chat model selector.
679
683
3. In the message area, toggle **Image** ON.
680
684
4. Use a prompt like: `A cinematic photo of heavy traffic at sunset, ultra detailed`.
0 commit comments