You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Split the skill guidance into two use-cases: speech and image.
This fixes the issue of agent setting up both flows when only one was requested by the user
> **Do ONLY the group the user asked for.** If the user wants to generate an
33
+
> image, set up `image` only — do **not** pull the speech models. The setup
34
+
> command takes the group as an argument (`image`, `speech`, or `all`), and the
35
+
> rule installed into `AGENTS.md` contains only the group(s) you set up.
36
+
37
+
For each group you set up, the skill does two things:
25
38
26
39
1.**Verifies that local Lemonade is reachable and has the right models.**
27
40
2.**Drops a `Local AI Use` block into the workspace `AGENTS.md`** so the agent
@@ -48,32 +61,36 @@ instead.
48
61
missing, install from <https://lemonade-server.ai/install_options.html>
49
62
before continuing. Do not silently install on the user's machine; that is a
50
63
system-wide change and must be the user's call.
51
-
-**Disk:**~8 GB free for the three default models (SD-Turbo + Whisper-Tiny
52
-
+kokoro-v1).
64
+
-**Disk:**~5 GB for `image`(SD-Turbo); ~0.4 GB for `speech`
65
+
(kokoro-v1 + Whisper-Tiny). Only the group(s) you set up are downloaded.
53
66
-**Network:** required for the first `lemonade pull` of each model. After
54
67
that, every modality runs offline.
55
68
56
69
## The opinionated path
57
70
58
-
Run this checklist top to bottom. Track progress against it; do not move on
59
-
until each step verifies.
71
+
Run this checklist top to bottom for the group(s) the user needs. Track progress
72
+
against it; do not move on until each step verifies.
60
73
61
74
```
62
75
[ ] 1. Confirm Lemonade Server is installed and reachable
63
-
[ ] 2. Pull the three default modality models
76
+
[ ] 2. Pull the selected group's default models
64
77
[ ] 3. Install the routing rule into the workspace AGENTS.md
65
-
[ ] 4. Smoke-test image, TTS, and STT against the local endpoint
78
+
[ ] 4. Smoke-test the selected group's endpoints
66
79
```
67
80
68
-
The single command that does steps 1, 2, and 3 in one shot is:
81
+
The single command that does steps 1, 2, and 3 in one shot, scoped to a group:
69
82
70
83
```bash
71
-
python scripts/setup_local_ai.py
84
+
python scripts/setup_local_ai.py image # image only
85
+
python scripts/setup_local_ai.py speech # TTS + STT only
86
+
python scripts/setup_local_ai.py all # both (only if the user wants both)
72
87
```
73
88
74
-
(Run from this skill's folder.) The script is idempotent: re-running it on a
75
-
fully configured workspace is a no-op apart from a healthcheck. Read the
76
-
sections below for what to do when each step fails.
89
+
(Run from this skill's folder.) The script pulls only the selected group's
90
+
models and writes only that group's rule section. It is idempotent: re-running
91
+
with the same group is a no-op apart from a healthcheck. To add a group later,
92
+
re-run with the full set you want (e.g. `all`). Read the sections below for what
93
+
to do when each step fails.
77
94
78
95
---
79
96
@@ -101,33 +118,36 @@ and no API key is required (the system-wide server defaults to no auth on
101
118
loopback). If the user has set `LEMONADE_API_KEY`, the routing rule template
102
119
in `templates/local-ai-rule.md` shows where to add the `Authorization` header.
103
120
104
-
## Step 2: pull the three default modality models
121
+
## Step 2: pull the selected group's default models
105
122
106
-
Pull these three. They are the **Lite Collection** defaults from Lemonade
107
-
OmniRouter, sized to keep token-and-cost savings real on commodity hardware:
123
+
Pull only the models for the group(s) you are setting up. They are the
124
+
**Lite Collection** defaults from Lemonade OmniRouter, sized to keep
125
+
token-and-cost savings real on commodity hardware:
108
126
109
-
| Modality | Model | Size | Why this default |
110
-
|---|---|---|---|
111
-
| Image generation |`SD-Turbo`|~5 GB | Single-step generation, runs on CPU and AMD iGPU/dGPU |
112
-
| Text-to-speech |`kokoro-v1`|~0.3 GB | Only TTS model Lemonade currently supports; CPU-only, low latency |
113
-
| Speech-to-text |`Whisper-Tiny`|~0.1 GB | Smallest Whisper; fast on CPU. Upgrade to `Whisper-Large-v3-Turbo` if accuracy matters more than latency. |
127
+
|Group |Modality | Model | Size | Why this default |
128
+
|---|---|---|---|---|
129
+
|`image`|Image generation |`SD-Turbo`|~5 GB | Single-step generation, runs on CPU and AMD iGPU/dGPU |
130
+
|`speech`|Text-to-speech |`kokoro-v1`|~0.3 GB | Only TTS model Lemonade currently supports; CPU-only, low latency |
131
+
|`speech`|Speech-to-text |`Whisper-Tiny`|~0.1 GB | Smallest Whisper; fast on CPU. Upgrade to `Whisper-Large-v3-Turbo` if accuracy matters more than latency. |
114
132
115
133
```bash
134
+
# image group
116
135
lemonade pull SD-Turbo
136
+
# speech group
117
137
lemonade pull kokoro-v1
118
138
lemonade pull Whisper-Tiny
119
139
```
120
140
121
141
To choose a different model while installing the rule, pass it to the setup
122
-
script. For example, to make future image requests use SDXL:
142
+
script alongside the group. For example, to make future image requests use SDXL:
0 commit comments