Lemonade HA — Local Voice Assist for Home Assistant

Transparency notice: This integration was entirely vibe coded with GitHub Copilot. Use it accordingly.

Run fully local voice assistants in Home Assistant using Lemonade as the AI backend.

Capability	Backend	Models
Speech-to-Text	Whisper via whisper.cpp or FastFlowLM	Whisper Tiny → Large-v3-Turbo
Conversation / LLM	LLama.cpp (Vulkan/ROCm/CPU), FastFlowLM (NPU), RyzenAI (NPU)	Qwen 3, Llama 3.2, Phi-4, Mistral 7B, and 100+ more
Text-to-Speech	Kokoro TTS	kokoro-v1 (multiple voices)

Prerequisites

Lemonade server installed and running on your machine. Download from https://github.com/lemonade-sdk/lemonade/releases.
If running Home Assistant OS or Supervised, Lemonade should run on the host (not inside a container). Set the host to homeassistant.local or the host's LAN IP.

Installation

Method A — HACS (recommended)

The integration can be installed and updated directly through HACS, with no add-on required.

In HACS, click ⋮ → Custom repositories, paste https://github.com/rwfsmith/Lemonade-HA, choose category Integration, and click Add.
Search for Lemonade HA and click Download.
Restart Home Assistant so HA loads the custom component.
Proceed to Add the integration below.

To update: HACS will notify you when a new version is available — click Update and restart HA.

Method B — Add-on (auto-installs the integration)

The add-on bundles the integration and copies it into your HA config directory on startup. Use this method if you prefer not to use HACS.

Settings → Add-ons → Add-on Store → ⋮ → Repositories, add:
```
https://github.com/rwfsmith/Lemonade-HA
```
Find Lemonade HA, click Install, then Start.
Restart Home Assistant once so HA loads the custom component.

Tip: If you later switch to HACS-managed updates, set copy_integration: false in the add-on Configuration tab. The add-on will no longer overwrite the component on restart, so HACS controls the integration version independently.

Add the integration

Settings → Devices & Services → Add Integration → search "Lemonade HA"

Enter the host and port of your Lemonade server (default localhost:8000). HA will verify the connection before proceeding.
Choose your STT, LLM, and TTS models.
Click Submit. The integration registers:
- Lemonade Whisper — Speech-to-Text engine
- Lemonade Kokoro — Text-to-Speech engine
- Lemonade LLM — Conversation agent

Create a Voice Assistant

Settings → Voice Assistants → Add Assistant

Field	Value
Conversation agent	Lemonade LLM
Speech-to-text	Lemonade Whisper
Text-to-speech	Lemonade Kokoro

Add-on Configuration Options

Speech-to-Text

Option	Default	Description
STT Model	`Whisper-Large-v3-Turbo`	Whisper model size. Larger = more accurate, slower.
STT Backend	`auto`	`whispercpp:cpu`, `whispercpp:npu`, `whispercpp:vulkan`, or `flm:npu`
STT Language	`en`	ISO 639-1 code or `auto` for language detection
STT Beam Size	`0`	Beam search width (`0` = auto)

Conversation / LLM

Option	Default	Description
LLM Backend	`llamacpp`	`llamacpp`, `ryzenai`, or `flm`
LLM Model	`Qwen3-4B-GGUF`	Model name for the selected backend
LLM Context Size	`4096`	Max context window in tokens
LLM Max Tokens	`256`	Max tokens per response
LLM System Prompt	(built-in)	Customise the assistant personality

Text-to-Speech

Option	Default	Description
TTS Model	`kokoro-v1`	Kokoro ONNX model
TTS Voice	`af_heart`	Voice ID (`af_` = American Female, `am_` = American Male, `bf_` = British Female, `bm_` = British Male)

Connection

Option	Default	Description
Lemonade Host	`localhost`	Hostname or IP of the Lemonade server
Lemonade Port	`8000`	HTTP port of the Lemonade server
Auto-start Lemonade	`false`	Try to launch `lemond` if not already running

Supported Hardware

Hardware	Recommended Backend
AMD Ryzen AI (NPU)	`ryzenai` (LLM), `flm:npu` (STT)
AMD GPU (Radeon)	`llamacpp` + `vulkan` sub-backend
CPU only	`llamacpp` + `cpu` sub-backend
Apple Silicon	`llamacpp` + `metal` sub-backend

Troubleshooting

Add-on starts but integration doesn't appear after HA restart → Check that the add-on started successfully and that /config/custom_components/lemonade_wyoming/ exists. Restart HA again if needed.

"Cannot connect" error when adding integration → Verify Lemonade is running (curl http://localhost:8000/api/v1/health). If HA OS, use the host LAN IP instead of localhost.

Models not downloading → Check the add-on log (Add-ons → Lemonade HA → Log). Model downloads can take several minutes.

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
custom_components/lemonade_ha		custom_components/lemonade_ha
rootfs/etc/s6-overlay/s6-rc.d		rootfs/etc/s6-overlay/s6-rc.d
translations		translations
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
Dockerfile		Dockerfile
README.md		README.md
config.yaml		config.yaml
hacs.json		hacs.json
repository.yaml		repository.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Lemonade HA — Local Voice Assist for Home Assistant

Prerequisites

Installation

Method A — HACS (recommended)

Method B — Add-on (auto-installs the integration)

Add the integration

Create a Voice Assistant

Add-on Configuration Options

Speech-to-Text

Conversation / LLM

Text-to-Speech

Connection

Supported Hardware

Troubleshooting

Links

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Lemonade HA — Local Voice Assist for Home Assistant

Prerequisites

Installation

Method A — HACS (recommended)

Method B — Add-on (auto-installs the integration)

Add the integration

Create a Voice Assistant

Add-on Configuration Options

Speech-to-Text

Conversation / LLM

Text-to-Speech

Connection

Supported Hardware

Troubleshooting

Links

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages