Commit 4ba4a37

Authored by yasinBursali and claude
docs: add READMEs for all undocumented extension services (#64)
14 of 17 extension services had no README. This adds documentation for every undocumented service following the existing privacy-shield README template: overview, features, configuration tables, API endpoints, architecture diagrams, troubleshooting, and file listings. All content sourced from manifest.yaml, compose files, .env.example, and source code — cross-referenced and audited for accuracy.

Services documented: llama-server, open-webui, dashboard-api, whisper, tts, n8n, litellm, qdrant, embeddings, searxng, perplexica, comfyui, openclaw, opencode.

Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
1 parent daddd64 · 14 files changed · 1667 additions, 0 deletions
# ComfyUI

Node-based image generation UI and backend for Dream Server

## Overview

ComfyUI provides a powerful, node-based interface for running Stable Diffusion and FLUX.1 image generation models locally. It exposes both a visual workflow editor in the browser and a REST API, enabling programmatic image generation from other services. ComfyUI requires a GPU (NVIDIA or AMD) and is not available on CPU-only systems.
## Features

- **Node-based workflow editor**: Build and share custom generation pipelines visually
- **FLUX.1 support**: Configured for FLUX.1 image generation out of the box
- **Multiple model types**: Supports checkpoints, LoRAs, VAEs, text encoders, and diffusion models
- **Persistent model storage**: Models are stored in `./data/comfyui/models` and survive container rebuilds
- **Workflow templates**: Pre-loaded workflow JSON files from `./data/comfyui/workflows`
- **REST API**: Programmatic image generation via HTTP
- **NVIDIA and AMD GPU support**: Separate optimized images for each GPU vendor
## GPU Requirements

ComfyUI is GPU-only. The service definition is split by GPU vendor:

| Backend | Compose file | Notes |
|---------|-------------|-------|
| NVIDIA (CUDA) | `compose.nvidia.yaml` | Requires NVIDIA Container Toolkit |
| AMD (ROCm) | `compose.amd.yaml` | Targets gfx1151 (RX 9000 series); uses ROCm with flash attention |

> **Apple Silicon:** ComfyUI is not currently configured for Apple Silicon (macOS ARM). Use the native ComfyUI application instead.
## Configuration

Environment variables (set in `.env`):

| Variable | Default | Description |
|----------|---------|-------------|
| `COMFYUI_PORT` | `8188` | External port for the ComfyUI web UI and API |
## Volume Mounts

### NVIDIA

| Host Path | Container Path | Purpose |
|-----------|---------------|---------|
| `./data/comfyui/models` | `/models` | AI model files (checkpoints, LoRAs, VAEs, etc.) |
| `./data/comfyui/output` | `/output` | Generated images output directory |
| `./data/comfyui/input` | `/input` | Input images for img2img and inpainting |
| `./data/comfyui/workflows` | `/workflows` | Workflow JSON templates (read-only) |

### AMD

| Host Path | Container Path | Purpose |
|-----------|---------------|---------|
| `./data/comfyui/ComfyUI` | `/opt/ComfyUI` | Full ComfyUI installation (models, outputs, custom nodes) |

> **Note:** The AMD image uses a single volume containing the entire ComfyUI directory. Models go inside `./data/comfyui/ComfyUI/models/` rather than a separate `./data/comfyui/models/` mount.
### Model Subdirectories (NVIDIA)

Place model files in the appropriate subdirectory under `./data/comfyui/models/`:

| Subdirectory | Model type |
|-------------|------------|
| `checkpoints/` | Full Stable Diffusion / FLUX checkpoints |
| `diffusion_models/` | Standalone diffusion model weights |
| `text_encoders/` | CLIP and T5 text encoders |
| `vae/` | Variational Autoencoders |
| `loras/` | LoRA fine-tuned weights |
| `latent_upscale_models/` | Latent upscale models |
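The layout above can be bootstrapped with a short script. This is a sketch only — the `create_model_dirs` helper is illustrative and not part of the repo:

```python
#!/usr/bin/env python3
"""Create the model directory layout expected by the NVIDIA mount."""
from pathlib import Path

# Subdirectories from the table above
SUBDIRS = [
    "checkpoints",
    "diffusion_models",
    "text_encoders",
    "vae",
    "loras",
    "latent_upscale_models",
]

def create_model_dirs(base: str = "./data/comfyui/models") -> list[Path]:
    """Create each model subdirectory (idempotent) and return the paths."""
    created = []
    for name in SUBDIRS:
        path = Path(base) / name
        path.mkdir(parents=True, exist_ok=True)
        created.append(path)
    return created

if __name__ == "__main__":
    for p in create_model_dirs():
        print(p)
```

Run it once from the Dream Server root before dropping model files in.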
## Architecture

```
┌──────────┐  HTTP :8188    ┌──────────────┐
│ Browser  │───────────────▶│   ComfyUI    │
│ (Node UI)│◀───────────────│  (PyTorch)   │
└──────────┘                └──────┬───────┘
                                   │
                    ┌──────────────┼──────────────┐
                    ▼              ▼              ▼
                /models/       /output/       /input/
             (checkpoints,    (generated     (source
              LoRAs, VAEs)     images)        images)
```
## API Endpoints

| Endpoint | Method | Description |
|----------|--------|-------------|
| `/` | GET | Web UI / health check |
| `/prompt` | POST | Queue a generation workflow |
| `/queue` | GET | View current generation queue |
| `/history` | GET | View completed generation history |
| `/view` | GET | Retrieve a generated image by filename |
| `/system_stats` | GET | GPU memory and system resource stats |
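Queuing a generation via `POST /prompt` takes a JSON body whose `prompt` field is a node graph keyed by node id. A minimal sketch of building that payload — the single loader node, checkpoint filename, and client id are illustrative placeholders, not a complete workflow:

```python
"""Build a minimal payload for ComfyUI's POST /prompt endpoint."""
import json

def build_prompt_payload(ckpt_name: str, client_id: str = "dream-server") -> dict:
    # A workflow is a node graph keyed by node id; real workflows chain
    # samplers, text encoders, and image-save nodes after the loader.
    graph = {
        "1": {
            "class_type": "CheckpointLoaderSimple",
            "inputs": {"ckpt_name": ckpt_name},
        },
    }
    return {"prompt": graph, "client_id": client_id}

payload = build_prompt_payload("sd_xl_base_1.0.safetensors")
body = json.dumps(payload)
# POST this body to http://localhost:8188/prompt with
# Content-Type: application/json; the response contains a prompt_id
# you can then look up via GET /history.
print(body)
```

In practice you would export a working graph from the UI (API format) and substitute it for the hand-built `graph` above.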
## Files

- `manifest.yaml` — Service metadata (port, health endpoint, GPU backends, features)
- `compose.yaml` — Base stub (actual definition is in GPU overlays)
- `compose.nvidia.yaml` — NVIDIA CUDA service definition
- `compose.amd.yaml` — AMD ROCm service definition (gfx1151)
- `startup.sh` — Entrypoint script: sets up model symlinks and launches the ComfyUI server
- `Dockerfile` — Container build definition (used by the NVIDIA overlay)
## Troubleshooting

**ComfyUI not starting (long start period):**

The container has a 120-second start period to allow model loading. Wait for it to elapse, then check:

```bash
docker compose ps dream-comfyui
docker compose logs dream-comfyui --follow
```

**GPU not detected:**

For NVIDIA:

```bash
# Verify the NVIDIA Container Toolkit is installed; with the toolkit,
# --gpus all injects the driver's nvidia-smi into the container
docker run --rm --gpus all ubuntu nvidia-smi
```

For AMD:

```bash
# Verify GPU device access
ls /dev/dri /dev/kfd
```

**Models not appearing in the UI:**

- Ensure model files are placed in the correct subdirectory under `./data/comfyui/models/`
- Restart ComfyUI or click **Refresh** in the model loader node

**Out of VRAM errors:**

- Use smaller or quantized model variants
- Close other GPU-intensive services before running ComfyUI
- Check VRAM usage: `nvidia-smi` (NVIDIA) or `rocm-smi` (AMD)

**Generated images not saving:**

- Verify `./data/comfyui/output` exists and is writable
- Check container logs for permission errors
---
# dashboard-api

FastAPI backend providing system status, metrics, and management for Dream Server

## Overview

The Dashboard API is a Python FastAPI service that powers the Dream Server Dashboard UI. It exposes endpoints for GPU metrics, service health monitoring, LLM inference stats, workflow management, agent monitoring, the setup wizard, version checking, and Privacy Shield control.

It runs at `http://localhost:3002` and is the single backend used by the React dashboard frontend.
## Features

- **GPU monitoring**: Real-time VRAM usage, temperature, utilization, and power draw (NVIDIA + AMD)
- **Service health**: Health checks for all Dream Server services via the Docker network
- **LLM metrics**: Tokens/second, lifetime tokens, loaded model, context size
- **System metrics**: CPU usage, RAM usage, uptime, disk space
- **Workflow management**: n8n workflow catalog — install, enable, disable, track executions
- **Feature discovery**: Hardware-aware feature recommendations with VRAM tier detection
- **Setup wizard**: First-run setup, persona selection, diagnostic tests
- **Agent monitoring**: Session counts, throughput, cluster status, per-model token usage
- **Privacy Shield control**: Enable/disable the container, fetch PII scrubbing statistics
- **Version checking**: GitHub releases integration for update notifications
- **Storage reporting**: Breakdown of disk usage by models, vector DB, and total data
## Configuration

Environment variables (set in `.env`):

| Variable | Default | Description |
|----------|---------|-------------|
| `DASHBOARD_API_PORT` | `3002` | External + internal port |
| `DASHBOARD_API_KEY` | *(auto-generated)* | API key for all protected endpoints. If unset, a random key is generated and written to `/data/dashboard-api-key.txt` |
| `GPU_BACKEND` | `nvidia` | GPU backend: `nvidia` or `amd` |
| `OLLAMA_URL` | `http://llama-server:8080` | LLM backend URL |
| `LLM_MODEL` | `qwen3:30b-a3b` | Active model name shown in dashboard |
| `KOKORO_URL` | `http://tts:8880` | Kokoro TTS URL |
| `N8N_URL` | `http://n8n:5678` | n8n workflow URL |
| `OPENCLAW_TOKEN` | *(empty)* | OpenClaw agent auth token |
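A minimal `.env` fragment matching the table above. Values shown are the documented defaults; `change-me` is a placeholder, and any variable you omit falls back to its default:

```bash
# Dashboard API settings
DASHBOARD_API_PORT=3002
# Leave DASHBOARD_API_KEY unset to auto-generate a key
DASHBOARD_API_KEY=change-me
# GPU backend: nvidia or amd
GPU_BACKEND=nvidia
OLLAMA_URL=http://llama-server:8080
LLM_MODEL=qwen3:30b-a3b
KOKORO_URL=http://tts:8880
N8N_URL=http://n8n:5678
```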
## API Endpoints

### Core

| Method | Path | Auth | Description |
|--------|------|------|-------------|
| `GET` | `/health` | No | Health check |
| `GET` | `/gpu` | Yes | GPU metrics (VRAM, temp, utilization) |
| `GET` | `/services` | Yes | All service health statuses |
| `GET` | `/disk` | Yes | Disk usage |
| `GET` | `/model` | Yes | Current model info |
| `GET` | `/bootstrap` | Yes | Model bootstrap/download status |
| `GET` | `/status` | Yes | Full system status (all of the above combined) |
| `GET` | `/api/status` | Yes | Dashboard-formatted status with inference metrics |

### Preflight

| Method | Path | Auth | Description |
|--------|------|------|-------------|
| `GET` | `/api/preflight/docker` | Yes | Check Docker availability |
| `GET` | `/api/preflight/gpu` | Yes | Check GPU availability |
| `GET` | `/api/preflight/required-ports` | No | List service ports |
| `POST` | `/api/preflight/ports` | Yes | Check for port availability conflicts |
| `GET` | `/api/preflight/disk` | Yes | Check available disk space |

### Settings & Storage

| Method | Path | Auth | Description |
|--------|------|------|-------------|
| `GET` | `/api/service-tokens` | Yes | Service auth tokens (e.g. OpenClaw) |
| `GET` | `/api/external-links` | Yes | Sidebar links from service manifests |
| `GET` | `/api/storage` | Yes | Storage breakdown (models, vector DB, total) |

### Workflows (n8n integration)

| Method | Path | Auth | Description |
|--------|------|------|-------------|
| `GET` | `/api/workflows` | Yes | Workflow catalog with install status |
| `POST` | `/api/workflows/{id}/enable` | Yes | Import and activate a workflow in n8n |
| `DELETE` | `/api/workflows/{id}` | Yes | Remove a workflow from n8n |
| `GET` | `/api/workflows/{id}/executions` | Yes | Recent execution history |
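Enabling a workflow from the table above is a single authenticated POST. A stdlib-only sketch that prepares the request — the `enable_workflow_request` helper, workflow id, and key are illustrative placeholders:

```python
"""Prepare an authenticated request to enable an n8n workflow."""
import urllib.request

BASE_URL = "http://localhost:3002"

def enable_workflow_request(workflow_id: str, api_key: str) -> urllib.request.Request:
    # Build (but do not send) the POST; pass the result to
    # urllib.request.urlopen() against a running dashboard-api
    # to actually import and activate the workflow.
    return urllib.request.Request(
        url=f"{BASE_URL}/api/workflows/{workflow_id}/enable",
        method="POST",
        headers={"Authorization": f"Bearer {api_key}"},
    )

req = enable_workflow_request("daily-digest", "YOUR_KEY")
print(req.get_method(), req.full_url)
```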
### Features

| Method | Path | Auth | Description |
|--------|------|------|-------------|
| `GET` | `/api/features` | Yes | Feature status with hardware recommendations |
| `GET` | `/api/features/{id}/enable` | Yes | Enable instructions for a feature |

### Setup Wizard

| Method | Path | Auth | Description |
|--------|------|------|-------------|
| `GET` | `/api/setup/status` | Yes | First-run check and current step |
| `GET` | `/api/setup/personas` | Yes | List available personas |
| `GET` | `/api/setup/persona/{id}` | Yes | Get persona details |
| `POST` | `/api/setup/persona` | Yes | Select a persona |
| `POST` | `/api/setup/complete` | Yes | Mark setup complete |
| `POST` | `/api/setup/test` | Yes | Run diagnostic tests (streaming) |
| `POST` | `/api/chat` | Yes | Quick chat for setup wizard |

### Agent Monitoring

| Method | Path | Auth | Description |
|--------|------|------|-------------|
| `GET` | `/api/agents/metrics` | Yes | Full agent metrics (sessions, tokens, cost) |
| `GET` | `/api/agents/metrics.html` | Yes | Agent metrics as HTML fragment (htmx) |
| `GET` | `/api/agents/cluster` | Yes | Cluster health and GPU node status |
| `GET` | `/api/agents/throughput` | Yes | Throughput stats (tokens/sec) |

### Privacy Shield

| Method | Path | Auth | Description |
|--------|------|------|-------------|
| `GET` | `/api/privacy-shield/status` | Yes | Privacy Shield container status |
| `POST` | `/api/privacy-shield/toggle` | Yes | Start or stop Privacy Shield |
| `GET` | `/api/privacy-shield/stats` | Yes | PII scrubbing statistics |

### Updates

| Method | Path | Auth | Description |
|--------|------|------|-------------|
| `GET` | `/api/version` | Yes | Current version + GitHub update check |
| `GET` | `/api/releases/manifest` | No | Recent release history from GitHub |
| `POST` | `/api/update` | Yes | Trigger update actions (`check`, `backup`, `update`) |
## Authentication

When `DASHBOARD_API_KEY` is set in `.env`, all authenticated endpoints require the key:

```bash
curl http://localhost:3002/api/status \
  -H "Authorization: Bearer YOUR_KEY"
```

When `DASHBOARD_API_KEY` is empty (default), all endpoints are accessible without authentication.
## Architecture

```
Dashboard UI (:3001)
        │ HTTP
        ▼
Dashboard API (:3002)
├── gpu.py ───────────────── nvidia-smi / sysfs (AMD)
├── helpers.py ───────────── Docker-network health checks
├── agent_monitor.py ─────── Background metrics collection
└── routers/
    ├── workflows.py ─────── n8n API integration
    ├── features.py ──────── Hardware-aware feature discovery
    ├── setup.py ─────────── Setup wizard + persona system
    ├── updates.py ───────── GitHub releases + dream-update.sh
    ├── agents.py ────────── Agent session + throughput metrics
    └── privacy.py ───────── Privacy Shield container control
```
## Files

- `main.py` — FastAPI application, core endpoints, startup
- `config.py` — Shared configuration and manifest loading
- `models.py` — Pydantic response schemas
- `security.py` — API key authentication
- `gpu.py` — GPU detection for NVIDIA and AMD
- `helpers.py` — Service health checks, LLM metrics, system metrics
- `agent_monitor.py` — Background agent metrics collection
- `routers/` — Endpoint modules (workflows, features, setup, updates, agents, privacy)
- `Dockerfile` — Container definition
- `requirements.txt` — Python dependencies
## Troubleshooting

**API not responding:**

```bash
docker compose ps dashboard-api
docker compose logs dashboard-api
```

**GPU metrics missing:**

- NVIDIA: confirm `nvidia-smi` works on the host
- AMD: the AMD overlay mounts `/sys/class/drm` — confirm `GPU_BACKEND=amd` in `.env`

**Workflow operations failing:**

- Verify n8n is running: `curl http://localhost:5678/healthz`
- Check the `N8N_URL` environment variable

**Storage endpoint returning zeros:**

- The container mounts `./data` at `/data` — verify the path exists

## License

Part of Dream Server — Local AI Infrastructure
