Setup Guide

This document explains how to set up the project for development, local rendering, and desktop packaging.

System Requirements

Windows, macOS, or Linux
Node.js 18 or newer
npm
Python 3.8 or newer for development voice generation
8 GB RAM minimum, 16 GB recommended for larger renders
Enough free disk space for cached assets, rendered segments, and output videos

Runtime Overview

The project now supports four main runtime entrypoints that share one application core:

Browser portal and HTTP API with npm run dev
CLI generation with npm run generate
MCP server with npm run mcp
Windows desktop packaging with Electron

Current executable entry files:

src/server.ts
src/cli.ts
src/mcp-server.ts
electron/electron-main.ts

For the architectural layout behind these entrypoints, see ARCHITECTURE.md.

Voice generation now follows this order when possible:

Edge-TTS
Windows offline speech fallback in packaged desktop mode
Google TTS fallback when available

Development Setup

1. Install Node.js

Install Node.js from nodejs.org.

Verify:

node -v
npm -v
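
The same check can be scripted, for example before installing dependencies. The sketch below is illustrative and not part of the project: `nodeMajor` and `meetsMinimumNode` are hypothetical helper names, and the minimum of 18 is taken from the system requirements above.

```typescript
// Extract the major version from a Node.js version string such as "v18.19.0".
function nodeMajor(version: string): number {
  const match = /^v?(\d+)\./.exec(version);
  if (match === null) {
    throw new Error(`Unrecognized Node.js version string: ${version}`);
  }
  return Number(match[1]);
}

// True when the given version satisfies the minimum major version (18 by default).
function meetsMinimumNode(version: string, minimumMajor = 18): boolean {
  return nodeMajor(version) >= minimumMajor;
}
```

In a Node.js script, calling `meetsMinimumNode(process.version)` applies the check to the running interpreter.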

2. Install project dependencies

npm install

3. Install Python voice dependencies

Windows:

py -m pip install -r requirements.txt

If py is unavailable:

python -m pip install -r requirements.txt

macOS or Linux:

python3 -m pip install -r requirements.txt

This installs the edge-tts runtime used during normal development.

4. FFmpeg

The project prefers bundled ffmpeg-static and ffprobe-static, so many machines do not need a global FFmpeg install.

A system FFmpeg install is still useful as a fallback.

Verify if you have a global install:

ffmpeg -version
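
The bundled-first preference described above can be sketched as follows. This is an assumption about the shape of the lookup, not the project's actual resolver: `resolveFfmpegPath` is a hypothetical helper, and the `exists` probe is injected so the logic does not depend on the file system.

```typescript
// Prefer the bundled binary when it is present; otherwise fall back to the
// bare command name so the operating system resolves it through PATH.
function resolveFfmpegPath(
  bundledPath: string | null,
  exists: (path: string) => boolean,
): string {
  if (bundledPath !== null && exists(bundledPath)) {
    return bundledPath;
  }
  return "ffmpeg"; // assumed fallback: a system install found via PATH
}
```

In a Node.js script, `bundledPath` would typically be the value exported by ffmpeg-static (a path string, or null when no binary is available for the platform) and `exists` would be `fs.existsSync`.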

5. Configure environment variables

Copy .env.example to .env.

Windows PowerShell:

Copy-Item .env.example .env

macOS or Linux:

cp .env.example .env

Set at least:

PEXELS_API_KEY=your_key_here
PIXABAY_API_KEY=
GEMINI_API_KEY=
PUBLIC_BASE_URL=
PORT=3001
VIDEO_ORIENTATION=portrait
VIDEO_VOICE=en-US-GuyNeural

PEXELS_API_KEY is the main required key for the standard browser workflow.
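
As a rough illustration of how these values might be validated at startup, the sketch below reads them from an environment-like object. It is not the project's actual loader: `loadConfig` and `AppConfig` are hypothetical names, and treating the sample values above (3001, portrait, en-US-GuyNeural) as defaults is an assumption.

```typescript
interface AppConfig {
  pexelsApiKey: string;
  port: number;
  orientation: string;
  voice: string;
}

// Return the trimmed value, or the fallback when the variable is unset or blank.
function valueOr(raw: string | undefined, fallback: string): string {
  const trimmed = (raw ?? "").trim();
  return trimmed === "" ? fallback : trimmed;
}

// Build a config object from environment variables, failing fast on bad input.
function loadConfig(env: Record<string, string | undefined>): AppConfig {
  const pexelsApiKey = valueOr(env.PEXELS_API_KEY, "");
  if (pexelsApiKey === "") {
    throw new Error("PEXELS_API_KEY is required for the standard browser workflow");
  }
  const port = Number(valueOr(env.PORT, "3001")); // assumed default
  if (!Number.isInteger(port) || port < 1 || port > 65535) {
    throw new Error(`PORT must be an integer between 1 and 65535, got: ${env.PORT}`);
  }
  return {
    pexelsApiKey,
    port,
    orientation: valueOr(env.VIDEO_ORIENTATION, "portrait"), // assumed default
    voice: valueOr(env.VIDEO_VOICE, "en-US-GuyNeural"), // assumed default
  };
}
```

In a Node.js process, `loadConfig(process.env)` would apply this to the variables loaded from .env.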

Running The Project

Local browser portal

npm run dev

Open:

http://localhost:3001/

CLI generation

Create or update input/input-scripts.json, then run:

npm run generate

MCP server

npm run mcp

Use this runtime when connecting Claude Desktop, Claude Code, or other MCP clients.

Remotion Studio

npm run remotion:studio

Verification Commands

Backend and shared TypeScript

npx tsc -p tsconfig.json --noEmit

Electron main process

Windows:

cmd /c node_modules\.bin\tsc.cmd -p tsconfig.electron.json --noEmit

macOS or Linux:

npx tsc -p tsconfig.electron.json --noEmit

Desktop bundle source check

npm run electron:verify-bundle

Unpacked Windows release check

npm run electron:verify-release

Health endpoint

Start the portal and open:

http://localhost:3001/health

The health response helps verify:

voice engine readiness
FFmpeg availability
Python runtime availability
Node module availability
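
A script that consumes the health response could reduce it to the checks that need attention. The sketch below assumes a payload with one boolean per check. The real /health response format is not documented in this guide, so `HealthReport`, `healthUrl`, `failingChecks`, and the field names in the usage note are all hypothetical.

```typescript
// Hypothetical payload shape: one boolean per check. The actual /health
// response may be structured differently.
type HealthReport = Record<string, boolean>;

// Build the health endpoint URL for a locally running portal.
function healthUrl(port: number, host = "localhost"): string {
  return `http://${host}:${port}/health`;
}

// Return the names of all checks that reported false, in alphabetical order.
function failingChecks(report: HealthReport): string[] {
  return Object.keys(report)
    .filter((name) => !report[name])
    .sort();
}
```

For example, a report of `{ ffmpeg: true, python: false, voiceEngine: false }` would yield `["python", "voiceEngine"]`, which points at the voice setup steps rather than FFmpeg.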

Desktop-Specific Notes

Setup wizard behavior

The Electron setup window does more than display information:

Launch App explicitly starts the backend and opens the portal
Skip also launches the app instead of just closing the window
Closing the setup window before launch exits the app cleanly

Voice engine behavior on Windows

Packaged desktop builds try:

bundled edge-tts.exe
bundled Python -m edge_tts
system edge-tts
Windows offline speech voices
Google TTS fallback if available

With these fallbacks, a fresh Windows install can generate voice without Python or edge-tts already installed on the system.
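
The fallback order listed above amounts to picking the first engine that is available. The sketch below shows that selection in isolation. It is not the project's implementation: `pickEngine` is a hypothetical helper, the engine labels are copied from the list for readability, and the availability probe is injected because how each engine is detected is not described here.

```typescript
// Engine labels in the order packaged Windows builds are described as trying them.
const WINDOWS_PACKAGED_ORDER = [
  "bundled edge-tts.exe",
  "bundled Python -m edge_tts",
  "system edge-tts",
  "Windows offline speech voices",
  "Google TTS fallback",
] as const;

type Engine = (typeof WINDOWS_PACKAGED_ORDER)[number];

// Return the first engine the probe reports as available, or null if none is.
function pickEngine(
  isAvailable: (engine: Engine) => boolean,
  order: readonly Engine[] = WINDOWS_PACKAGED_ORDER,
): Engine | null {
  for (const engine of order) {
    if (isAvailable(engine)) {
      return engine;
    }
  }
  return null;
}
```

Keeping the order in a single array makes the priority explicit and easy to compare against the list in this section.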

Render security default

Chromium web security is now enabled by default during Remotion render.

Only disable it if you have a specific compatibility issue.

Windows Command Prompt:

set REMOTION_DISABLE_WEB_SECURITY=1

macOS or Linux:

export REMOTION_DISABLE_WEB_SECURITY=1

Use this only for debugging or controlled environments.
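
One way such a flag could be mapped onto render options is sketched below. How the project actually reads the variable is an assumption: `chromiumOptionsFromEnv` is a hypothetical helper, and treating only the exact value "1" as enabled is a guess based on the command shown above. Remotion's render APIs accept a `chromiumOptions` object with a `disableWebSecurity` field, which is what the returned object is shaped for.

```typescript
// Translate the environment flag into a Chromium options object.
// Web security stays enabled unless the variable is exactly "1" (assumed rule).
function chromiumOptionsFromEnv(
  env: Record<string, string | undefined>,
): { disableWebSecurity: boolean } {
  return { disableWebSecurity: env.REMOTION_DISABLE_WEB_SECURITY === "1" };
}
```

Defaulting to false when the variable is unset or holds any other value keeps the secure behavior as the baseline.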

Troubleshooting

Edge-TTS works on your laptop but not on a fresh machine

That usually means your dev machine already had Python or edge-tts installed globally.

The desktop app reduces that problem by:

preferring bundled Edge-TTS
repairing the bundled runtime through the setup wizard
falling back to Windows offline speech on Windows

Voice engine still not available on Windows

Check:

the setup wizard
bundled runtime verification with npm run electron:verify-bundle
Windows speech voices in system settings
the /health endpoint

Packaged render behaves differently than dev

Run:

npm run electron:verify-release

That checks the unpacked desktop build for the expected bundled runtime files.
