Montage AI

AI-assisted rough-cutting for video creators, with CLI and web UI workflows.

This README is the single source of truth for onboarding and local setup.

Quick Start (Docker)

# Build image (first run)
docker compose build

# Start Web UI
docker compose up

Open http://localhost:8080.

Port conflict? If port 8080 is already in use, override with: WEB_PORT=8081 docker compose up (then open http://localhost:8081).

CLI run (uses data/input + data/music):

docker compose run --rm montage-ai /app/montage-ai.sh run

Output: data/output/montage_<timestamp>.mp4 Duration: ~2-5 min for 3x 30s clips (system-dependent) Progress: Logs show beat detection -> scene analysis -> clip assembly -> rendering

Preview mode (faster):

QUALITY_PROFILE=preview docker compose run --rm montage-ai /app/montage-ai.sh run
# → 360p output, ~60% faster

See also: Installation Test Guide to verify your setup, Configuration for all environment variables.

System Requirements

Docker + Docker Compose v2
16 GB RAM recommended (8 GB minimum for preview)
4+ CPU cores recommended (2 cores minimum)
10+ GB free disk space

Quick checks:

docker --version
docker compose version
free -h | grep Mem
nproc
df -h /

Windows (PowerShell):

docker --version
docker compose version
RAM: (Get-CimInstance Win32_ComputerSystem).TotalPhysicalMemory / 1GB
CPU: (Get-CimInstance Win32_ComputerSystem).NumberOfLogicalProcessors

Resource limits: Docker defaults to 12 GB memory / 4 CPUs (optimized for 16 GB systems). Override for your system:

# 32 GB system (recommended for best performance):
DOCKER_MEMORY_LIMIT=24g DOCKER_CPU_LIMIT=8 docker compose up

# 8 GB system (minimum, use preview mode):
DOCKER_MEMORY_LIMIT=6g DOCKER_CPU_LIMIT=2 QUALITY_PROFILE=preview docker compose up

If Docker fails to start with "OCI runtime error", reduce the memory limit below your system RAM.

First-Time Setup

git clone https://github.com/mfahsold/montage-ai.git
cd montage-ai

# Run setup script (required before first build)
./scripts/setup.sh

# Generate test media (optional; or provide your own)
./scripts/ops/create-test-video.sh

What setup.sh does:

Check	Action
Data directories	Creates `data/input`, `data/music`, `data/output`, `data/assets`, `data/luts`
Permissions (Linux)	Fixes ownership if directories are owned by root
Docker	Verifies Docker and Compose v2 are installed
Disk space	Warns if < 30 GB free, fails if < 5 GB
RAM	Reports available system memory
Architecture	Detects ARM64 and notes MediaPipe limitation

The script is idempotent — safe to re-run at any time.

Add your own media (alternative):

# Copy your videos to data/input/
cp ~/Videos/*.mp4 data/input/

# Copy music to data/music/
cp ~/Music/track.mp3 data/music/

Permission note:

Linux: If you see "Permission denied" errors, the setup script will help fix data/ directory ownership. If needed, run: sudo chown -R $USER:$USER data/
macOS / Windows (Docker Desktop): Permissions are handled automatically by Docker Desktop. No manual fix needed.

Run:

docker compose up
# Then open http://localhost:8080

ARM64 (Snapdragon, Apple Silicon)

ARM64 is supported via multi-arch Docker images. Use the same commands.

ARM64 limitation: MediaPipe (face detection for auto-reframe) is not available on ARM64. Auto-reframe uses center-crop fallback, which works well for most content. The [WARN] MediaPipe not installed log message is safe to ignore. See Optional Dependencies for details.

Verify architecture:

uname -m

Recommended Docker resources (examples):

Snapdragon 12 GB: memory 8g, cpus 8
Apple Silicon 16 GB: memory 12g, cpus 8

If you want an automated check and a preview render test:

./scripts/quick-setup-arm.sh
./scripts/validate-onboarding.sh

Troubleshooting

Port in use: WEB_PORT=8081 docker compose up
Low RAM (8 GB): DOCKER_MEMORY_LIMIT=6g QUALITY_PROFILE=preview docker compose up
Docker OCI error: lower memory limit: DOCKER_MEMORY_LIMIT=6g docker compose up
High-performance system: DOCKER_MEMORY_LIMIT=24g DOCKER_CPU_LIMIT=8 docker compose up
aarch64 local venv: mediapipe is unavailable on Python >= 3.13; use Docker or skip [ai]

Documentation Navigator

Where should I start?

New user?
├── Just want to try it → docs/quickstart.md (5 min)
├── Full setup guide    → docs/getting-started.md
└── ARM64 device?       → docs/getting-started-arm.md

Already running?
├── Configure settings  → docs/configuration.md
├── See all features    → docs/features.md
├── High-res workflow   → docs/high-res-workflow.md
├── Fix an error        → docs/troubleshooting.md
└── Tune performance    → docs/performance-tuning.md

Deploying to K8s?
├── Local K3s dev       → docs/k3s-local-setup.md
├── Cluster setup       → docs/cluster-deploy.md
├── Full K8s reference  → deploy/k3s/README.md
└── Operations          → docs/operations/README.md

Developing?
├── Architecture        → docs/architecture.md
├── Contributing        → CONTRIBUTING.md
└── Full docs index     → docs/README.md

LLM Backend: Optional

Montage AI works without any LLM backend. Style templates, beat-synced editing, and all video effects work out of the box. LLM adds natural language creative direction but is not required.

Capability	No LLM	With LLM (Ollama/Gemini/OpenAI)
Style templates (7 built-in)	Yes	Yes
Beat-synced editing	Yes	Yes
All video effects	Yes	Yes
Natural language prompts	—	Yes
Creative Loop (iterative refinement)	—	Yes
Custom editing instructions	—	Yes

If you see No LLM backend available in the logs, this is informational — not an error. To enable LLM features, see Configuration: AI/LLM Settings.

How It Works

Input Videos + Music ─> Beat Detection ─> Scene Analysis ─> Clip Assembly ─> FFmpeg Render ─> Final Video
                              │                                    ▲
                              └── Creative Director (LLM) ─────────┘
                                  Style Template (JSON)

See Architecture for the full component diagram.

Key Features

Beat-synced editing with 7 built-in style templates
Smart reframing (16:9 to 9:16) for TikTok/Shorts
AI denoising, stabilization, upscaling, color grading, film grain
Dialogue ducking, audio normalization, voice isolation
Caption burning (TikTok, Minimal, Bold, Cinematic, Karaoke)
Timeline export (OTIO, EDL, CSV) for DaVinci Resolve / Premiere Pro
Story engine with narrative arc optimization
Cloud GPU acceleration via cgpu (optional)
Quality profiles: Preview (360p fast) / Standard / High / Master (4K)

See Features for the complete feature matrix and CLI examples.

Development

./scripts/ci.sh
make code-health

Name		Name	Last commit message	Last commit date
Latest commit History 523 Commits
.github		.github
bin		bin
config		config
data/luts		data/luts
deploy		deploy
docs		docs
go		go
issues		issues
private/docs		private/docs
scripts		scripts
src/montage_ai		src/montage_ai
tests		tests
.dockerignore		.dockerignore
.drone.yml		.drone.yml
.env.example		.env.example
.gitignore		.gitignore
.markdownlintignore		.markdownlintignore
.pre-commit-config.yaml		.pre-commit-config.yaml
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CINEMATIC_STABILIZED_EPIC_COMPLETION_REPORT.md		CINEMATIC_STABILIZED_EPIC_COMPLETION_REPORT.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
DEPLOYMENT_NOTES_2026_02_10.md		DEPLOYMENT_NOTES_2026_02_10.md
DEPLOYMENT_READINESS_SUMMARY.md		DEPLOYMENT_READINESS_SUMMARY.md
Dockerfile		Dockerfile
Dockerfile.qsv		Dockerfile.qsv
EXTREME_MODE_CHEATSHEET.txt		EXTREME_MODE_CHEATSHEET.txt
EXTREME_STABILIZATION_GUIDE.md		EXTREME_STABILIZATION_GUIDE.md
IMPLEMENTATION_SUMMARY_EXTREME_STABILIZATION.md		IMPLEMENTATION_SUMMARY_EXTREME_STABILIZATION.md
INSTALLATION_AND_DEPLOYMENT_ANALYSIS.md		INSTALLATION_AND_DEPLOYMENT_ANALYSIS.md
INSTALLATION_VERIFICATION_REPORT.md		INSTALLATION_VERIFICATION_REPORT.md
Jenkinsfile		Jenkinsfile
K3S_CLUSTER_TEST_RESULTS.md		K3S_CLUSTER_TEST_RESULTS.md
K3S_DEPLOYMENT_SESSION_NOTES.md		K3S_DEPLOYMENT_SESSION_NOTES.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
Makefile		Makefile
NOTICE		NOTICE
QUALITY_ENHANCEMENTS_TEST_REPORT_2026_02_10.md		QUALITY_ENHANCEMENTS_TEST_REPORT_2026_02_10.md
QUALITY_SESSION_COMPLETE.md		QUALITY_SESSION_COMPLETE.md
QUICKSTART_COLOR_GRADING.md		QUICKSTART_COLOR_GRADING.md
README.md		README.md
SECURITY.md		SECURITY.md
SESSION_SUMMARY_2026_02_10.md		SESSION_SUMMARY_2026_02_10.md
STABILITY_IMPROVEMENTS.md		STABILITY_IMPROVEMENTS.md
THIRD_PARTY_LICENSES.md		THIRD_PARTY_LICENSES.md
analyze_footage_creative.py		analyze_footage_creative.py
buildkitd.toml		buildkitd.toml
docker-compose.web.yml		docker-compose.web.yml
docker-compose.yml		docker-compose.yml
evaluate_montage_quality.py		evaluate_montage_quality.py
extreme-stabilization-quickstart.sh		extreme-stabilization-quickstart.sh
manual-test.yaml		manual-test.yaml
montage-ai.sh		montage-ai.sh
montage-status.sh		montage-status.sh
pro_stabilization_quickref.sh		pro_stabilization_quickref.sh
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
render_creative_cut.py		render_creative_cut.py
requirements.txt		requirements.txt
service.build.yaml		service.build.yaml
test_color_grading_render.py		test_color_grading_render.py
test_pro_stabilization.py		test_pro_stabilization.py
test_quality_enhancements.py		test_quality_enhancements.py
test_quality_simple.py		test_quality_simple.py
uv.lock		uv.lock
verify-deployment.sh		verify-deployment.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Montage AI

Quick Start (Docker)

System Requirements

First-Time Setup

ARM64 (Snapdragon, Apple Silicon)

Troubleshooting

Documentation Navigator

LLM Backend: Optional

How It Works

Key Features

Development

About

Uh oh!

Releases

Sponsor this project

Uh oh!

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Montage AI

Quick Start (Docker)

System Requirements

First-Time Setup

ARM64 (Snapdragon, Apple Silicon)

Troubleshooting

Documentation Navigator

LLM Backend: Optional

How It Works

Key Features

Development

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages