Thanks for your interest in improving OmniVoice Studio! This guide covers everything you need to get started.
| 💬 Chat | Discord |
| 🐛 Bugs | GitHub Issues |
| 🏷️ Good First Issues | Filtered list |
| 📋 Roadmap | README → Roadmap |
- Git
- Bun (frontend package manager)
- uv (Python environment manager)
- ffmpeg (audio/video processing)
- Python 3.10+ (managed automatically by
uv)
git clone https://github.com/debpalash/OmniVoice-Studio.git
cd OmniVoice-Studio
bun install
bun run devThis starts both services:
| Service | URL | What it does |
|---|---|---|
| Backend | localhost:3900 |
FastAPI server — TTS, ASR, diarization, dubbing pipeline |
| Frontend | localhost:3901 |
React + Vite UI |
bun run desktopRequires Rust and platform-specific Tauri dependencies — see the Tauri prerequisites.
OmniVoice-Studio/
├── backend/ # Python FastAPI server
│ ├── api/ # Route handlers
│ ├── core/ # Config, prefs, constants
│ └── services/ # TTS engines, ASR, dubbing, audio DSP
│ └── tts_backend.py # ← Multi-engine TTS registry
├── frontend/ # React + Vite
│ ├── src/
│ │ ├── components/ # UI components
│ │ ├── hooks/ # Custom React hooks
│ │ ├── stores/ # Zustand state slices
│ │ └── utils/ # Shared utilities
│ └── src-tauri/ # Rust/Tauri desktop shell
├── deploy/ # Docker, CI configs
├── docs/ # Screenshots, MCP config
└── scripts/ # Build & release scripts
Open an issue with:
- What happened vs what you expected
- Steps to reproduce
- OS, GPU, and Python version (find in Settings → Logs)
- Error logs (Settings → Logs → copy relevant lines)
- Fork the repo and create a branch from
main - Keep PRs focused — one feature or fix per PR
- Run tests before pushing:
# Backend tests uv run pytest backend/ -x -q # Frontend build check cd frontend && npx vite build --mode development
- Write a clear PR title — it becomes the squash-merge commit message
- Don't include local machine stats, file paths, or private system info in PR descriptions
OmniVoice's TTS backend is a plugin registry. Adding a new engine takes ~50 lines:
- Open
backend/services/tts_backend.py - Create a class extending
TTSBackend:
class MyEngineBackend(TTSBackend):
id = "my-engine"
display_name = "My Engine (description)"
@classmethod
def is_available(cls) -> tuple[bool, str]:
try:
import my_engine # noqa: F401
return True, "ready"
except ImportError:
return False, "my_engine not installed. pip install my-engine"
@property
def sample_rate(self) -> int:
return 24000
@property
def supported_languages(self) -> list[str]:
return ["en", "zh"]
def generate(self, text: str, **kw) -> torch.Tensor:
# ... call your engine, return [1, num_samples] tensor- Register it in
_REGISTRYat the bottom of the file - That's it — it auto-appears in Settings → TTS Engine
- Formatter: We don't enforce one globally — match the style of the file you're editing
- Logging: Use
logger.warning()/logger.error(), never bareprint() - Exceptions: Avoid bare
except: pass— catch specific exceptions - Type hints: Use them for public API functions and class methods
- Components: Functional components with hooks
- State: Zustand stores in
src/stores/, organized by slice - CSS: Vanilla CSS in component-level files — no Tailwind
- Naming:
PascalCasefor components,camelCasefor hooks and utils
- Format:
cargo fmtbefore committing - Modules: One concern per file (
bootstrap.rs,tools.rs,config.rs,commands.rs)
Write clear, concise messages. The PR title becomes the squash-merge commit.
good: fix: prevent CUDA OOM during concurrent transcription + TTS
good: feat: add CosyVoice 3 TTS backend adapter
good: docs: add platform compatibility matrix to README
bad: fixed stuff
bad: update
bad: WIP
# Run all backend tests
uv run pytest backend/ -x -q
# Run a specific test file
uv run pytest backend/tests/test_api.py -x -q
# Frontend build validation (no test suite yet)
cd frontend && npx vite build --mode development
# Tauri shell check (requires Rust)
cd frontend/src-tauri && cargo check- Stuck on setup? Ask in Discord #help
- Not sure where to start? Check good first issues
- Want to discuss a big change? Open a discussion or Discord thread before coding
Thank you for contributing! 🎙️