✨ Ultimate TTS Studio SUP3R Edition ✨

💖 Support & Donations

If you enjoy using Ultimate TTS Studio and would like to support its ongoing development, your generosity is deeply appreciated.
Any amount — big or small — makes a difference!

💰 Donate via PayPal

Support this project securely via PayPal.
Click below to make a one-time or recurring donation:

👉 paypal.me/30second

🪙 Bitcoin Donations Welcome

Bitcoin Address: 1N942jHr6vVuR2KAe2JEf3nN59eR21tpKv

Your donations help cover:

🧠 Development time and continuous feature updates
💾 Storage upgrades and data handling
🧾 Subscription costs for dev tools and AI services
🚀 Hardware improvements (e.g., GPU upgrades)

Every contribution directly supports the creation of more innovative apps like this.
Thank you for helping keep the project alive and growing!

⚠️ Notice: Major App Update

📅 Sept 18, 2025

This update brings VibeVoice and IndexTTS2 as newly supported TTS engines, expanding the variety and flexibility of voice options available.

📅 Sept 13, 2025

This update brings KittenTTS

📅 July 29, 2025

This recent update brings Higgs-Audio TTS:

📅 June 23, 2025

This recent update brings a few UI improvements focused on clarity and usability:

🎛️ TTS UI Refactor

The TTS engine selector is now organized into a tabbed interface, making it easier to navigate and less overwhelming.
The audiobook feature has been moved into its own tab to reduce visual clutter and improve user experience.

📅 June 18, 2025

We’ve pushed another exciting update packed with new functionality and improvements!

🆕 New Additions & Improvements

🗣️ TTS Integration Expanded

F5-TTS has now been added as a fifth supported engine, and it works seamlessly across all modes.
Index-TTS has been added as a supported speech engine.
All TTS engines now work across all modes, including narration, conversation, and ambient.

💬 Kokoro Conversation Mode

Kokoro now fully supports conversation mode, offering a more dynamic and interactive experience.

✅ Recommended Setup

For the smoothest installation and full feature compatibility:

Use a Conda environment, or
Install via Pinokio for the easiest experience.

📅 June 10, 2025

We’re excited to announce a major update to the app!

🎧 New Feature: eBook to Audiobook

Bring your favorite eBooks to life with our brand-new custom voice audiobook feature. Instantly convert any eBook into a personalized listening experience—perfect for learning, multitasking, or relaxing on the go.

📅 June 7, 2025

This update brings key improvements to performance, model management, and the user interface. Here's what's new:

🔧 Model Management

Models are no longer auto-loaded into GPU memory at app launch.
You can now manually load and unload models, giving you more precise control over memory usage.

🎨 UI Enhancements

A refreshed interface is now live.
The app is now optimized for dark mode. It still works in light mode, but some visuals may not display as intended.

🐟 Fish Speech Fix

Fixed a bug where Fish Speech did not chunk text correctly, which could cause processing issues.

📥 Model Download Behavior

Chatterbox and Kokoro models will automatically download the first time you click "Load."
Fish Speech models must still be downloaded manually and are not included in the auto-download process.

🗣️ New Feature: Custom Kokoro Voices

Kokoro now supports custom .pt voice models!
Use the Custom Voice Upload section in the Kokoro interface to upload your own compatible voice files.

✨ Ultimate TTS Studio SUP3R Edition ✨

Ultimate TTS Studio is a powerful all-in-one text-to-speech studio that brings together ChatterboxTTS, Kokoro TTS, and Fish Speech under one interactive Gradio interface.

🎭 Reference Audio Cloning 🗣️ Pre-trained Multi-Language Voices 🐟 Natural TTS with Audio Effects 🎵 Real-time Voice Synthesis & Export

---.

🚀 Features

🎤 ChatterboxTTS: Custom voice cloning using short reference clips.
🗣️ Kokoro TTS: High-quality, multilingual pre-trained voices.
🐟 Fish Speech: Advanced TTS engine.
🎛️ Professional Audio Effects: Reverb, Echo, EQ, Pitch shift, Gain.

🚨🚨 WARNING / IMPORTANT NOTES 🚨🚨

⚠️ Tested Hardware: This project has only been tested on a Windows 11 machine with an RTX 4090 GPU. 💻 Performance or compatibility on other systems is not guaranteed.

🔊 Audio Caution: The Fish Speech feature may occasionally produce extremely loud or muffled audio. 🎧 Please lower your volume and avoid using headphones during initial tests.

🛠️ Installation

⚠️ Windows Users — Important Note on pynini If you encounter the following error when installing pynini: ERROR: Failed building wheel for pynini You can fix this by installing it via conda: Pynini and wetextprocessing is needed for index-tts to work at its best Espeak-ng is needed for Kokoro to work at its best.

# After activating your conda environment (e.g., conda activate index-tts)
conda install -c conda-forge pynini==2.1.6
pip install WeTextProcessing --no-deps

Option 1.

Install via Pinokio You can use the Pinokio script here for one-click setup: Pinokio App Installer

Option 1a: Install via Dione

You can also use Dione for an easy one-click installation experience:

Option 2

🔁 Auto-Installer Method (Recommended)

This is the fastest way to get started. It uses a built-in installer script for automatic setup and app launching.

🛠️ Before You Begin: Make sure you have Miniconda or Anaconda installed on your system. You can download Miniconda here: https://docs.conda.io/en/latest/miniconda.html

1. Clone the Repository

git clone https://github.com/SUP3RMASS1VE/Ultimate-TTS-Studio-SUP3R-Edition.git
cd Ultimate-TTS-Studio-SUP3R-Edition

2. Run the Installer

👉 Double-click RUN_INSTALLER in the project folder. This will automatically set up everything for you — dependencies, environment, etc.

3. Launch the App

👉 Double-click RUN_APP to open the app.

4. Update the App (When Needed)

👉 Double-click RUN_UPDATE to update the app to the latest version.

Option 3

🧠 Ultimate-TTS-Studio-SUP3R-Edition — Setup Guide (Conda)

Follow these steps to set up your environment for Ultimate TTS Studio SUP3R Edition using Conda and UV for fast dependency management.

🔹 1. Clone the Repository

git clone https://github.com/SUP3RMASS1VE/Ultimate-TTS-Studio-SUP3R-Edition.git
cd Ultimate-TTS-Studio-SUP3R-Edition

🔹 2. Create a Conda Environment

conda create -n ultimate-tts python=3.10 -y

🔹 3. Activate the Environment

conda activate ultimate-tts

🔹 4. (Optional) Install `uv` for Faster Installs

pip install uv

💡 Tip: uv dramatically speeds up installation. If you prefer, you can use regular pip install instead.

🔹 5. Install Dependencies

🧩 Step 1 — Core Requirements

uv pip install -r requirements.txt

⚙️ Step 2 — Specific Packages and CUDA Builds

uv pip install voxcpm openai-whisper --no-deps
uv pip install https://huggingface.co/lldacing/flash-attention-windows-wheel/resolve/main/flash_attn-2.7.4.post1%2Bcu128torch2.7.0cxx11abiFALSE-cp310-cp310-win_amd64.whl
uv pip install WeTextProcessing --no-deps
uv pip install triton-windows==3.3.1.post19
uv pip install torch==2.7.0 torchvision==0.22.0 torchaudio==2.7.0 --index-url https://download.pytorch.org/whl/cu128

✅ Done!

Your environment is now ready to run Ultimate TTS Studio SUP3R Edition with CUDA 12.8 support. Launch the app and start generating high-quality speech!

💡 If you encounter CUDA or package conflicts, ensure your GPU drivers are updated and that Conda’s python=3.10 matches the wheel compatibility.

💡 If you're not using uv, you can just use pip install in its place.

🧠 First-Time Setup Tips

📥 Download Fish Speech Model (one-time)

To use Fish Speech, you must download the model checkpoint from Hugging Face. This requires a Hugging Face account and access token.

🔐 Step-by-Step:

Create an account (if needed): https://huggingface.co/join
Get your access token: Visit https://huggingface.co/settings/tokens and create a read token.
Log in via CLI:
```
huggingface-cli login
```
Paste your token when prompted.
(Optional) Accept the model license: Visit https://huggingface.co/fishaudio/openaudio-s1-mini and click "Access repository" if prompted.

Download the model:

huggingface-cli download fishaudio/openaudio-s1-mini --local-dir checkpoints/openaudio-s1-mini

▶️ Run the Studio

python launch.py

This will launch a local Gradio interface at: 📍 http://127.0.0.1:7860

💡 Notes

All engines are optional — the app will gracefully disable missing engines.
ChatterboxTTS and Fish Speech support reference audio input.
Audio effects are applied post-synthesis for professional-quality output.
Custom Kokoro voices can be added to custom_voices/ as .pt files.

📜 License

🙏 Acknowledgments

This project proudly integrates and builds upon the amazing work of:

Fish Speech by fishaudio – Natural and expressive TTS engine. 📜 License: MIT License
Kokoro TTS by hexgrad – High-quality multilingual voice synthesis. 📜 License: Apache 2.0 License
ChatterboxTTS by Resemble AI – Custom voice cloning from short reference clips. 📜 License: Apache 2.0 License
F5-TTS by SWivid – Efficient and lightweight TTS model focused on real-time synthesis. 📜 License: MIT License
Index TTS – Modular and scalable text-to-speech system with advanced voice capabilities. 📜 License: Apache 2.0 License

We deeply thank the authors and contributors to these projects for making this work possible.

Name		Name	Last commit message	Last commit date
Latest commit History 91 Commits
chatterbox/src/chatterbox		chatterbox/src/chatterbox
fish_speech		fish_speech
higgs_audio		higgs_audio
indextts		indextts
indextts2		indextts2
sample		sample
tools		tools
vibevoice		vibevoice
.gitignore		.gitignore
.project-root		.project-root
README.md		README.md
RUN_APP.bat		RUN_APP.bat
RUN_INSTALLER.bat		RUN_INSTALLER.bat
RUN_UPDATER.bat		RUN_UPDATER.bat
ebook_converter.py		ebook_converter.py
f5_tts_handler.py		f5_tts_handler.py
ffmpeg_env_config.py		ffmpeg_env_config.py
higgs_audio_handler.py		higgs_audio_handler.py
indextts2_handler.py		indextts2_handler.py
install_direct.bat		install_direct.bat
kitten_tts_handler.py		kitten_tts_handler.py
launch.py		launch.py
requirements.txt		requirements.txt
update.bat		update.bat
vibevoice_handler.py		vibevoice_handler.py
voxcpm_handler.py		voxcpm_handler.py

SUP3RMASS1VE/Ultimate-TTS-Studio-SUP3R-Edition

Folders and files

Latest commit

History

Repository files navigation

💖 Support & Donations

💰 Donate via PayPal

🪙 Bitcoin Donations Welcome

⚠️ Notice: Major App Update

📅 Sept 18, 2025

📅 Sept 13, 2025

📅 July 29, 2025

📅 June 23, 2025

🎛️ TTS UI Refactor

📅 June 18, 2025

🆕 New Additions & Improvements

🗣️ TTS Integration Expanded

💬 Kokoro Conversation Mode

✅ Recommended Setup

📅 June 10, 2025

🎧 New Feature: eBook to Audiobook

📅 June 7, 2025

🔧 Model Management

🎨 UI Enhancements

🐟 Fish Speech Fix

📥 Model Download Behavior

🗣️ New Feature: Custom Kokoro Voices

✨ Ultimate TTS Studio SUP3R Edition ✨

🚀 Features

🚨🚨 WARNING / IMPORTANT NOTES 🚨🚨

🛠️ Installation

Option 1.

Option 1a: Install via Dione

Option 2

🔁 Auto-Installer Method (Recommended)

1. Clone the Repository

2. Run the Installer

3. Launch the App

4. Update the App (When Needed)

Option 3

🧠 Ultimate-TTS-Studio-SUP3R-Edition — Setup Guide (Conda)

🔹 1. Clone the Repository

🔹 2. Create a Conda Environment

🔹 3. Activate the Environment

🔹 4. (Optional) Install uv for Faster Installs

🔹 5. Install Dependencies

🧩 Step 1 — Core Requirements

⚙️ Step 2 — Specific Packages and CUDA Builds

✅ Done!

🧠 First-Time Setup Tips

📥 Download Fish Speech Model (one-time)

🔐 Step-by-Step:

▶️ Run the Studio

💡 Notes

📜 License

🙏 Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

🔹 4. (Optional) Install `uv` for Faster Installs

Packages