Skip to content

🟒 NVIDIA ONLY – All-in-One TTS App with Kokoro, KittenTTS, Higgs audio, Chatterbox, Fish-Speech, F5 & index-tts & indextts2, Supports Conversation Mode & eBook-to-Audiobook. All features work across all engines in a unified interface except vibe voice which is it's own app panel.

Notifications You must be signed in to change notification settings

jcannonsr/Ultimate-TTS-Studio-SUP3R-Edition

Β 
Β 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

90 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ’– Support & Donations

If you enjoy using Ultimate TTS Studio and would like to support its ongoing development, your generosity is deeply appreciated.
Any amount β€” big or small β€” makes a difference!

πŸ’° Donate via PayPal

Donate with PayPal

Support this project securely via PayPal.
Click below to make a one-time or recurring donation:

πŸ‘‰ paypal.me/30second

πŸͺ™ Bitcoin Donations Welcome

Bitcoin Address: 1N942jHr6vVuR2KAe2JEf3nN59eR21tpKv

Donate via Bitcoin

Your donations help cover:

  • 🧠 Development time and continuous feature updates
  • πŸ’Ύ Storage upgrades and data handling
  • 🧾 Subscription costs for dev tools and AI services
  • πŸš€ Hardware improvements (e.g., GPU upgrades)

Every contribution directly supports the creation of more innovative apps like this.
Thank you for helping keep the project alive and growing!


⚠️ Notice: Major App Update

πŸ“… Sept 18, 2025

This update brings VibeVoice and IndexTTS2 as newly supported TTS engines, expanding the variety and flexibility of voice options available. Screenshot 2025-09-18 164743Screenshot 2025-09-18 164736

πŸ“… Sept 13, 2025

This update brings KittenTTS Screenshot 2025-09-13 231901

πŸ“… July 29, 2025

This recent update brings Higgs-Audio TTS:

πŸ“… June 23, 2025

This recent update brings a few UI improvements focused on clarity and usability:

πŸŽ›οΈ TTS UI Refactor

  • The TTS engine selector is now organized into a tabbed interface, making it easier to navigate and less overwhelming. Screenshot 2025-06-23 204234

  • The audiobook feature has been moved into its own tab to reduce visual clutter and improve user experience. Screenshot 2025-06-23 204240


πŸ“… June 18, 2025

We’ve pushed another exciting update packed with new functionality and improvements!

πŸ†• New Additions & Improvements

πŸ—£οΈ TTS Integration Expanded

  • F5-TTS has now been added as a fifth supported engine, and it works seamlessly across all modes.
  • Index-TTS has been added as a supported speech engine.
  • All TTS engines now work across all modes, including narration, conversation, and ambient.

πŸ’¬ Kokoro Conversation Mode

  • Kokoro now fully supports conversation mode, offering a more dynamic and interactive experience.

βœ… Recommended Setup

For the smoothest installation and full feature compatibility:

  • Use a Conda environment, or
  • Install via Pinokio for the easiest experience.

πŸ“… June 10, 2025

We’re excited to announce a major update to the app!

🎧 New Feature: eBook to Audiobook

Bring your favorite eBooks to life with our brand-new custom voice audiobook feature. Instantly convert any eBook into a personalized listening experienceβ€”perfect for learning, multitasking, or relaxing on the go. Screenshot 2025-06-10 204108


πŸ“… June 7, 2025

This update brings key improvements to performance, model management, and the user interface. Here's what's new:

πŸ”§ Model Management

  • Models are no longer auto-loaded into GPU memory at app launch.
  • You can now manually load and unload models, giving you more precise control over memory usage.

🎨 UI Enhancements

  • A refreshed interface is now live.
  • The app is now optimized for dark mode. It still works in light mode, but some visuals may not display as intended.

🐟 Fish Speech Fix

  • Fixed a bug where Fish Speech did not chunk text correctly, which could cause processing issues.

πŸ“₯ Model Download Behavior

  • Chatterbox and Kokoro models will automatically download the first time you click "Load."
  • Fish Speech models must still be downloaded manually and are not included in the auto-download process.

πŸ—£οΈ New Feature: Custom Kokoro Voices

  • Kokoro now supports custom .pt voice models!
  • Use the Custom Voice Upload section in the Kokoro interface to upload your own compatible voice files.

Screenshot 2025-06-22 183117 Screenshot 2025-06-22 183134 Screenshot 2025-06-22 183147

✨ Ultimate TTS Studio SUP3R Edition ✨

Ultimate TTS Studio is a powerful all-in-one text-to-speech studio that brings together ChatterboxTTS, Kokoro TTS, and Fish Speech under one interactive Gradio interface.

🎭 Reference Audio Cloning πŸ—£οΈ Pre-trained Multi-Language Voices 🐟 Natural TTS with Audio Effects 🎡 Real-time Voice Synthesis & Export

---.

πŸš€ Features

  • 🎀 ChatterboxTTS: Custom voice cloning using short reference clips.
  • πŸ—£οΈ Kokoro TTS: High-quality, multilingual pre-trained voices.
  • 🐟 Fish Speech: Advanced TTS engine.
  • πŸŽ›οΈ Professional Audio Effects: Reverb, Echo, EQ, Pitch shift, Gain.

🚨🚨 WARNING / IMPORTANT NOTES 🚨🚨

⚠️ Tested Hardware: This project has only been tested on a Windows 11 machine with an RTX 4090 GPU. πŸ’» Performance or compatibility on other systems is not guaranteed.

πŸ”Š Audio Caution: The Fish Speech feature may occasionally produce extremely loud or muffled audio. 🎧 Please lower your volume and avoid using headphones during initial tests.


πŸ› οΈ Installation

⚠️ Windows Users β€” Important Note on pynini If you encounter the following error when installing pynini: ERROR: Failed building wheel for pynini You can fix this by installing it via conda: Pynini and wetextprocessing is needed for index-tts to work at its best Espeak-ng is needed for Kokoro to work at its best.

# After activating your conda environment (e.g., conda activate index-tts)
conda install -c conda-forge pynini==2.1.6
pip install WeTextProcessing --no-deps

Option 1.

Install via Pinokio You can use the Pinokio script here for one-click setup: Pinokio App Installer

Option 1a: Install via Dione

You can also use Dione for an easy one-click installation experience:


Option 2

πŸ” Auto-Installer Method (Recommended)

This is the fastest way to get started. It uses a built-in installer script for automatic setup and app launching.

πŸ› οΈ Before You Begin: Make sure you have Miniconda or Anaconda installed on your system. You can download Miniconda here: https://docs.conda.io/en/latest/miniconda.html

1. Clone the Repository

git clone https://github.com/SUP3RMASS1VE/Ultimate-TTS-Studio-SUP3R-Edition.git
cd Ultimate-TTS-Studio-SUP3R-Edition

2. Run the Installer

πŸ‘‰ Double-click RUN_INSTALLER in the project folder. This will automatically set up everything for you β€” dependencies, environment, etc.

3. Launch the App

πŸ‘‰ Double-click RUN_APP to open the app.

4. Update the App (When Needed)

πŸ‘‰ Double-click RUN_UPDATE to update the app to the latest version.


Option 3


🧠 Ultimate-TTS-Studio-SUP3R-Edition β€” Setup Guide (Conda)

Follow these steps to set up your environment for Ultimate TTS Studio SUP3R Edition using Conda and UV for fast dependency management.


πŸ”Ή 1. Clone the Repository

git clone https://github.com/SUP3RMASS1VE/Ultimate-TTS-Studio-SUP3R-Edition.git
cd Ultimate-TTS-Studio-SUP3R-Edition

πŸ”Ή 2. Create a Conda Environment

conda create -n ultimate-tts python=3.10 -y

πŸ”Ή 3. Activate the Environment

conda activate ultimate-tts

πŸ”Ή 4. (Optional) Install uv for Faster Installs

pip install uv

πŸ’‘ Tip: uv dramatically speeds up installation. If you prefer, you can use regular pip install instead.


πŸ”Ή 5. Install Dependencies

🧩 Step 1 β€” Core Requirements

uv pip install -r requirements.txt

βš™οΈ Step 2 β€” Specific Packages and CUDA Builds

uv pip install voxcpm openai-whisper --no-deps
uv pip install https://huggingface.co/lldacing/flash-attention-windows-wheel/resolve/main/flash_attn-2.7.4.post1%2Bcu128torch2.7.0cxx11abiFALSE-cp310-cp310-win_amd64.whl
uv pip install WeTextProcessing --no-deps
uv pip install triton-windows==3.3.1.post19
uv pip install torch==2.7.0 torchvision==0.22.0 torchaudio==2.7.0 --index-url https://download.pytorch.org/whl/cu128

βœ… Done!

Your environment is now ready to run Ultimate TTS Studio SUP3R Edition with CUDA 12.8 support. Launch the app and start generating high-quality speech!

πŸ’‘ If you encounter CUDA or package conflicts, ensure your GPU drivers are updated and that Conda’s python=3.10 matches the wheel compatibility.

πŸ’‘ If you're not using uv, you can just use pip install in its place.

🧠 First-Time Setup Tips

πŸ“₯ Download Fish Speech Model (one-time)

To use Fish Speech, you must download the model checkpoint from Hugging Face. This requires a Hugging Face account and access token.

πŸ” Step-by-Step:

  1. Create an account (if needed): https://huggingface.co/join

  2. Get your access token: Visit https://huggingface.co/settings/tokens and create a read token.

  3. Log in via CLI:

    huggingface-cli login

    Paste your token when prompted.

  4. (Optional) Accept the model license: Visit https://huggingface.co/fishaudio/openaudio-s1-mini and click "Access repository" if prompted.

  5. Download the model:

    huggingface-cli download fishaudio/openaudio-s1-mini --local-dir checkpoints/openaudio-s1-mini

▢️ Run the Studio

python launch.py

This will launch a local Gradio interface at: πŸ“ http://127.0.0.1:7860


πŸ’‘ Notes

  • All engines are optional β€” the app will gracefully disable missing engines.
  • ChatterboxTTS and Fish Speech support reference audio input.
  • Audio effects are applied post-synthesis for professional-quality output.
  • Custom Kokoro voices can be added to custom_voices/ as .pt files.

πŸ“œ License

MIT License Β© SUP3RMASS1VE


πŸ™ Acknowledgments

This project proudly integrates and builds upon the amazing work of:

We deeply thank the authors and contributors to these projects for making this work possible.


About

🟒 NVIDIA ONLY – All-in-One TTS App with Kokoro, KittenTTS, Higgs audio, Chatterbox, Fish-Speech, F5 & index-tts & indextts2, Supports Conversation Mode & eBook-to-Audiobook. All features work across all engines in a unified interface except vibe voice which is it's own app panel.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 98.3%
  • Other 1.7%