MockLLM-ChatTTS Quick Start and Fine-tuning Guide

This project is fine-tuned from ChatTTS.

Environment

Python Version: Python 3.10

Python versions lower than 3.10 may cause errors.
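If you want to confirm the interpreter before installing anything, a small check like the one below can help. It is only a sketch: the helper name python_is_supported is hypothetical and not part of this repository, and the 3.10 threshold is taken from the note above.

```python
import sys

MIN_PYTHON = (3, 10)  # version named in this guide


def python_is_supported(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True when the (major, minor) version meets the minimum."""
    return tuple(version_info[:2]) >= minimum


if __name__ == "__main__":
    found = ".".join(str(part) for part in sys.version_info[:3])
    if python_is_supported():
        print(f"Python {found} meets the 3.10 recommendation")
    else:
        print(f"Warning: Python {found} is older than 3.10 and may cause errors")
```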

Installation

Install the required dependencies with:

pip install -r requirements.txt

Generate Audio

Run the following command to generate a sample audio file (output.wav):

python test.py

You can modify the text content inside test.py to generate audio from your custom input.

Fine-tuning

You can fine-tune the DVAE and GPT modules using your own dataset. Note: Fine-tuning starts from the pre-trained models located in the asset folder (e.g., DVAE_full.pt, Decoder.pt).

Prepare Your Data

Prepare your .wav audio files and create a .list file formatted according to the provided examples.
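Before starting a long training run, it may be worth checking that every entry in the .list file points to an existing .wav file. The sketch below makes several assumptions that this guide does not confirm: that each line is separated by "|", that the audio path is the first field, and that relative paths are resolved against the folder containing the .list file. Check the provided examples and adjust sep and path_field accordingly. The function name check_list_file is hypothetical.

```python
from pathlib import Path


def check_list_file(list_path, sep="|", path_field=0):
    """Return (line_number, message) pairs for entries that look wrong.

    Assumptions (not confirmed by this guide): fields are separated by
    `sep`, and field `path_field` holds the path to a .wav file.
    """
    list_path = Path(list_path)
    problems = []
    with list_path.open(encoding="utf-8") as handle:
        for number, raw in enumerate(handle, start=1):
            line = raw.strip()
            if not line:
                continue  # skip blank lines
            fields = line.split(sep)
            if path_field >= len(fields):
                problems.append((number, "line has too few fields"))
                continue
            audio = Path(fields[path_field].strip())
            if not audio.is_absolute():
                # Assumed: relative paths are relative to the .list file
                audio = list_path.parent / audio
            if audio.suffix.lower() != ".wav":
                problems.append((number, f"not a .wav file: {audio.name}"))
            elif not audio.is_file():
                problems.append((number, f"missing audio file: {audio}"))
    return problems


if __name__ == "__main__":
    import sys

    # Usage: python check_list.py yours.list
    if len(sys.argv) == 2:
        for number, message in check_list_file(sys.argv[1]):
            print(f"line {number}: {message}")
```

An empty result only means the paths resolve under these assumptions; it does not validate the remaining fields.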

Fine-tune DVAE

Run the following command:

CUDA_VISIBLE_DEVICES=0 python -m examples.finetune.finetune \
  --color \
  --save_folder ./saved_models \
  --data_path yours.list \
  --tar_path data/Xz.tar \
  --batch_size 32 \
  --epochs 10 \
  --train_module dvae

Fine-tune GPT Speaker

Run the following command:

CUDA_VISIBLE_DEVICES=0 python -m examples.finetune.finetune \
  --color \
  --save_folder ./saved_models \
  --data_path yours.list \
  --tar_path data/Xz.tar \
  --batch_size 32 \
  --epochs 10 \
  --train_module gpt_speaker

Make sure to update data_path to point to your dataset's .list file.
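Since the two commands above differ only in the --train_module value, they can be generated from one helper if you prefer to script the runs. This is a minimal sketch: build_finetune_command is a hypothetical helper, not part of the repository, and it uses only the flags and default values shown in the commands above.

```python
import shlex
import sys

TRAIN_MODULES = ("dvae", "gpt_speaker")  # the two values used in this guide


def build_finetune_command(train_module, data_path, tar_path="data/Xz.tar",
                           save_folder="./saved_models", batch_size=32,
                           epochs=10):
    """Build the argument list for one fine-tuning run."""
    if train_module not in TRAIN_MODULES:
        raise ValueError(f"unsupported train_module: {train_module!r}")
    return [
        sys.executable, "-m", "examples.finetune.finetune",
        "--color",
        "--save_folder", str(save_folder),
        "--data_path", str(data_path),
        "--tar_path", str(tar_path),
        "--batch_size", str(batch_size),
        "--epochs", str(epochs),
        "--train_module", train_module,
    ]


if __name__ == "__main__":
    # Print the commands instead of running them, so they can be reviewed.
    for module in TRAIN_MODULES:
        print(shlex.join(build_finetune_command(module, "yours.list")))
```

To launch a run, the returned list could be passed to subprocess.run(cmd, check=True) from the repository root, with CUDA_VISIBLE_DEVICES set in the environment as in the shell commands.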

