Question about Flow-matching training from scratch

I’m trying to train CosyVoice2’s Flow-Matching model from scratch using the Dual Codec's code, since I need its streaming inference capability.
⚙️ Training Setup
Hardware: 8 × A800 GPUs
Training time:
1e-5 LR → trained for 3 days, stable but still not well-fitted (poor audio quality, inconsistent timbre).

<img width="1280" height="651" alt="Image" src="https://github.com/user-attachments/assets/7b04958b-3d16-4d41-b834-39f6d5c55b18" />

1e-4 LR → trained for 1 day, but quickly leads to gradient explosion and no convergence.
<img width="1280" height="860" alt="Image" src="https://github.com/user-attachments/assets/68833ec7-5dde-4a1d-a66d-b70d46156628" />
Optimizer: AdamW
Dataset: internal speech dataset (not using LibriTTS, the scale of dataset is about 2000 hours)
📉 Observations
The 1e-5 model produces overly smooth and unclear results, while the 1e-4 model diverges rapidly.

Below are 1e-5 model's mel-spectrogram comparisons:

predict mel：

<img width="1280" height="338" alt="Image" src="https://github.com/user-attachments/assets/f54fd636-86b3-4ad3-9832-70934198eb9f" />

gt mel:

<img width="1280" height="346" alt="Image" src="https://github.com/user-attachments/assets/6011b0f7-8614-4cd4-99d0-5f1a3ec1e66d" />


❓ Question
Has anyone managed to successfully train the Flow-Matching model from scratch (not fine-tuning pretrained weights)?
Any advice or experience on:

Choosing an appropriate learning rate or LR schedule

Using EMA, gradient clipping, or warmup strategies

Adjusting flow noise schedule or loss balancing to stabilize early training

Any hints would be greatly appreciated!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Question about Flow-matching training from scratch #1625

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Question about Flow-matching training from scratch #1625

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions