Training Steps #564
-
First, I'd like to express my gratitude for the excellent fish-speech model. It's truly impressive work. My dataset details: 76,000 voice samples Given these dataset characteristics, could you recommend an appropriate number of training steps? Any advice on optimizing the training process would be greatly appreciated. |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 3 replies
-
With 76,000 samples, a batch size of 32 gives you 2,375 batches per epoch. 10 epochs: 23,750 steps. |
Beta Was this translation helpful? Give feedback.
-
Hi, I just found this thread and I am also interested in training a custom fish speech model. However, I could not find any information about how to do this. Could you please provide me a pointer to some documentation? Thanks, Daniel |
Beta Was this translation helpful? Give feedback.
With 76,000 samples, a batch size of 32 gives you 2,375 batches per epoch.
A good starting point for speech models is to aim for around 10–20 epochs.
Therefore, total steps = epochs × steps per epoch.
10 epochs: 23,750 steps.
20 epochs: 47,500 steps.