PaddleSpeech r0.1.0
Features
CLI: New Feature
- Easy installation via pip:
  pip install paddlespeech
- CLI to quickly explore ASR, TTS, audio classification, speech translation, and punctuation restoration.
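As a rough illustration of the CLI workflow, the sketch below builds one command line per task and prints it. The subcommand names and flags (`asr`, `cls`, `st`, `tts`, `text --task punc`, `--input`, `--output`) are assumptions and should be confirmed with the CLI's own help output. The file names and sample text are hypothetical placeholders. Nothing is executed unless `dry_run=False` is passed and the `paddlespeech` executable is on the PATH.

```python
import shlex
import shutil
import subprocess


def build_commands(audio_path, text):
    """Build one argument list per task.

    Subcommand names and flags are assumptions, not confirmed by these notes.
    """
    return {
        "asr": ["paddlespeech", "asr", "--input", audio_path],
        "cls": ["paddlespeech", "cls", "--input", audio_path],
        "st": ["paddlespeech", "st", "--input", audio_path],
        "tts": ["paddlespeech", "tts", "--input", text, "--output", "output.wav"],
        "punc": ["paddlespeech", "text", "--task", "punc", "--input", text],
    }


def run_task(task, audio_path="input_16k.wav", text="hello world", dry_run=True):
    """Print the command for a task; run it only when explicitly requested.

    Returns the captured stdout when the command is executed, otherwise None.
    """
    cmd = build_commands(audio_path, text)[task]
    print(shlex.join(cmd))
    # Skip execution in dry-run mode or when the CLI is not installed.
    if dry_run or shutil.which("paddlespeech") is None:
        return None
    result = subprocess.run(cmd, check=True, capture_output=True, text=True)
    return result.stdout


if __name__ == "__main__":
    # Hypothetical inputs: replace with a real WAV file and real text.
    for task in ("asr", "tts", "cls", "st", "punc"):
        run_task(task)
```

Keeping the commands as argument lists avoids shell-quoting problems when the input text contains spaces or punctuation. The released default models may be language-specific, so the sample text here is only a placeholder.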
ASR
- Join CTC LM decoder
- Transformer LM model
- Improve DeepSpeech2 online model
- Refactor some configs
TTS
- Merge Parakeet into PaddleSpeech
- Add FastSpeech2-Conformer
  - Paper links: FastSpeech2, Conformer
  - Example link
- Add Multi Band MelGAN
- Add HiFiGAN
- Add Style MelGAN
- Add FastSpeech2 Voice Cloning with GE2E (SV2TTS)
CLS
- Add audio classification example on ESC-50 and custom dataset.
- Add audio tagging demo based on PANNs and AudioSet labels.
ST
- ST-MTL
- FAT-ST-MTL
Docs
- Add quick start
- Add Read the Docs documentation
- Improve installation documentation
- Add README for each example
Demos
- Audio_tagging
- Automatic_video_subtitiles
- Metaverse
- Punctuation_restoration
- Speech_recognition
- Speech_translation
- Story_talker
- Style_fs2
- Text_to_speech
Others
- Update released models and results
Acknowledgements
@zh794390558 @KPatr1ck @Jackwaterveg @yt605155624 @Mingxue-Xu @grasswolfs @jerryuhoo