PaddleSpeech r0.1.1

Jackwaterveg released this 14 Jan 03:27

· 2728 commits to develop since this release

3d5aac6

New Features

CLI :

Add cli stats. #1274
Add unit test. #1321
ASR: Support English: Add transformer_libirspeech model. #1297
ASR: Support 4 decoding methods: ctc_greedy_search, ctc_beam_search, attention, attention_rescoring. #1297
ASR & ST: Use the unified config. #1305 / #1312
ASR: Refactor the code. #1260 by @AdamBear
TTS: Support long input text by default. #1241
TTS: Add Style MelGAN and HiFiGAN. #1241

ASR

Refactor configs in examples. #1225

TTS

Fix some frontend bugs. #1262 by @JiehangXie / #1310
Add speaker embedding and speaker id for style fastspeech2 inference. #1197 by @jerryuhoo
Add support for finetuning speedyspeech. #1302 by @jerryuhoo / #1322 / #1337
Update VCTK Parallel WaveGAN. #1294
Update Multi Band MelGAN. #1272

ST

Refactor configs in examples. #1225

Text

Refactor Punctuation Restoration example. #1215

Docs

Add topic note for releasing python packages
Add TTS papers. #1330
Add Frontend G2P topic. #1254

Others

Update released models and results. #1306

Acknowledgements

@zh794390558 @yt605155624 @Jackwaterveg @KPatr1ck @Mingxue-Xu @JiehangXie @grasswolfs @jerryuhoo @AdamBear @LittleChenCc @JamesLim-sy

Contributors

AdamBear, zh794390558, and 9 other contributors

Assets 2