@@ -180,69 +180,200 @@ Via the easy-to-use, efficient, flexible and scalable implementation, our vision
 ## Installation

 We strongly recommend installing PaddleSpeech on **Linux** with *python>=3.7* and *paddlepaddle>=2.3.1*.
-Up to now, **Linux** supports CLI for the all our tasks, **Mac OSX** and **Windows** only supports PaddleSpeech CLI for Audio Classification, Speech-to-Text and Text-to-Speech. To install `PaddleSpeech`, please see [installation](./docs/source/install.md).
+
+### **Dependency Introduction**
+
++ gcc >= 4.8.5
++ paddlepaddle >= 2.3.1
++ python >= 3.7
++ OS support: Linux (recommended), Windows, Mac OSX
+
+PaddleSpeech depends on paddlepaddle. To install it, refer to the official [paddlepaddle](https://www.paddlepaddle.org.cn/en) website and choose the build that matches your machine. Here is an example of the cpu version.
+For other installation problems (conda environment, librosa dependencies, gcc issues, kaldi installation, and so on), refer to the [installation document](./docs/source/install.md). If you run into problems during installation, you can leave a message on [#2150](https://github.com/PaddlePaddle/PaddleSpeech/issues/2150) and look for related issues there.
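The minimum versions in the dependency list above can be checked before installing. This is a minimal sketch and not part of PaddleSpeech: `parse_version` and `meets_minimum` are hypothetical helper names, and the comparison assumes plain dotted versions such as `2.3.1` (a suffix like `rc0` is ignored).

```python
import re
import sys


def parse_version(text):
    """Turn a dotted version string such as '2.3.1' into a tuple of ints."""
    parts = []
    for piece in text.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break  # stop at the first non-numeric component
        parts.append(int(match.group()))
    return tuple(parts)


def meets_minimum(installed, minimum):
    """Return True when the installed version is at least the minimum."""
    return parse_version(installed) >= parse_version(minimum)


# Minimum versions copied from the dependency list above.
MIN_PYTHON = "3.7"
MIN_PADDLE = "2.3.1"

python_version = "{}.{}".format(sys.version_info[0], sys.version_info[1])
print("python ok:", meets_minimum(python_version, MIN_PYTHON))

try:
    import paddle  # only importable once paddlepaddle is installed
    print("paddlepaddle ok:", meets_minimum(paddle.__version__, MIN_PADDLE))
except ImportError:
    print("paddlepaddle is not installed")
```

The gcc requirement is not covered here, since how to query the compiler version depends on the platform.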
 <a name="quickstart"></a>
 ## Quick Start

-Developers can have a try of our models with [PaddleSpeech Command Line](./paddlespeech/cli/README.md). Change `--input` to test your own audio/text.
+Developers can try our models with the [PaddleSpeech Command Line](./paddlespeech/cli/README.md) or with Python. Change `--input` to test your own audio or text; audio input must be a 16k wav file.
+
+**You can also try it quickly in AI Studio 👉🏻 [PaddleSpeech API Demo](https://aistudio.baidu.com/aistudio/projectdetail/4353348?sUid=2470186&shared=1&ts=1660876445786)**
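Because the quick start expects 16k wav audio, it can help to check a file before passing it to `--input`. The sketch below uses only Python's standard `wave` module; `is_16k_wav` is a hypothetical helper name and not a PaddleSpeech function, and "16k" is taken to mean a 16000 Hz sample rate.

```python
import wave


def is_16k_wav(path):
    """Return True if `path` is a readable wav file sampled at 16000 Hz."""
    try:
        with wave.open(path, "rb") as wav_file:
            return wav_file.getframerate() == 16000
    except (wave.Error, EOFError):
        # Not a wav file that the standard library can parse.
        return False
```

For example, `is_16k_wav("zh.wav")` returns `False` for a 44.1 kHz recording, which would then need resampling before use.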
- A web demo for Automatic Speech Recognition is integrated into [Huggingface Spaces](https://huggingface.co/spaces) with [Gradio](https://github.com/gradio-app/gradio). See Demo: [ASR Demo](https://huggingface.co/spaces/KPatrick/PaddleSpeechASR)
- A web demo for Text to Speech is integrated into [Huggingface Spaces](https://huggingface.co/spaces) with [Gradio](https://github.com/gradio-app/gradio). See Demo: [TTS Demo](https://huggingface.co/spaces/KPatrick/PaddleSpeechTTS)
-**Text Postprocessing**
-- Punctuation Restoration
-```bash
-paddlespeech text --task punc --input 今天的天气真不错啊你下午有空吗我想约你一起去吃饭
>>> result = text_punc(text="今天的天气真不错啊你下午有空吗我想约你一起去吃饭")
+今天的天气真不错啊!你下午有空吗?我想约你一起去吃饭。
 ```
-paddlespeech asr --input ./zh.wav | paddlespeech text --task punc
+
+</details>
+
+### Speech Translation
+
+<details><summary> End-to-end English to Chinese Speech Translation Tool</summary>
+
+This tool uses pre-compiled kaldi-related tools, so it can only be tried on Ubuntu.
+
+**command line experience**
+
+```shell
+paddlespeech st --input en.wav
 ```
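Since the speech translation tool is only supported on Ubuntu, a script can check the distribution before calling `paddlespeech st`. This is a rough sketch under stated assumptions: `is_ubuntu` and `read_os_release` are hypothetical helper names, and the check relies on the standard `ID=` field of `/etc/os-release` rather than on anything PaddleSpeech provides.

```python
def is_ubuntu(os_release_text):
    """Return True if os-release content identifies the system as Ubuntu."""
    for line in os_release_text.splitlines():
        key, _, value = line.partition("=")
        if key.strip() == "ID":
            return value.strip().strip('"').lower() == "ubuntu"
    return False


def read_os_release(path="/etc/os-release"):
    """Read the os-release file, returning '' where it does not exist."""
    try:
        with open(path, encoding="utf-8") as handle:
            return handle.read()
    except OSError:
        return ""


if is_ubuntu(read_os_release()):
    print("Ubuntu detected: the speech translation command can be tried")
else:
    print("Not Ubuntu: the speech translation tool is not expected to work")
```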
-For more command lines, please see: [demos](https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/demos)
+**Python API experience**

-If you want to try more functions like training and tuning, please have a look at [Speech-to-Text Quick Start](./docs/source/asr/quick_start.md) and [Text-to-Speech Quick Start](./docs/source/tts/quick_start.md).
Developers can try our speech server with the [PaddleSpeech Server Command Line](./paddlespeech/server/README.md).
+**You can try it quickly in AI Studio (recommended): [SpeechServer](https://aistudio.baidu.com/aistudio/projectdetail/4354592?sUid=2470186&shared=1&ts=1660877827034)**