Skip to content
Open
Show file tree
Hide file tree
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
26 changes: 26 additions & 0 deletions PR_DESCRIPTION.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,26 @@
## feat(qwen3-tts): Add Gradio demo for online serving

Closes part of #938 (item 1.8 - Gradio Demo)

### Summary
- Add interactive Gradio web UI for Qwen3-TTS at `examples/online_serving/qwen3_tts/`
- Support all 3 task types: CustomVoice, VoiceDesign, Base (voice cloning)
- Dynamic UI that shows/hides fields based on selected task type
- Fetches available speakers from `/v1/audio/voices` endpoint
- Add `run_gradio_demo.sh` to launch server + demo together

### Files Changed
- `examples/online_serving/qwen3_tts/gradio_demo.py` (new)
- `examples/online_serving/qwen3_tts/run_gradio_demo.sh` (new)
- `examples/online_serving/qwen3_tts/README.md` (updated)

### Test plan
- [ ] Start server with `./run_server.sh CustomVoice`, run `python gradio_demo.py`, generate speech with Vivian/Ryan speakers
- [ ] Start server with VoiceDesign model, verify instructions field is required
- [ ] Start server with Base model, upload reference audio and verify voice cloning
- [ ] Test `run_gradio_demo.sh` launches both server and Gradio
- [ ] Verify error messages when server is down or inputs are invalid

### Notes
- Streaming audio playback will be added as a follow-up once #1189 is merged
- References `examples/online_serving/qwen3_omni/gradio_demo.py` as design pattern
16 changes: 16 additions & 0 deletions examples/online_serving/qwen3_tts/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -10,6 +10,22 @@ This directory contains examples for running Qwen3-TTS models with vLLM-Omni's o
| `Qwen/Qwen3-TTS-12Hz-1.7B-VoiceDesign` | VoiceDesign | Natural language voice style description |
| `Qwen/Qwen3-TTS-12Hz-1.7B-Base` | Base | Voice cloning from reference audio |

## Gradio Demo

The interactive Gradio demo supports all 3 task types with a web UI.

```bash
# Option 1: Launch server + Gradio together
./run_gradio_demo.sh # CustomVoice (default)
./run_gradio_demo.sh --task-type VoiceDesign # VoiceDesign
./run_gradio_demo.sh --task-type Base # Voice cloning

# Option 2: If server is already running
python gradio_demo.py --api-base http://localhost:8000
```

Then open http://127.0.0.1:7860 in your browser.

## Quick Start

### 1. Start the Server
Expand Down
Loading
Loading