vllm-project · lishunyang12 · Feb 5, 2026
diff --git a/PR_DESCRIPTION.md b/PR_DESCRIPTION.md
@@ -0,0 +1,26 @@
+## feat(qwen3-tts): Add Gradio demo for online serving
+
+Closes part of #938 (item 1.8 - Gradio Demo)
+
+### Summary
+- Add interactive Gradio web UI for Qwen3-TTS at `examples/online_serving/qwen3_tts/`
+- Support all 3 task types: CustomVoice, VoiceDesign, Base (voice cloning)
+- Show or hide input fields dynamically based on the selected task type
+- Fetch available speakers from the `/v1/audio/voices` endpoint
+- Add `run_gradio_demo.sh` to launch server + demo together
+
+### Files Changed
+- `examples/online_serving/qwen3_tts/gradio_demo.py` (new)
+- `examples/online_serving/qwen3_tts/run_gradio_demo.sh` (new)
+- `examples/online_serving/qwen3_tts/README.md` (updated)
+
+### Test plan
+- [ ] Start the server with `./run_server.sh CustomVoice`, run `python gradio_demo.py`, and generate speech with the Vivian and Ryan speakers
+- [ ] Start the server with the VoiceDesign model and verify that the instructions field is required
+- [ ] Start the server with the Base model, upload reference audio, and verify voice cloning
+- [ ] Verify that `run_gradio_demo.sh` launches both the server and the Gradio demo
+- [ ] Verify error messages when server is down or inputs are invalid
+
+### Notes
+- Streaming audio playback will be added as a follow-up once #1189 is merged
+- Follows the design pattern of `examples/online_serving/qwen3_omni/gradio_demo.py`
@@ -10,6 +10,22 @@ This directory contains examples for running Qwen3-TTS models with vLLM-Omni's o
 | `Qwen/Qwen3-TTS-12Hz-1.7B-VoiceDesign` | VoiceDesign | Natural language voice style description |
 | `Qwen/Qwen3-TTS-12Hz-1.7B-Base` | Base | Voice cloning from reference audio |
 
+## Gradio Demo
+
+The interactive Gradio demo provides a web UI for all 3 task types.
+
+```bash
+# Option 1: Launch server + Gradio together
+./run_gradio_demo.sh                                # CustomVoice (default)
+./run_gradio_demo.sh --task-type VoiceDesign        # VoiceDesign
+./run_gradio_demo.sh --task-type Base               # Voice cloning
+
+# Option 2: Connect to a server that is already running
+python gradio_demo.py --api-base http://localhost:8000
+```
+
+Then open http://127.0.0.1:7860 in your browser.
+
 ## Quick Start
 
 ### 1. Start the Server