
Added a vllm quickstart playbook and scripts for curl and gradio web interactive client #47

Merged
danielholanda merged 7 commits into main from vllm-quick-start
Feb 11, 2026
Conversation

@hongxiayang
Collaborator

In this playbook, you will learn how to:

  • Set up and run vLLM with ROCm support using Docker for high-performance LLM inference on AMD GPUs
  • Download and configure language models from Hugging Face for use with vLLM
  • Start and configure a vLLM server with OpenAI-compatible API endpoints on port 8000
  • Test the server using curl commands and API requests
  • Launch and use the Gradio web interface (port 7860) for interactive chat with real-time streaming responses
  • Configure server parameters like GPU memory utilization, model length limits, and multi-GPU support
  • Make API calls to the vLLM server using both streaming and non-streaming requests
  • Troubleshoot common issues with server startup, memory, and client connections
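The API steps above can be sketched as a small client against the server's OpenAI-compatible endpoint on port 8000. This is a minimal illustration, not the playbook's actual scripts: the model name is a placeholder, and the helper names (`build_chat_payload`, `parse_sse_line`, `chat`) are hypothetical.

```python
import json
import urllib.request

API_URL = "http://localhost:8000/v1/chat/completions"
MODEL = "meta-llama/Llama-3.1-8B-Instruct"  # placeholder; use your downloaded model

def build_chat_payload(prompt, stream=False, model=MODEL):
    """Build an OpenAI-compatible chat-completions request body."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": stream,
    }

def parse_sse_line(line):
    """Extract the content delta from one server-sent-events line, if any.

    Streaming responses arrive as lines like:
        data: {"choices": [{"delta": {"content": "Hel"}}]}
    terminated by a final 'data: [DONE]' line.
    """
    line = line.strip()
    if not line.startswith("data: ") or line == "data: [DONE]":
        return None
    chunk = json.loads(line[len("data: "):])
    return chunk["choices"][0]["delta"].get("content")

def chat(prompt, stream=False):
    """Send one request to a running vLLM server and return the reply text."""
    req = urllib.request.Request(
        API_URL,
        data=json.dumps(build_chat_payload(prompt, stream)).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        if not stream:
            body = json.loads(resp.read())
            return body["choices"][0]["message"]["content"]
        # Streaming: concatenate the per-chunk deltas as they arrive.
        parts = []
        for raw in resp:
            piece = parse_sse_line(raw.decode())
            if piece:
                parts.append(piece)
        return "".join(parts)

# With the server up: chat("Hello") for a single response,
# chat("Hello", stream=True) to accumulate a streamed one.
```

The same non-streaming request can be issued from the shell with `curl http://localhost:8000/v1/chat/completions -H "Content-Type: application/json" -d '<payload>'`, which is how the playbook's curl script exercises the server.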

@danielholanda
Collaborator

@hongxiayang Please fix the CI failure by moving the additional scripts you added to an assets folder inside the vllm-inference repo.

@danielholanda
Collaborator

Next steps:

  • Daniel to investigate alternatives on preinstalling vLLM or facilitating installs without Docker.

@hongxiayang
Collaborator Author

> @hongxiayang Please fix the CI failure by moving the additional scripts you added to an assets folder inside the vllm-inference repo.

@danielholanda Done.

@danielholanda self-requested a review on February 11, 2026, 22:41
Collaborator

@danielholanda left a comment


Thanks for your contribution, @hongxiayang. Eddie will now take it over and adapt it to work with vLLM wheels instead of Docker.

danielholanda merged commit f8d5e34 into main on Feb 11, 2026
3 checks passed

2 participants