vLLM CI tests #251

Merged
sreeram-11 merged 4 commits into main from sreeram/vLLM-tests
May 4, 2026
Conversation

Collaborator

sreeram-11 commented May 4, 2026

What the CI tests validate

  1. Required asset scripts exist:
    • curl_script.sh
    • chat_with_model.py
  2. chat_with_model.py has valid Python syntax.
  3. Python virtual environment can be created and activated (the overall flow of steps 3-15 is sketched after this list).
  4. PyTorch 2.9.1 built for ROCm 7.12.0 can be installed.
  5. vLLM can be installed from AMD’s prebuilt ROCm wheel index.
  6. Required ROCm-related environment variables can be set:
    • PYTHONPATH
    • FLASH_ATTENTION_TRITON_AMD_ENABLE
  7. vLLM, PyTorch, and flash-attn can be imported successfully.
  8. PyTorch detects HIP/GPU availability.
  9. vLLM server can start on 127.0.0.1:8000 with Qwen/Qwen3-1.7B.
  10. vLLM server responds to the /health endpoint.
  11. vLLM server responds to the OpenAI-compatible /v1/models endpoint.
  12. Chat completion works through the README-style curl -X POST command.
  13. Chat completion works through curl_script.sh.
  14. Chat completion works through the OpenAI Python API using the local vLLM server.
  15. vLLM server is stopped/cleaned up after the smoke test.
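
For reference, this is the rough shape of steps 3-15 as a sketch, not the playbook's actual contents: the wheel index URLs are placeholders that stand in for AMD's real ROCm 7.12 indexes, the PYTHONPATH value and package set are assumptions, and step 13 (curl_script.sh) is omitted for brevity.

```bash
#!/usr/bin/env bash
set -euo pipefail

# Steps 3-5: venv plus ROCm wheels. Both index URLs are placeholders.
TORCH_INDEX="https://example.com/rocm-7.12/torch"   # hypothetical
VLLM_INDEX="https://example.com/rocm-7.12/vllm"     # hypothetical
python3.12 -m venv .venv
source .venv/bin/activate
pip install --index-url "$TORCH_INDEX" torch==2.9.1
pip install --extra-index-url "$VLLM_INDEX" vllm flash-attn  # package set is an assumption

# Step 6: ROCm-related environment variables.
export PYTHONPATH="$PWD"  # actual value is playbook-specific
export FLASH_ATTENTION_TRITON_AMD_ENABLE=TRUE

# Steps 7-8: imports succeed and PyTorch sees a HIP device.
python -c "import torch, vllm, flash_attn; print(torch.version.hip)"
python -c "import torch; assert torch.cuda.is_available()"

# Steps 9-10: start the server in the background, then poll /health.
vllm serve Qwen/Qwen3-1.7B --host 127.0.0.1 --port 8000 &
SERVER_PID=$!
timeout 600 bash -c 'until curl -sf http://127.0.0.1:8000/health; do sleep 5; done'

# Step 11: OpenAI-compatible model listing.
curl -sf http://127.0.0.1:8000/v1/models

# Step 12: README-style chat completion.
curl -sf -X POST http://127.0.0.1:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "Qwen/Qwen3-1.7B", "messages": [{"role": "user", "content": "Say hello."}]}'

# Step 14: chat completion through the OpenAI Python client.
python - <<'EOF'
from openai import OpenAI

client = OpenAI(base_url="http://127.0.0.1:8000/v1", api_key="EMPTY")
resp = client.chat.completions.create(
    model="Qwen/Qwen3-1.7B",
    messages=[{"role": "user", "content": "Say hello."}],
)
print(resp.choices[0].message.content)
EOF

# Step 15: stop the server.
kill "$SERVER_PID"
```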

User Actions Required

  1. Make sure Python 3.12 is available on the Linux runner.
  2. Make sure the runner has internet access to:
    • install Python packages from the AMD ROCm wheel indexes.
    • download Qwen/Qwen3-1.7B from Hugging Face, or confirm that the model is already cached (a preflight check for both is sketched below).
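
A preflight check along these lines covers both requirements; the cache path below is the Hugging Face default and an assumption about the runner's layout:

```bash
# Fail fast if the required interpreter is missing.
command -v python3.12 >/dev/null || { echo "python3.12 not found" >&2; exit 1; }

# The model is pulled on demand (see note 1 below), so only warn when it is
# neither cached under the default Hugging Face path nor reachable online.
if [ ! -d "$HOME/.cache/huggingface/hub/models--Qwen--Qwen3-1.7B" ]; then
  curl -sfI https://huggingface.co >/dev/null \
    || echo "warning: no network and no cached Qwen/Qwen3-1.7B" >&2
fi
```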

Info worth noting

  1. The Qwen/Qwen3-1.7B model does not need to be manually downloaded before running the tests. vllm serve Qwen/Qwen3-1.7B downloads the model automatically on first run if it is not already cached.

  2. Updated test-playbooks.yml to use Python 3.12 for the vLLM playbook.

  3. System/global ROCm on runner: 7.2.0
    Python venv ROCm/PyTorch/vLLM packages: ROCm 7.12.0-based wheels

    • So the runner can keep the global ROCm stack at 7.2.0 while .venv contains ROCm 7.12 Python packages (a quick check for this split is sketched after this list).
    • The important condition is that the system driver must be compatible enough with the user-space ROCm packages we are using. In this case, that condition is satisfied.
    • Installing PyTorch/vLLM from the ROCm 7.12 wheel indexes installs ROCm 7.12-compatible Python/user-space packages into the virtual environment, but it does not install or upgrade the global system ROCm driver stack.
    • No reboot is required because pip is only changing the venv, not the system driver.
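
A quick way to see this split on a runner (the version file path is the conventional ROCm install location and may vary):

```bash
# Global/system ROCm stack (7.2.0 on this runner).
cat /opt/rocm/.info/version

# ROCm version the venv's PyTorch wheel was built against (7.12 here).
.venv/bin/python -c "import torch; print(torch.version.hip)"
```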

sreeram-11 requested a review from danielholanda May 4, 2026 16:30
sreeram-11 marked this pull request as draft May 4, 2026 16:37
sreeram-11 marked this pull request as ready for review May 4, 2026 18:09
Collaborator

danielholanda left a comment

This is one of the most thorough tests we've added so far. Thank you for the great work here!

sreeram-11 merged commit ddf25eb into main May 4, 2026
6 checks passed