CI: Add warmup requests before benchmark to avoid JIT compilation during timed run by gyohuangxin · Pull Request #302 · ROCm/ATOM

gyohuangxin · 2026-03-11T06:29:07Z

Summary

Add --num-warmups=$CONC to benchmark_serving.py invocation in atom_test.sh
This sends warmup requests before the timed benchmark run, triggering JIT compilation of prefill-path kernels (e.g. mha_varlen_fwd) so they don't compile during the actual measurement
Without this, the first batch of benchmark requests triggers ~24s of aiter JIT compilation, skewing TTFT and throughput numbers

Test plan

Run a benchmark CI job and verify Warming up with N requests... appears before Starting main benchmark run...
Confirm no [aiter] start build messages appear after Starting main benchmark run...

Copilot

Pull request overview

Updates the benchmark CI script to run warmup requests before the timed benchmark phase, aiming to avoid JIT compilation skewing TTFT/throughput measurements.

Changes:

Add --num-warmups=$CONC to the benchmark_serving.py invocation in the benchmark path of atom_test.sh.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-11T06:33:56Z

.github/scripts/atom_test.sh

    --trust-remote-code \
+    --num-warmups=$CONC \
    --request-rate=inf --ignore-eos \


--num-warmups=$CONC relies on a CLI option that may not exist in the cloned https://github.com/kimbochen/bench_serving.git (the script is cloned from HEAD, unpinned). If upstream changes or lacks this flag, the benchmark step will fail at runtime. Consider pinning bench_serving to a known commit/tag that supports --num-warmups (or switching to the in-repo atom/benchmarks/benchmark_serving.py), and/or adding a quick guard that checks the flag is supported before invoking the benchmark. Also note this adds CONC extra requests per run (up to 256 in the nightly matrix); you may want to cap warmups to a smaller fixed number if the goal is only to trigger JIT once.

Add --num-warmups to benchmark to avoid JIT during timed run

2325cce

Copilot AI review requested due to automatic review settings March 11, 2026 06:29

Copilot started reviewing on behalf of gyohuangxin March 11, 2026 06:30 View session

Copilot AI reviewed Mar 11, 2026

View reviewed changes

gyohuangxin closed this Mar 12, 2026

gyohuangxin reopened this Mar 12, 2026

Change num-warmups from $CONC to 1

bb94f88

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CI: Add warmup requests before benchmark to avoid JIT compilation during timed run#302

CI: Add warmup requests before benchmark to avoid JIT compilation during timed run#302
gyohuangxin wants to merge 2 commits intomainfrom
ci/add-benchmark-warmups

gyohuangxin commented Mar 11, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Mar 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

gyohuangxin commented Mar 11, 2026

Summary

Test plan

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Mar 11, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants