
Conversation

@kimbochen
Owner

  • Added a warmup phase, configurable via the CLI argument --num-warmups
  • Added endpoint readiness checking: the client probes the endpoint and waits up to 10 minutes for it to become available
  • Updated the sequence length generation logic:
    • --random_range_ratio now matches vLLM: the sequence length is uniformly sampled from [seq_len * (1.0 - random_range_ratio), seq_len * (1.0 + random_range_ratio)]
    • Added an iterative encode-decode loop to minimize the mismatch between requested and actual token counts
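The endpoint readiness check described above could be sketched roughly as follows. This is an illustrative sketch, not the PR's actual code: the function name `wait_for_endpoint`, the probed URL, and the 5-second polling interval are assumptions; only the 10-minute cap comes from the description.

```python
import time
import urllib.request


def wait_for_endpoint(url: str, timeout_s: float = 600.0, interval_s: float = 5.0) -> bool:
    """Poll `url` until it answers with HTTP 200, giving up after
    `timeout_s` seconds (10 minutes by default)."""
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline:
        try:
            with urllib.request.urlopen(url, timeout=5) as resp:
                if resp.status == 200:
                    return True  # endpoint is up and serving
        except OSError:
            pass  # connection refused / timed out: server not ready yet
        time.sleep(interval_s)
    return False  # gave up after timeout_s
```

The benchmark client would call this once before issuing requests and abort (or warn) if it returns False.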

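A minimal sketch of the two sequence-length mechanisms, assuming a HuggingFace-style tokenizer with `encode`/`decode` and a `vocab_size` attribute. The function names, the retry cap, and the pad/trim strategy are invented for illustration; only the uniform sampling range and the decode/re-encode iteration are from the description.

```python
import random


def sample_seq_len(seq_len: int, random_range_ratio: float) -> int:
    """Uniformly sample a length in
    [seq_len * (1 - ratio), seq_len * (1 + ratio)], matching vLLM's
    --random-range-ratio behavior."""
    lo = int(seq_len * (1.0 - random_range_ratio))
    hi = int(seq_len * (1.0 + random_range_ratio))
    return random.randint(lo, hi)


def build_prompt(tokenizer, target_len: int, max_iters: int = 10) -> str:
    """Draw random token ids, then iteratively decode and re-encode,
    trimming or padding until the re-encoded length matches target_len.
    A detokenize-retokenize round trip is lossy, so a single pass can
    miss the target; iterating shrinks the mismatch."""
    ids = [random.randrange(tokenizer.vocab_size) for _ in range(target_len)]
    for _ in range(max_iters):
        ids = tokenizer.encode(tokenizer.decode(ids))
        if len(ids) == target_len:
            break
        if len(ids) > target_len:
            ids = ids[:target_len]  # trim surplus tokens
        else:
            ids += [random.randrange(tokenizer.vocab_size)  # pad with fresh random ids
                    for _ in range(target_len - len(ids))]
    return tokenizer.decode(ids)
```

With `random_range_ratio = 0` every request gets exactly `seq_len` tokens; larger ratios spread request lengths to better mimic mixed traffic.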