feat: generator add benchmark mode by setting rule plugins #290

Ethan-ES · 2026-01-25T16:39:41Z

Overview:

To ensure the generator can be used across different environments, we introduced a benchmark mode (enabled with --generator-set rule=benchmark) on top of the existing generator. In benchmark mode, the deployment configuration is generated strictly from the simulation results produced by the aiconfigurator SDK, which helps developers align performance with the simulation more accurately.

By default, the generator runs in production mode, which makes trade-offs tailored for general production deployments, such as increasing batch size and reducing the CUDA Graph batch size.
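For illustration only, here is a minimal sketch of how a KEY=VALUE option such as --generator-set rule=benchmark could be parsed, falling back to production mode when no rule is given. This is not the code from this PR: the helper names (`parse_generator_set`, `selected_rule`) and the default name `"production"` are assumptions made for the example.

```python
import argparse

# Assumption: the default rule set is named "production"; the PR only says
# that the generator runs in production mode by default.
DEFAULT_RULE = "production"


def parse_generator_set(pairs):
    """Turn ['rule=benchmark', ...] into {'rule': 'benchmark', ...}."""
    settings = {}
    for pair in pairs or []:
        key, sep, value = pair.partition("=")
        if not sep or not key or not value:
            raise ValueError(f"expected KEY=VALUE, got {pair!r}")
        settings[key] = value
    return settings


def selected_rule(argv):
    """Return the rule set named on the command line, or the default."""
    parser = argparse.ArgumentParser()
    # The option may be repeated; each occurrence carries one KEY=VALUE pair.
    parser.add_argument("--generator-set", action="append", metavar="KEY=VALUE")
    args = parser.parse_args(argv)
    return parse_generator_set(args.generator_set).get("rule", DEFAULT_RULE)
```

Under these assumptions, `selected_rule(["--generator-set", "rule=benchmark"])` selects the benchmark rule set, and an empty argument list falls back to the default.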

In addition to these two modes, users can define their own rules by adding new folders under rule_plugin. When invoking the CLI, different rule sets can be selected via the corresponding --generator-set rule=<folder_name> option.
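As a rough sketch of the folder-based selection described above, the lookup could work as follows. It assumes a layout of rule_plugin/<folder_name>/<backend>.rule, inferred from the path src/aiconfigurator/generator/rule_plugin/benchmark/vllm.rule reviewed in this PR. The function names are hypothetical and the generator's real lookup may differ.

```python
from pathlib import Path


def available_rule_sets(plugin_root):
    """List the folder names under the rule plugin directory."""
    root = Path(plugin_root)
    if not root.is_dir():
        return []
    return sorted(p.name for p in root.iterdir() if p.is_dir())


def resolve_rule_file(plugin_root, rule_set, backend):
    """Return the path to <plugin_root>/<rule_set>/<backend>.rule.

    Hypothetical helper: raises FileNotFoundError with the known rule
    sets when the requested file does not exist.
    """
    path = Path(plugin_root) / rule_set / f"{backend}.rule"
    if not path.is_file():
        known = ", ".join(available_rule_sets(plugin_root)) or "none"
        raise FileNotFoundError(
            f"no rule file {path} (available rule sets: {known})"
        )
    return path
```

With this layout, a custom rule set would only need a new folder containing one .rule file per backend it supports.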

Signed-off-by: etshen <etshen@nvidia.com>

copy-pr-bot · 2026-01-25T16:39:45Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

simone-chen · 2026-01-28T02:44:24Z

src/aiconfigurator/generator/rule_plugin/benchmark/vllm.rule

Do we want to add cuda_graph_batch_sizes to vllm.rule as well?

Ethan-ES · 2026-01-28

Thanks for the review. I've added cuda_graph_batch_sizes to vllm.rule on line 8 to stay consistent with trtllm and sglang. Please let me know if this is not appropriate.

feat: generator add benchmark mode by setting rule plugins

9380a26

Signed-off-by: etshen <etshen@nvidia.com>

Ethan-ES requested review from a team, Arsene12358, jasonqinzhou and tianhaox as code owners January 25, 2026 16:39

github-actions bot added the feat label Jan 25, 2026

simone-chen reviewed Jan 28, 2026

simone-chen approved these changes Jan 28, 2026
