Skip to content

Conversation

@anish-shanbhag
Copy link
Contributor

@anish-shanbhag anish-shanbhag commented Nov 25, 2025

Overview:

This change updates the perf database and match_workers to cache repeated queries, which should be fine since these operations don't appear to have any side effects.

Details:

In my local testing, this speeds up aiconfigurator cli default --model QWEN3_32B --total_gpus 32 --system h200_sxm from 206s --> 9s

Where should the reviewer start?

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

  • closes GitHub issue: #xxx

@copy-pr-bot
Copy link

copy-pr-bot bot commented Nov 25, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Signed-off-by: Anish Shanbhag <ashanbhag@nvidia.com>
Signed-off-by: Anish Shanbhag <ashanbhag@nvidia.com>
@anish-shanbhag anish-shanbhag force-pushed the ashanbhag/cache-perf-db branch from e342e0e to 1b17805 Compare November 25, 2025 06:27
Signed-off-by: Anish Shanbhag <ashanbhag@nvidia.com>
@tianhaox tianhaox merged commit 37f267b into ai-dynamo:main Nov 26, 2025
5 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants