perf: cache repeated queries to perf database and match_workers #128

anish-shanbhag · 2025-11-25T04:42:50Z

Overview:

This change caches repeated queries to the perf database and to match_workers. Caching should be safe because these operations don't appear to have any side effects.
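For readers unfamiliar with the approach, here is a minimal sketch of memoizing a side-effect-free query with `functools.lru_cache`. The function `query_latency`, its parameters, and the `CALLS` counter are hypothetical stand-ins for illustration and are not taken from the actual perf database code.

```python
from functools import lru_cache

# Counts how many times the underlying (uncached) computation really runs.
CALLS = {"count": 0}


@lru_cache(maxsize=None)
def query_latency(op: str, batch_size: int, seq_len: int) -> float:
    """Hypothetical stand-in for an expensive, side-effect-free perf query."""
    CALLS["count"] += 1
    return 0.001 * batch_size * seq_len + len(op)


first = query_latency("gemm", 8, 1024)   # computed
second = query_latency("gemm", 8, 1024)  # served from the cache
```

The cache key is the full argument tuple, so this only works when the result depends on nothing but the arguments, which is the "no side effects" condition stated above.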

Details:

In my local testing, this reduces the runtime of aiconfigurator cli default --model QWEN3_32B --total_gpus 32 --system h200_sxm from 206s to 9s.

Where should the reviewer start?

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

closes GitHub issue: #xxx

copy-pr-bot · 2025-11-25T04:42:54Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Signed-off-by: Anish Shanbhag <ashanbhag@nvidia.com>

src/aiconfigurator/sdk/perf_database.py

Signed-off-by: Anish Shanbhag <ashanbhag@nvidia.com>

anish-shanbhag requested review from AichenF, Arsene12358, YijiaZhao, ilyasher, jasonqinzhou, saturley-hall, tianhaox and xutizhou as code owners November 25, 2025 04:42

github-actions bot added the perf label Nov 25, 2025

anish-shanbhag mentioned this pull request Nov 25, 2025

perf: remove cubic interpolation and scipy dependency #116

Closed

anish-shanbhag added 2 commits November 24, 2025 22:27

perf: cache repeated queries to perf database and match_workers

4bd4aa8

Signed-off-by: Anish Shanbhag <ashanbhag@nvidia.com>

use lru_cache for match_workers

1b17805

Signed-off-by: Anish Shanbhag <ashanbhag@nvidia.com>
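One practical constraint when applying `lru_cache` to a function such as match_workers is that every argument must be hashable. The sketch below illustrates that constraint only; `best_pairing` and its throughput arguments are hypothetical and do not reflect the real match_workers signature or logic.

```python
from functools import lru_cache


@lru_cache(maxsize=128)
def best_pairing(prefill: tuple[float, ...], decode: tuple[float, ...]) -> tuple[int, int]:
    """Hypothetical stand-in: index pair whose two values are closest."""
    pairs = ((i, j) for i in range(len(prefill)) for j in range(len(decode)))
    return min(pairs, key=lambda ij: abs(prefill[ij[0]] - decode[ij[1]]))


def best_pairing_from_lists(prefill: list[float], decode: list[float]) -> tuple[int, int]:
    # Lists are unhashable, so convert them to tuples before hitting the cache.
    return best_pairing(tuple(prefill), tuple(decode))


result = best_pairing_from_lists([10.0, 20.0], [18.0, 35.0])
```

Passing a list directly to the cached function raises `TypeError`, which is why the thin wrapper converts its inputs first.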

anish-shanbhag force-pushed the ashanbhag/cache-perf-db branch from e342e0e to 1b17805 Compare November 25, 2025 06:27

tianhaox reviewed Nov 25, 2025

src/aiconfigurator/sdk/perf_database.py (resolved)

Clear cache when setting default SOL mode

18958fe

Signed-off-by: Anish Shanbhag <ashanbhag@nvidia.com>
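The commit above matters because a cached result can go stale when it depends on state that is not part of the cache key. A minimal sketch of that situation, assuming a hypothetical `PerfDatabaseSketch` class and placeholder mode names ("NON_SOL", "SOL") that are not taken from the real code:

```python
from functools import lru_cache


class PerfDatabaseSketch:
    """Hypothetical stand-in; not the real aiconfigurator class."""

    def __init__(self) -> None:
        self._default_sol_mode = "NON_SOL"  # placeholder mode name
        # Per-instance cache wrapping the uncached bound method.
        self._query = lru_cache(maxsize=None)(self._query_uncached)

    def _query_uncached(self, op: str, size: int) -> float:
        # The mode is read implicitly, so it is NOT part of the cache key.
        scale = 0.5 if self._default_sol_mode == "SOL" else 1.0
        return scale * size

    def query(self, op: str, size: int) -> float:
        return self._query(op, size)

    def set_default_sol_mode(self, mode: str) -> None:
        self._default_sol_mode = mode
        # Drop cached results computed under the previous mode.
        self._query.cache_clear()


db = PerfDatabaseSketch()
before = db.query("gemm", 100)
db.set_default_sol_mode("SOL")
after = db.query("gemm", 100)
```

Without the `cache_clear()` call, the second query would return the value computed under the old mode. An alternative design would be to pass the mode as an explicit argument so that it becomes part of the cache key.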

tianhaox approved these changes Nov 26, 2025

Arsene12358 approved these changes Nov 26, 2025

tianhaox merged commit 37f267b into ai-dynamo:main Nov 26, 2025
5 checks passed

3 participants
