Add multi-configuration performance benchmarking #5858

JanuszL · 2025-03-25T08:23:10Z

Add support for range-based arguments for CPU threads and HW decoder load
Implement performance testing across multiple configurations
Add best throughput configuration reporting
Enhance performance metrics with more detailed statistics
Refactor pipeline definitions to support variable HW decoder load

Category:

New feature (non-breaking change which adds functionality)

Description:

Add support for range-based arguments for CPU threads and HW decoder load
Implement performance testing across multiple configurations
Add best throughput configuration reporting
Enhance performance metrics with more detailed statistics
Refactor pipeline definitions to support variable HW decoder load

Additional information:

Affected modules and functionalities:

internal_tools/hw_decoder_bench.py

Key points relevant for the review:

NA

Tests:

Checklist

Documentation

DALI team only

Requirements

Implements new requirements
Affects existing requirements
N/A

REQ IDs: N/A

JIRA TASK: N/A

JanuszL · 2025-03-25T08:24:02Z

!build

internal_tools/hw_decoder_bench.py

dali-automaton · 2025-03-25T08:31:45Z

CI MESSAGE: [25999015]: BUILD STARTED

dali-automaton · 2025-03-25T08:36:53Z

CI MESSAGE: [25999015]: BUILD FAILED

JanuszL · 2025-03-25T08:53:33Z

!build

dali-automaton · 2025-03-25T08:55:32Z

CI MESSAGE: [25999773]: BUILD STARTED

dali-automaton · 2025-03-25T18:10:44Z

CI MESSAGE: [25999773]: BUILD PASSED

jantonguirao · 2025-03-31T08:56:56Z

internal_tools/hw_decoder_bench.py


+        @pipeline_def(


pipeline definitions could be outside of the loops, for better readibility.

jantonguirao · 2025-03-31T08:58:26Z

internal_tools/hw_decoder_bench.py

+    print(f"Total time: {best_result['total_time']:.6f} sec")
+    print(f"Throughput: {best_result['total_throughput']:.2f} frames/sec")
+else:
+    print("No results to display")


an idea, print the results for all parameters in a way that can be easily parsed and plotted.

Can you elaborate which format you have in mind?

It was just a thought. Instead of just printing the best result, print all the values for all the tested parameters, so that we can plot them. Anyway, feel free to consider this as out-of-scope for now

We print them as the benchmark goes, do you think it is better to print them again at the end?

jantonguirao · 2025-03-31T08:59:18Z

internal_tools/hw_decoder_bench.py

+parser.add_argument(
+    "-j",
+    dest="num_threads",
+    help="CPU threads. Can be a single value (e.g. 4) or range 'start:end:step' (e.g. 1:8:2)",


I'd mention that the end of the range is included (which is not typically the case in Python ranges)

jantonguirao · 2025-03-31T09:00:20Z

internal_tools/hw_decoder_bench.py

 )

+
+def parse_range_arg(arg_str, use_float=False):


just a suggestion: maybe just parse_fn=int here instead of use_float

- Add support for range-based arguments for CPU threads and HW decoder load - Implement performance testing across multiple configurations - Add best throughput configuration reporting - Enhance performance metrics with more detailed statistics - Refactor pipeline definitions to support variable HW decoder load Signed-off-by: Janusz Lisiecki <[email protected]>

Signed-off-by: Janusz Lisiecki <[email protected]>

JanuszL · 2025-04-07T09:44:30Z

!build

dali-automaton · 2025-04-07T09:53:04Z

CI MESSAGE: [26563974]: BUILD STARTED

dali-automaton · 2025-04-08T05:15:39Z

CI MESSAGE: [26563974]: BUILD PASSED

JanuszL force-pushed the hw_ben_sweep branch from 159f763 to 709a88f Compare March 25, 2025 08:23

github-advanced-security bot found potential problems Mar 25, 2025

View reviewed changes

internal_tools/hw_decoder_bench.py Fixed Show fixed Hide fixed

JanuszL force-pushed the hw_ben_sweep branch from 709a88f to 767019d Compare March 25, 2025 08:53

dali-automaton assigned jantonguirao and szalpal Mar 26, 2025

jantonguirao reviewed Mar 31, 2025

View reviewed changes

JanuszL added 2 commits April 7, 2025 09:42

Review fixes

4343d1c

Signed-off-by: Janusz Lisiecki <[email protected]>

JanuszL force-pushed the hw_ben_sweep branch from fa2715d to e3f2c88 Compare April 7, 2025 07:43

Reformat

e43bc73

Signed-off-by: Janusz Lisiecki <[email protected]>

JanuszL force-pushed the hw_ben_sweep branch from e3f2c88 to e43bc73 Compare April 7, 2025 08:27

szalpal approved these changes Apr 7, 2025

View reviewed changes

jantonguirao approved these changes Apr 7, 2025

View reviewed changes

JanuszL merged commit 09bf104 into NVIDIA:main Apr 8, 2025
7 checks passed

JanuszL deleted the hw_ben_sweep branch April 8, 2025 05:18

Add multi-configuration performance benchmarking #5858

Add multi-configuration performance benchmarking #5858

Uh oh!

Conversation

JanuszL commented Mar 25, 2025

Category:

Description:

Additional information:

Affected modules and functionalities:

Key points relevant for the review:

Tests:

Checklist

Documentation

DALI team only

Requirements

Uh oh!

JanuszL commented Mar 25, 2025

Uh oh!

Uh oh!

dali-automaton commented Mar 25, 2025

Uh oh!

dali-automaton commented Mar 25, 2025

Uh oh!

JanuszL commented Mar 25, 2025

Uh oh!

dali-automaton commented Mar 25, 2025

Uh oh!

dali-automaton commented Mar 25, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jantonguirao Mar 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

JanuszL commented Apr 7, 2025

Uh oh!

dali-automaton commented Apr 7, 2025

Uh oh!

dali-automaton commented Apr 8, 2025

Uh oh!

Uh oh!

Uh oh!

jantonguirao Mar 31, 2025 •

edited

Loading