#10: Add more benchmarks #11

cwschilly · 2025-06-06T17:41:09Z

Fixes #10
Fixes #12 (wip)

Adds both double and Kokkos::complex<double> benchmarks for:

Level 1, 2, and 3 BLAS kernels
DPOTRF

Driver Changes

Instead of passing the dimensions of the matrices used for the benchmarks, specify the number of floating-point operations that each benchmark should perform:

./slownode <iters> <flops>

Each benchmark then chooses its matrix/vector dimensions so that the operation performs approximately this number of floating-point operations.
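As a rough illustration of how a flop target can be turned into a problem size, here is a minimal Python sketch. It is not the driver's actual logic: the function name `dims_for_flops`, the choice of representative kernels (dot, matrix-vector, matrix-matrix), and the leading-order real-valued flop counts (2N, 2N^2, 2N^3, and N^3/3 for Cholesky) are all assumptions made for the example. Complex arithmetic would cost more flops per element and is not modeled here.

```python
import math


def dims_for_flops(benchmark: str, flops: float) -> int:
    """Return a dimension N whose leading-order flop count is close to `flops`.

    Hypothetical helper: the kernels and flop formulas below are textbook
    approximations assumed for illustration, not taken from the driver.
    """
    if flops <= 0:
        raise ValueError("flops must be positive")

    if benchmark == "level1":
        # Dot product of two length-N vectors: about 2N flops.
        n = flops / 2
    elif benchmark == "level2":
        # N x N matrix times length-N vector: about 2N^2 flops.
        n = math.sqrt(flops / 2)
    elif benchmark == "level3":
        # N x N matrix times N x N matrix: about 2N^3 flops.
        n = (flops / 2) ** (1.0 / 3.0)
    elif benchmark == "dpotrf":
        # Cholesky factorization of an N x N matrix: about N^3 / 3 flops.
        n = (3 * flops) ** (1.0 / 3.0)
    else:
        raise ValueError(f"unknown benchmark: {benchmark}")

    # Round to the nearest integer size, never going below 1.
    return max(1, round(n))


if __name__ == "__main__":
    for name in ("level1", "level2", "level3", "dpotrf"):
        print(name, dims_for_flops(name, 2e9))
```

With a fixed flop budget, the level 1 vector ends up far longer than the side of a level 3 matrix, which is the point of specifying work rather than dimensions.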

Detection Script Changes

When running the detection script, specify which benchmark you want to analyze with the following options:

-b (--benchmark): [level1, level2, level3, dpotrf]
-d (--datatype):  [double, complex]

If these flags are not set, the script defaults to level3 and double.
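The two options and their defaults could be declared along these lines. This is only a sketch using Python's standard `argparse`; the function name `build_parser` is hypothetical, and the real detection script's parser is not shown in this PR description and may define additional arguments.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with the two options described above (illustrative only)."""
    parser = argparse.ArgumentParser(
        description="Select which benchmark results to analyze."
    )
    parser.add_argument(
        "-b", "--benchmark",
        choices=["level1", "level2", "level3", "dpotrf"],
        default="level3",  # default stated in the PR description
        help="benchmark to analyze",
    )
    parser.add_argument(
        "-d", "--datatype",
        choices=["double", "complex"],
        default="double",  # default stated in the PR description
        help="data type of the benchmark to analyze",
    )
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args.benchmark, args.datatype)
```

Using `choices` makes the parser reject any benchmark or datatype name outside the listed values instead of silently falling through.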


nlslatt · 2025-06-11T21:22:06Z

The final value on some gather lines seems highly suspicious.

gather: 95 (nodename): 1.25574: breakdown: 0.125147 0.132657 0.125616 0.131049 0.124445 0.124678 0.125 0.124691 0.126191 0.116261 3.16202e-322

gather: 91 (nodename): 2.6874e-05: breakdown: 2.507e-06 2.253e-06 2.24e-06 2.489e-06 2.552e-06 2.523e-06 4.249e-06 2.522e-06 2.689e-06 2.85e-06 1.83819
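In both lines above, the first ten breakdown values sum to the reported total, and the eleventh value is off by many orders of magnitude. A check along the following lines could flag such entries automatically. This is a minimal sketch: the line layout is inferred only from the two samples above, and the names `parse_gather` and `suspicious_indices` and the factor-of-100 threshold are assumptions for illustration, not part of the detection script.

```python
import re
from statistics import median

# Layout inferred from the sample lines:
#   gather: <rank> (<node>): <total>: breakdown: <v1> <v2> ...
GATHER_RE = re.compile(
    r"gather:\s*(\d+)\s*\(([^)]*)\):\s*([^:\s]+):\s*breakdown:\s*(.*)"
)


def parse_gather(line: str):
    """Split one gather line into (rank, node, total, breakdown values)."""
    match = GATHER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"not a gather line: {line!r}")
    rank, node, total, rest = match.groups()
    return int(rank), node, float(total), [float(x) for x in rest.split()]


def suspicious_indices(values, factor: float = 100.0):
    """Return indices of values more than `factor` away from the median."""
    mid = median(values)
    return [
        i for i, v in enumerate(values)
        if v > mid * factor or v < mid / factor
    ]


if __name__ == "__main__":
    sample = (
        "gather: 95 (nodename): 1.25574: breakdown: 0.125147 0.132657 "
        "0.125616 0.131049 0.124445 0.124678 0.125 0.124691 0.126191 "
        "0.116261 3.16202e-322"
    )
    rank, node, total, values = parse_gather(sample)
    print(rank, node, total, suspicious_indices(values))
```

The median is used as the reference point because a single extreme entry would distort a mean-based threshold.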

nlslatt · 2025-06-11T21:39:55Z

We also need to be able to specify the $N$ for different BLAS levels independently. I measure 83 seconds for level 3 complex and 1e-5 seconds for level 1 double.

cwschilly · 2025-06-12T13:05:02Z

Thanks @nlslatt, these issues should be fixed now.

cwschilly · 2025-06-16T14:13:14Z

Converting to draft while I work on #12

#10: wip: initial commit for adding benchmarks

e3efcaf

cwschilly linked an issue Jun 6, 2025 that may be closed by this pull request

Add more benchmarks #10

Open

cwschilly added 5 commits June 9, 2025 06:28

#10: wip: add dpotrf benchmark and refactor to run sensors before and after each benchmark

58d263c

#10: wip: call sensors before/after all benchmarks

ff1fc3d

#10: wip: small fixes

111a172

#10: fix remaining bugs in driver; update detection script and unit tests

1125ab6

#10: cleanup

6b798d6

cwschilly marked this pull request as ready for review June 10, 2025 13:51

cwschilly requested review from lifflander and nlslatt June 10, 2025 15:39

cwschilly added 2 commits June 12, 2025 08:23

#10: allow for different N for each benchmark

b657cac

#10: fix error with last iteration timing

0b1765d

cwschilly added 2 commits June 12, 2025 14:16

#10: allow user to set all matrix sizes for all benchmarks

04dad3b

#10: fix handling of defaults

b19da0d

cwschilly marked this pull request as draft June 16, 2025 14:13

cwschilly added 3 commits June 16, 2025 08:15

#10: use flops as input instead of dimensions

c92dfdd

#10: fixes to new logic

51b81b5

#10: use LAPACK dpotrf and other fixes

f03406c

cwschilly marked this pull request as ready for review July 22, 2025 17:19


3 participants
