Task/rajaperf docs #38

rhornung67 · 2026-01-16T22:21:39Z

We are working to finalize the last bits about generating the baselines and how the FOM quantities are calculated. So there is more to come.

Closes #31

MrBurmark · 2026-01-16T22:55:52Z

docs/13_rajaperf/rajaperf.rst

+
+ * *Comm* group (directory src/comm)
+
+   #. **HALO_EXCHANGE_FUSED** packing and unpacking MPI message buffers for point-to-point distributed memory halo data exchange for mesh-based codes *(overhead of launching many small kernels, GPU variants use RAJA::Workgroup concepts to execute multiple kernels with one launch)* 


Suggested change

#. **HALO_EXCHANGE_FUSED** packing and unpacking MPI message buffers for point-to-point distributed memory halo data exchange for mesh-based codes *(overhead of launching many small kernels, GPU variants use RAJA::Workgroup concepts to execute multiple kernels with one launch)*

#. **HALO_PACKING_FUSED** packing and unpacking MPI message buffers for point-to-point distributed memory halo data exchange for mesh-based codes *(overhead of launching many small kernels, GPU variants use RAJA::Workgroup concepts to execute multiple kernels with one launch)*

Changed. Thanks for that. I forgot about it.

MrBurmark · 2026-01-16T23:00:28Z

docs/13_rajaperf/rajaperf.rst

+depends on the problem size run for the kernel; thus, each checksum is 
+computed at run time. Validation criteria is defined in terms of the checksum
+difference between each kernel variant and problem size run and a corresponding
+reference variant. Typically, the ``Base_Seq`` variant is used to define the


Suggested change

reference variant. Typically, the ``Base_Seq`` variant is used to define the

reference variant. The ``Base_Seq`` variant is used to define the

MrBurmark · 2026-01-16T23:04:49Z

docs/13_rajaperf/rajaperf.rst

+Whether the checksum for each kernel is considered to be within its expected
+tolerance is reported as checksum ``PASSED`` or ``FAILED`` in the output files.
+
+**Show an example of this for the EL Capitan baseline runs!!**


Reminder to add more accurate Base_Seq summation tunings (left fold is inaccurate for large problem sizes).

rhornung67 added 15 commits December 23, 2025 11:42

First cut at some RAJA Perf content

b066561

Merge branch 'develop' into task/rajaperf-docs

241f65f

Fleshing out more description of the Suite and what it does.

08e60a1

Add list of kernels and brief descriptions

f7c43b0

Attempt to describe relevant aspects of each kernel

3792cf6

More cleanup

6a080c6

Attempt to prioritize kernels in terms of importance

d757840

Bold tier levels

cee7fd9

Cleanup kernel lists and descriptions

b6ddf0a

Change kernel priority

53a46cd

More rework kernel section

374a875

Add minimal content to strong/weak scaling sections.

20b87b3

Merge branch 'develop' into task/rajaperf-docs

beb0583

Improve kernel descriptions

b47e400

Fill in more sections

2841f64

rhornung67 requested review from MrBurmark, artv3, pearce8 and rchen20 January 16, 2026 22:21

pearce8 mentioned this pull request Jan 16, 2026

Write RAJAPerf docs #31

Closed

3 tasks

MrBurmark reviewed Jan 16, 2026

View reviewed changes

Change kernel

57e7133

MrBurmark reviewed Jan 16, 2026

View reviewed changes

rhornung67 added 2 commits January 16, 2026 15:17

address review comments

d4cef8b

Fix some links

df88b93

pearce8 added this to the Initial documentation (Sections 1-6) milestone Jan 18, 2026

pearce8 approved these changes Jan 18, 2026

View reviewed changes

pearce8 merged commit a84a30f into develop Jan 18, 2026
2 checks passed

pearce8 deleted the task/rajaperf-docs branch January 18, 2026 23:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Task/rajaperf docs #38

Task/rajaperf docs #38

rhornung67 commented Jan 16, 2026 •

edited by pearce8

Loading

Uh oh!

MrBurmark Jan 16, 2026

Uh oh!

rhornung67 Jan 16, 2026

Uh oh!

MrBurmark Jan 16, 2026

Uh oh!

MrBurmark Jan 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants


		* Comm group (directory src/comm)

		#. HALO_EXCHANGE_FUSED packing and unpacking MPI message buffers for point-to-point distributed memory halo data exchange for mesh-based codes (overhead of launching many small kernels, GPU variants use RAJA::Workgroup concepts to execute multiple kernels with one launch)

	#. HALO_EXCHANGE_FUSED packing and unpacking MPI message buffers for point-to-point distributed memory halo data exchange for mesh-based codes (overhead of launching many small kernels, GPU variants use RAJA::Workgroup concepts to execute multiple kernels with one launch)
	#. HALO_PACKING_FUSED packing and unpacking MPI message buffers for point-to-point distributed memory halo data exchange for mesh-based codes (overhead of launching many small kernels, GPU variants use RAJA::Workgroup concepts to execute multiple kernels with one launch)

	reference variant. Typically, the ``Base_Seq`` variant is used to define the
	reference variant. The ``Base_Seq`` variant is used to define the

Task/rajaperf docs #38

Task/rajaperf docs #38

Conversation

rhornung67 commented Jan 16, 2026 • edited by pearce8 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MrBurmark Jan 16, 2026

Choose a reason for hiding this comment

Uh oh!

rhornung67 Jan 16, 2026

Choose a reason for hiding this comment

Uh oh!

MrBurmark Jan 16, 2026

Choose a reason for hiding this comment

Uh oh!

MrBurmark Jan 16, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

rhornung67 commented Jan 16, 2026 •

edited by pearce8

Loading