Skip to content

Conversation

@rhornung67
Copy link
Member

@rhornung67 rhornung67 commented Jan 16, 2026

We are working to finalize the last bits about generating the baselines and how the FOM quantities are calculated. So there is more to come.

Closes #31

@pearce8 pearce8 mentioned this pull request Jan 16, 2026
3 tasks

* *Comm* group (directory src/comm)

#. **HALO_EXCHANGE_FUSED** packing and unpacking MPI message buffers for point-to-point distributed memory halo data exchange for mesh-based codes *(overhead of launching many small kernels, GPU variants use RAJA::Workgroup concepts to execute multiple kernels with one launch)*
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
#. **HALO_EXCHANGE_FUSED** packing and unpacking MPI message buffers for point-to-point distributed memory halo data exchange for mesh-based codes *(overhead of launching many small kernels, GPU variants use RAJA::Workgroup concepts to execute multiple kernels with one launch)*
#. **HALO_PACKING_FUSED** packing and unpacking MPI message buffers for point-to-point distributed memory halo data exchange for mesh-based codes *(overhead of launching many small kernels, GPU variants use RAJA::Workgroup concepts to execute multiple kernels with one launch)*

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Changed. Thanks for that. I forgot about it.

depends on the problem size run for the kernel; thus, each checksum is
computed at run time. Validation criteria is defined in terms of the checksum
difference between each kernel variant and problem size run and a corresponding
reference variant. Typically, the ``Base_Seq`` variant is used to define the
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
reference variant. Typically, the ``Base_Seq`` variant is used to define the
reference variant. The ``Base_Seq`` variant is used to define the

Whether the checksum for each kernel is considered to be within its expected
tolerance is reported as checksum ``PASSED`` or ``FAILED`` in the output files.

**Show an example of this for the EL Capitan baseline runs!!**
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Reminder to add more accurate Base_Seq summation tunings (left fold is inaccurate for large problem sizes).

@pearce8 pearce8 merged commit a84a30f into develop Jan 18, 2026
2 checks passed
@pearce8 pearce8 deleted the task/rajaperf-docs branch January 18, 2026 23:28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Write RAJAPerf docs

4 participants