Task/rajaperf docs #38
docs/13_rajaperf/rajaperf.rst
Outdated
* *Comm* group (directory src/comm)

#. **HALO_EXCHANGE_FUSED** packing and unpacking MPI message buffers for point-to-point distributed memory halo data exchange for mesh-based codes *(overhead of launching many small kernels, GPU variants use RAJA::Workgroup concepts to execute multiple kernels with one launch)*
Suggested change:
- #. **HALO_EXCHANGE_FUSED** packing and unpacking MPI message buffers for point-to-point distributed memory halo data exchange for mesh-based codes *(overhead of launching many small kernels, GPU variants use RAJA::Workgroup concepts to execute multiple kernels with one launch)*
+ #. **HALO_PACKING_FUSED** packing and unpacking MPI message buffers for point-to-point distributed memory halo data exchange for mesh-based codes *(overhead of launching many small kernels, GPU variants use RAJA::Workgroup concepts to execute multiple kernels with one launch)*
Changed. Thanks for that. I forgot about it.
docs/13_rajaperf/rajaperf.rst
Outdated
depends on the problem size run for the kernel; thus, each checksum is
computed at run time. Validation criteria is defined in terms of the checksum
difference between each kernel variant and problem size run and a corresponding
reference variant. Typically, the ``Base_Seq`` variant is used to define the
Suggested change:
- reference variant. Typically, the ``Base_Seq`` variant is used to define the
+ reference variant. The ``Base_Seq`` variant is used to define the
Whether the checksum for each kernel is considered to be within its expected
tolerance is reported as checksum ``PASSED`` or ``FAILED`` in the output files.
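
The pass/fail rule quoted above can be sketched as follows. This is only an illustration of the idea, not the suite's implementation: the function name `checksum_status`, the use of an absolute difference, the tolerance value, and all checksum numbers are assumptions made for the example. Only the ``Base_Seq`` reference role and the ``PASSED``/``FAILED`` strings come from the text.

```python
def checksum_status(variant_checksum, reference_checksum, tolerance):
    """Return "PASSED" if the variant checksum is within `tolerance`
    of the reference checksum, otherwise "FAILED".

    Assumption: the difference is measured as an absolute difference.
    """
    diff = abs(variant_checksum - reference_checksum)
    return "PASSED" if diff <= tolerance else "FAILED"


# Hypothetical checksums for one kernel at one problem size.
checksums = {
    "Base_Seq": 1234.5678901234,   # reference variant
    "RAJA_Seq": 1234.5678901235,   # tiny difference from the reference
    "Base_OpenMP": 1234.5700000000,  # larger difference
}
tolerance = 1e-6  # hypothetical tolerance

reference = checksums["Base_Seq"]
for variant, value in checksums.items():
    print(variant, checksum_status(value, reference, tolerance))
```

Because the checksum depends on the problem size, the reference value has to come from a run at the same problem size as the variant being checked.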
**Show an example of this for the EL Capitan baseline runs!!**
Reminder to add more accurate Base_Seq summation tunings (left fold is inaccurate for large problem sizes).
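
For context on why a left fold loses accuracy, here is a small sketch in Python rather than the suite's C++. It compares a plain left-to-right sum with Kahan compensated summation. Kahan is used here only as one well-known more accurate alternative; it is an assumption for illustration, not necessarily what the planned Base_Seq tunings will use. The function names and input data are hypothetical.

```python
import math


def left_fold_sum(values):
    """Plain left-to-right accumulation into a single running total."""
    total = 0.0
    for v in values:
        total += v
    return total


def kahan_sum(values):
    """Kahan compensated summation: carries the rounding error of each
    addition in `comp` and feeds it back into the next term."""
    total = 0.0
    comp = 0.0
    for v in values:
        y = v - comp            # apply the correction from the previous step
        t = total + y           # low-order bits of y may be lost here
        comp = (t - total) - y  # recover what was lost
        total = t
    return total


# One large value followed by many values too small to change it one at a time.
values = [1.0] + [1e-16] * 100_000
reference = math.fsum(values)  # accurately rounded sum from the standard library

print(left_fold_sum(values))   # 1.0: every small term is rounded away
print(kahan_sum(values))
print(reference)
```

The left fold drops every small term because each one is below half the spacing between 1.0 and the next representable double. The same effect grows with the number of terms, which is why it shows up at large problem sizes.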
We are working to finalize the last bits about generating the baselines and how the FOM quantities are calculated. So there is more to come.
Closes #31