Open
Description
We've had some performance issues that were fixed by rolling back that change. On a HIP system running on a single GPU the performance of GPU kernels was about a factor of 30 slower when we used tracing in the code. Interestingly the code ran with its normal performance as soon as the Legion profiler was switched on. I wonder if it has any side effect that could cause this.
Originally posted by @tukss in #27 (comment)
Metadata
Metadata
Assignees
Labels
No labels