Skip to content

Commit 7dc40eb

Browse files
committed
add Kokkos::fence at the beginning of profiling regions to fix profiling
issues on GPUs
1 parent 1be4115 commit 7dc40eb

1 file changed

Lines changed: 1 addition & 0 deletions

File tree

src/global.cpp

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -66,6 +66,7 @@ int initialize() {
6666
void pushRegion(const std::string& kName) {
6767
Kokkos::Profiling::pushRegion(kName);
6868
if(prof.perfEnabled) {
69+
Kokkos::fence();
6970
prof.currentRegion = prof.currentRegion->GetChild(kName);
7071
prof.currentRegion->Start();
7172
}

0 commit comments

Comments
 (0)