Include host activity of benched fn in CPU time when blocking kernel is used
Based on the findings of #249, m_cpu_timer.start() is now called from the
kernel_launcher_timer.start() method.
Previously it was called from kernel_launcher_timer.stop(), just before the
unblock_stream() call, with the intention of homing in on the time taken to
execute GPU work, but this excluded any host work performed by the benched
function from the CPU time.
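
The effect of moving the call can be illustrated with a minimal sketch. It is not the project's code: `fake_clock`, `cpu_timer`, `cpu_start_policy`, `measure_cpu_time_ms`, and the struct layout are hypothetical stand-ins, and the stream blocking and unblocking is reduced to comments. Only the ordering of the `start()` and `stop()` calls is taken from the description above.

```cpp
#include <cassert>

// Hypothetical deterministic clock so the sketch needs no GPU or real timing.
struct fake_clock {
  long long now_ms = 0;
  void advance(long long ms) { now_ms += ms; }
};

// Hypothetical stand-in for the CPU timer member (m_cpu_timer in the message).
struct cpu_timer {
  long long start_ms = 0;
  long long stop_ms = 0;
  void start(const fake_clock &c) { start_ms = c.now_ms; }
  void stop(const fake_clock &c) { stop_ms = c.now_ms; }
  long long duration_ms() const { return stop_ms - start_ms; }
};

// Where the CPU timer is started: the new behavior vs. the previous one.
enum class cpu_start_policy { at_launcher_start, at_launcher_stop };

// Simplified model of a launcher timer used with a blocking kernel.
struct kernel_launcher_timer {
  fake_clock &clock;
  cpu_timer &cpu;
  cpu_start_policy policy;
  long long gpu_ms; // assumed duration of the GPU work once unblocked

  void start() {
    // Assumption: the stream is blocked here so GPU work only queues up.
    if (policy == cpu_start_policy::at_launcher_start) {
      cpu.start(clock); // new behavior: host work is inside the CPU interval
    }
  }

  void stop() {
    if (policy == cpu_start_policy::at_launcher_stop) {
      cpu.start(clock); // previous behavior: just before unblocking the stream
    }
    // Assumption: the stream is unblocked here, then the GPU work runs.
    clock.advance(gpu_ms);
    cpu.stop(clock);
  }
};

// Runs one simulated measurement and returns the reported CPU time.
long long measure_cpu_time_ms(cpu_start_policy policy, long long host_ms,
                              long long gpu_ms) {
  fake_clock clock;
  cpu_timer cpu;
  kernel_launcher_timer timer{clock, cpu, policy, gpu_ms};
  timer.start();
  clock.advance(host_ms); // host work done by the benched function
  timer.stop();
  return cpu.duration_ms();
}
```

With 2 ms of simulated host work and 3 ms of simulated GPU work, the previous placement yields a CPU time that covers only the GPU portion, while the new placement also covers the host portion.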