-
Notifications
You must be signed in to change notification settings - Fork 13
Open
Description
I noticed very high latencies for kernel dispatches using AQL. Synchronous dispatches take up to 21 µs. Asynchronous (batch) dispatches help to hide latencies. However, kernel dispatching still takes 6 µs (in average), which is still far to slow for fine-grained offloading.
In my experiments I set HSA_ENABLE_INTERRUPT to 0, which greatly improves robustness of the kernel offload times. With interrupts enabled, latencies vary from 6 to 15 microseconds.
System setup:
- Kaveri APU (no dGPU)
- Kernel 4.0.0-100002-generic #201511031149 SMP
- kfd-v1.6.1 (7fb04c4 from git repo HSA-Drivers-Linux-AMD)
- HSA-Runtime 1.0.3 (fa0ef7e from git repo HSA-Runtime-AMD)
- CL offline compiler (CLOC) v0.9.8
Metadata
Metadata
Assignees
Labels
No labels