Open
Description
Hi,
I have a question about the performance between host_memory_bandwidth and host_memory_bandwidth_xrt.
It seems that they have same kernel and similar host code, but the performance results are quite different.
For buffer size 32.00 KB example,
OpenCL host example is
Concurrent Read and Write Throughput = 12.1802 (GB/sec) for buffer size 32.00 KB
Read Throughput = 8.17289 (GB/sec) for buffer size 32.00 KB
Write Throughput = 8.26587 (GB/sec) for buffer size 32.00 KB
,while XRT host example is
Concurrent Read and Write Throughput = 12.4713 (GB/sec) for buffer size 32.00 KB
Read Throughput = 0.00397313 (GB/sec) for buffer size 32.00 KB
Write Throughput = 0.0040426 (GB/sec) for buffer size 32.00 KB
What is the main factor contributing to this difference?
Do XRT and OpenCL affect the performance significantly?
Metadata
Metadata
Assignees
Labels
No labels