Skip to content

Curiosity of performance between host_memory_bandwidth and host_memory_bandwidth_xrt #72

Open
@anonymous1782

Description

@anonymous1782

Hi,

I have a question about the performance between host_memory_bandwidth and host_memory_bandwidth_xrt.

It seems that they have same kernel and similar host code, but the performance results are quite different.

For buffer size 32.00 KB example,

OpenCL host example is

Concurrent Read and Write Throughput = 12.1802 (GB/sec) for buffer size 32.00 KB
Read Throughput = 8.17289 (GB/sec) for buffer size 32.00 KB
Write Throughput = 8.26587 (GB/sec) for buffer size 32.00 KB

,while XRT host example is

Concurrent Read and Write Throughput = 12.4713 (GB/sec) for buffer size 32.00 KB
Read Throughput = 0.00397313 (GB/sec) for buffer size 32.00 KB
Write Throughput = 0.0040426 (GB/sec) for buffer size 32.00 KB

What is the main factor contributing to this difference?
Do XRT and OpenCL affect the performance significantly?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions