Vitis_Accel_Examples icon indicating copy to clipboard operation
Vitis_Accel_Examples copied to clipboard

Curiosity of performance between host_memory_bandwidth and host_memory_bandwidth_xrt

Open anonymous1782 opened this issue 2 years ago • 2 comments

Hi,

I have a question about the performance between host_memory_bandwidth and host_memory_bandwidth_xrt.

It seems that they have same kernel and similar host code, but the performance results are quite different.

For buffer size 32.00 KB example,

OpenCL host example is

Concurrent Read and Write Throughput = 12.1802 (GB/sec) for buffer size 32.00 KB
Read Throughput = 8.17289 (GB/sec) for buffer size 32.00 KB
Write Throughput = 8.26587 (GB/sec) for buffer size 32.00 KB

,while XRT host example is

Concurrent Read and Write Throughput = 12.4713 (GB/sec) for buffer size 32.00 KB
Read Throughput = 0.00397313 (GB/sec) for buffer size 32.00 KB
Write Throughput = 0.0040426 (GB/sec) for buffer size 32.00 KB

What is the main factor contributing to this difference? Do XRT and OpenCL affect the performance significantly?

anonymous1782 avatar Aug 09 '22 04:08 anonymous1782

Hi @anonymous1782 ,

Can you please share the Vitis and XRT version that are being used along with Platform that is targetted.

Thanks, Vishnu

vishnuchebrolu avatar Aug 10 '22 06:08 vishnuchebrolu

Hi @vishnuchebrolu

Hmm.. both results shown in the post are not my own. The results are from the README.md file in this repository. (OpenCL, XRT)

By the way, I'm using vitis v2021.2 and xrt v 2.12.427.

Thanks,

anonymous1782 avatar Aug 10 '22 09:08 anonymous1782