XRT icon indicating copy to clipboard operation
XRT copied to clipboard

large overhead for launching kernel

Open doonny opened this issue 11 months ago • 0 comments

I am using the U55c board and vitis 2022.1, I have found that the kernel launching latency is very large (50us~100us), this may cause a big differnece when using the API clGetEventProfilingInfo() to measure kernel execution time.

image

For example, when using clGetEventProfilingInfo() to measure a kernel execution time, I got 762us, as follow: image

But the kernel acutally only runs for 649us as follow: image

The will cause a BW estimation of 350GB/s and 410GB/s, respectively.

Is there anyway to minize the launching overhead for kernels ?

doonny avatar Jan 10 '25 14:01 doonny