Etaler
Optimize data transfer between OpenCL devices.
Currently, copying data between two OpenCL backends is done by:
- Allocating a temporary host buffer
- Copying data from GPU1 to the buffer
- Copying data from the buffer to GPU2
- Releasing the buffer

This is slow. There are more optimized routes, but the mechanism to trigger them is yet to be determined.
Sol 1: Using clEnqueueMapBuffer
- Map the memory from GPU1 to CPU (a pre-pinned DMA transfer)
- Copy data from buffer to GPU2
- Unmap the buffer
Sol 2: Use OpenCL 2.0's Shared Virtual Memory. Host memory is never touched, so this should be very fast.
- Allocate Tensors as SVM buffers
- Ask GPU2 to copy data from GPU1
This should make multi-GPU setups faster.
Sol 3:
- Make a copy of tensor on GPU1 (clCreateBuffer && clEnqueueCopyBuffer)
- Migrate the buffer (clEnqueueMigrateMemObjects) from GPU1 to GPU2

But it is still suboptimal that we need an extra copy of the buffer on GPU1.
Apparently Nvidia does have some OpenCL 2.0 support: https://streamhpc.com/blog/2017-02-22/nvidia-enables-opencl-2-0-beta-support/
It seems I can build OpenCL 2.0 code before grabbing myself a Navi card.