Fabian Tschopp
@bhack Yup, this link is exactly what needs to be added to the device initialization in the OpenCL backend. :) You see, this is also a reason why I want...
And also APU systems like AMD HSA Kaveri and Intel Broadwell, as this PDF points out, yup.
I'd need to know if there is a pattern on the data index: `(top_id, top_data_id, blob_id, feat_id)=0,0,0,0;` Can you find that out? Or just post some more index + values...
@psyhtest Yes. I am currently checking whether any parts of the code/kernels are obviously problematic on these devices. After that I would like to do actual tests...
@dagamayank Thank you, help is very welcome, especially from AMD :) To start, you can have a look at how the kernels are generated and the public interface of the...
@bhack Yes, why? :)
@dagamayank Ok, the easiest way to get started is to compile Caffe with USE_LIBDNN turned on in Makefile.config (https://github.com/naibaf7/caffe/blob/master/Makefile.config.example#L15). Then, if you want to get a kernel string...
@dagamayank TestSharedWeightsUpdate seems to fail by being off by a small margin. This is weird but can be ignored and is not relevant for this implementation. The _Spatial failures are...
@dagamayank I just verified on my W9100 that the backward pass is fine. What driver are you using? I'm using 15.302 (Crimson Edition 15.12 Linux 64 bit). I had problems...
@dagamayank The defines set constants for the kernel, such as padding (v_p), stride (v_s), dilation (v_d) and image sizes (v_imsi, v_imso) in each dimension. Other defines are for the...
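
To illustrate how such per-dimension constants relate to each other, here is a minimal Python sketch. It is not the actual LibDNN generator: the helper names `conv_output_size` and `kernel_defines`, the `_<dim>` suffix on each define, and the kernel-size argument are assumptions made for this example. It only uses the standard convolution output-size formula to derive the output image size (v_imso) from the input image size (v_imsi), padding, stride and dilation.

```python
def conv_output_size(in_size, kernel, pad, stride, dilation):
    """Standard convolution output size for one spatial dimension."""
    # A dilated kernel covers dilation * (kernel - 1) + 1 input pixels.
    effective_kernel = dilation * (kernel - 1) + 1
    return (in_size + 2 * pad - effective_kernel) // stride + 1


def kernel_defines(in_sizes, kernels, pads, strides, dilations):
    """Build a block of '#define' lines, one set per spatial dimension.

    The 'v_*' base names come from the message above; the '_<dim>'
    suffix is a hypothetical naming scheme for this sketch only.
    """
    lines = []
    dims = zip(in_sizes, kernels, pads, strides, dilations)
    for d, (in_size, kernel, pad, stride, dilation) in enumerate(dims):
        out_size = conv_output_size(in_size, kernel, pad, stride, dilation)
        lines.append(f"#define v_p_{d} {pad}")          # padding
        lines.append(f"#define v_s_{d} {stride}")       # stride
        lines.append(f"#define v_d_{d} {dilation}")     # dilation
        lines.append(f"#define v_imsi_{d} {in_size}")   # input image size
        lines.append(f"#define v_imso_{d} {out_size}")  # output image size
    return "\n".join(lines)


if __name__ == "__main__":
    # Example: 227x227 input, 11x11 kernel, no padding, stride 4.
    print(kernel_defines([227, 227], [11, 11], [0, 0], [4, 4], [1, 1]))
```

Baking these values in as compile-time constants, rather than passing them as kernel arguments, lets the OpenCL compiler fold the index arithmetic and unroll loops for one specific layer shape.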