Fabian Tschopp
Fabian Tschopp
@zif520 Did you change int_tp and int_tpc both to 32bit types for both the OpenCL and C++ part of the code? https://github.com/naibaf7/caffe/blob/master/include/caffe/definitions.hpp and https://github.com/naibaf7/caffe/blob/master/src/caffe/greentea/cl_headers/header.cl however it might break if you...
@zif520 I am currently making my branch ready for 32 bit indexing again, so that both 64 bit and 32 bit work. Then it should be able to compile and...
@sh1r0 I currently don't have the time for a complete rebase - this has to wait a bit. @zif520 What's the progress? Is it working with my latest updates?
@sh1r0 Ok cool, at least you got it working! Now, what is the runtime error that you get with using the CPU on OpenCL? I use a CPU BLAS with...
@sh1r0 Ok thanks, I'll try to work out what's going wrong. Might it be that the binaries do not call set_mode and SetDevice properly? `ViennaCL: FATAL ERROR: Could not find...
@zif520 Yes, here, it is also important to mention that the device must be set before any solver or network is loaded. Knowledge of which device should be used is...
@sh1r0 I think now you benchmarked the OpenCL GPU twice: ``` Caffe::SetDevices(gpus); Caffe::set_mode(Caffe::GPU); Caffe::SetDevice(gpus[0]); ``` should be either: ``` Caffe::set_mode(Caffe::GPU); Caffe::SetDevice(gpus[0]); ``` or: ``` Caffe::set_mode(Caffe::GPU); Caffe::SetDevices(gpus); Caffe::SelectDevice(gpus[0], false); ``` Besides,...
@bhack Thanks, good to know. However ViennaCL-BLAS seems to have optimization/tuning issues on this as well (which is what we are currently using in this Android-OpenCL experiment). It is a...
@bhack @zif520 @sh1r0 Added ISAAC compile support to CMake and GNU Makefiles on my branch, if anyone fancies to try. It did not speed up on my GT650 or Intel...
@karlrupp Thanks for clarification, even though that is not good news, it indicates that the fault lies with the compiler / OpenCL libraries of the vendors and not with our...