
Core dump failures

Open ooyanglinoo opened this issue 8 years ago • 3 comments

If the solver config file contains mistakes, the cluster does not fail quickly. Instead, after a long time, it returns core dumps. How can I solve this problem?

ooyanglinoo avatar Nov 30 '17 03:11 ooyanglinoo

Fix the solver prototxt file, I suppose.

junshi15 avatar Nov 30 '17 05:11 junshi15

You could run the solver file on the single-node version first, i.e. BVLC Caffe. Of course, you need to change the network prototxt file accordingly (switch out the data layer, etc.). If the single-node version works, then you can try the grid version (switch the data layer back in, etc.).
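The single-node check can be scripted so that a bad solver file is reported within seconds instead of surfacing later as a core dump on the cluster. This is only a minimal sketch: the helper name `preflight`, the timeout value, and the file name `my_solver.prototxt` are placeholders, and treating "still running after the timeout" as success is an assumption that startup got past config parsing.

```python
import subprocess
import sys


def preflight(cmd, timeout_s=60):
    """Run cmd locally and return (ok, reason).

    On POSIX, a negative return code means the process was killed by a
    signal (for example SIGSEGV or SIGABRT, which typically produce a
    core dump).
    """
    try:
        proc = subprocess.run(
            cmd,
            stdout=subprocess.PIPE,
            stderr=subprocess.STDOUT,
            timeout=timeout_s,
        )
    except subprocess.TimeoutExpired:
        # Assumption: if the process survives this long, the solver file
        # was accepted and training started. The child is killed here.
        return True, "still running after timeout"
    except FileNotFoundError:
        return False, "command not found: " + cmd[0]

    if proc.returncode < 0:
        return False, "killed by signal %d" % -proc.returncode
    if proc.returncode != 0:
        return False, "exit code %d" % proc.returncode
    return True, "exited cleanly"


# Example with the BVLC Caffe command-line tool (placeholder file name):
#   ok, reason = preflight(["caffe", "train", "--solver=my_solver.prototxt"])
#   if not ok:
#       sys.exit("solver check failed: " + reason)
```

Only submit the job to the grid once this local check passes.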

junshi15 avatar Nov 30 '17 07:11 junshi15

Is it possible to solve the core dump problem by changing the CaffeProcessor code in CaffeOnSpark?

ooyanglinoo avatar Dec 18 '17 08:12 ooyanglinoo