batch_job icon indicating copy to clipboard operation
batch_job copied to clipboard

timeout not working (non deterministic)?

Open spotlightgit opened this issue 4 years ago • 1 comments

Hey Oliver,

with this kind of function call I have some trouble with your toolbox: batch_job_distrib(goal_function, x, worker, additional_data, '-chunk_lims', [1 1], '-timeout', timeout); In the past there were no issues without using the timeout option. Now I want to exclude the Master Matlab from number crunching, therefore I use the timeout option (like you suggested). Unfortunately it happens sometimes that all workers are closed/finished but a single mat-file still has a file lock (for example: chunk000001.mat.lock) and the Master Matlab waits endless and no timeout is applied. If I "finish" this situation with Ctrl + c at the command line of the Master Matlab the following error appears:

Please wait while the workers are halted.
Operation terminated by user during batch_job_collect (line 78)


In batch_job_distrib (line 201)
    output = batch_job_collect(s, co);
...

Do you have any idea what's going wrong or suggestions what I can do?

spotlightgit avatar Jun 01 '20 14:06 spotlightgit

Thanks. This sounds like a bug. I just pushed a change which seems to work for me. Please test and let me know.

ojwoodford avatar Jun 08 '20 17:06 ojwoodford