Timeout threshold might be too low?

Open mbanani opened this issue 7 years ago • 5 comments

I keep getting this error:

Traceback (most recent call last):
  File "train_viewpoint.py", line 495, in <module>
    main(args)
  File "train_viewpoint.py", line 23, in main
    cc = CrayonClient("focus.eecs.umich.edu")
  File "/z/home/mbanani/PyTorch2Env/local/lib/python2.7/site-packages/pycrayon/crayon.py", line 29, in __init__
    " Server sent: {}.".format(r.text))
RuntimeError: Something went wrong! Server sent: Server: TensorBoard failed to answer request 'logdir'. Done

I think the reason is that the timeout specified here is too low for people who train on clusters and connect to a Docker container running on their personal machine. Maybe consider increasing the timeout?
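One way to test the timeout hypothesis is to time a plain TCP connection from the cluster node to the machine hosting the server. The sketch below is a rough check under a few assumptions: it uses Python 3 and only the standard library, `connect_latency` is a hypothetical helper name, and the port in the usage comment (8889) is assumed to be the one the client connects to and may differ in your setup.

```python
import socket
import time


def connect_latency(host, port, timeout=10.0):
    """Return the seconds needed to open a TCP connection, or None on failure.

    Hypothetical helper, not part of pycrayon. It only measures connection
    setup time; it does not send any request to the server.
    """
    start = time.monotonic()
    try:
        # create_connection raises OSError (including timeouts) on failure.
        with socket.create_connection((host, port), timeout=timeout):
            return time.monotonic() - start
    except OSError:
        return None


# Example usage (host taken from the traceback above, port is an assumption):
#   latency = connect_latency("focus.eecs.umich.edu", 8889)
#   print("unreachable" if latency is None else "%.3f s" % latency)
```

If this returns `None` or a value close to the client's timeout, the network path (or a firewall) is a more likely culprit than the server itself.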

mbanani avatar Sep 25 '17 19:09 mbanani

Issue was caused by something else.

mbanani avatar Sep 25 '17 23:09 mbanani

I have the same error. Could you elaborate a bit more on what was causing this for you?

vakker avatar Oct 16 '17 13:10 vakker

I closed the issue earlier because I thought it was caused by running within a virtual environment, but the problem reappeared and I forgot to reopen it.

I am not really sure what's causing the issue, and I moved to using tensorboard directly, as shown here, which proved simpler to use for my purposes.

As for the error, I found that it was produced more often when a firewall blocked some of the more common network ports or when many experiments were running at the same time. It would still run from time to time though, which I found pretty weird, so my guess is that it's caused by the network taking too long to respond. Sorry this isn't more helpful.
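If slow network responses really are the cause, retrying the client construction with a growing delay may be less invasive than editing the library. This is only a sketch: `retry` is a hypothetical helper, not something pycrayon provides, and it assumes the failure surfaces as the `RuntimeError` seen in the traceback.

```python
import time


def retry(make, attempts=5, base_delay=1.0, retry_on=(RuntimeError,),
          sleep=time.sleep):
    """Call make() until it succeeds or the attempts are used up.

    Waits base_delay, 2 * base_delay, 4 * base_delay, ... seconds between
    attempts and re-raises the last error if every attempt fails.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    last_error = None
    for attempt in range(attempts):
        try:
            return make()
        except retry_on as err:
            last_error = err
            # No point sleeping after the final failed attempt.
            if attempt < attempts - 1:
                sleep(base_delay * 2 ** attempt)
    raise last_error


# Example usage, wrapping the call from the traceback:
#   from pycrayon import CrayonClient
#   cc = retry(lambda: CrayonClient("focus.eecs.umich.edu"))
```

Note that this does not change the client's own timeout; it only gives a flaky connection several chances to succeed.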

mbanani avatar Oct 16 '17 15:10 mbanani

Thanks for the info, I'll investigate a bit further. I'm using Crayon in a Docker container, not sure if it has an effect.

vakker avatar Oct 18 '17 07:10 vakker

I had the same issue; for a quick and dirty workaround I simply commented out https://github.com/torrvision/crayon/blob/master/client/python/pycrayon/crayon.py#L32-L44 and it's working now.

motiwari avatar Mar 13 '18 03:03 motiwari