Jack
+1. Multi-GPU is used more and more frequently nowadays, but it does not work with Sacred, because there is additional stuff on the command line used to start Python, just like...
@JarnoRFB Is it possible to add this feature to all observers by default? If it's able to recover from a connection interrupt, this would be a good feature to have.
@JarnoRFB Every time I use the queue observer, it doesn't send the metrics properly. In Omniboard, I see the status become dead. Is there anything I'm doing wrong?
@JarnoRFB Never mind, I've figured out my problem. It now works great!
@guy4261 Hi, I'm running into the same issue. Have you resolved it? On my machine I do have /dev/nvidia0 but still get that error. Are you sure your two machines have...
@guy4261 I don't think you need to. Just run export EGL_DEVICE_ID=1 before you run your Python file. However, it still doesn't work in my case; maybe my server has some weird...
@ahundt I fixed it. Kindly let me know if there is any issue.
@hetolin Hi, I ran into the same problem. It looks like the gt_RTs in 201 are not in the right order. What do you think?