awesome-semantic-segmentation-pytorch

interrupt multi-GPU training

Open HSMung opened this issue 6 years ago • 3 comments

When I interrupt multi-GPU training, several zombie processes are sometimes left behind. How can I avoid this?

HSMung avatar Oct 03 '19 10:10 HSMung

Which model and backbone are you using for multi-gpu training?

pyradd avatar Oct 08 '19 09:10 pyradd

Which model and backbone are you using for multi-gpu training?

Any of them.

HSMung avatar Oct 20 '19 10:10 HSMung

For the time being, I don't have a workaround. However, the first process is usually the main process. If you kill that one, the other zombie processes seem to die as well.

pyradd avatar Oct 22 '19 11:10 pyradd
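
One way to act on the suggestion above without hunting for PIDs by hand is to start the training command in its own process group and signal the whole group on interrupt. The following is a minimal sketch under stated assumptions, not something this repository ships: the helper names `launch_in_own_group`, `kill_group`, and `run_and_cleanup` are hypothetical, the calls are standard-library only, and `os.killpg` is POSIX-only, so this will not work on Windows.

```python
import contextlib
import os
import signal
import subprocess


def launch_in_own_group(cmd):
    """Start cmd as the leader of a new session (hypothetical helper).

    With start_new_session=True the child becomes a session and process-group
    leader, so its process-group ID equals its PID and any workers it spawns
    inherit that group unless they change it themselves.
    """
    return subprocess.Popen(cmd, start_new_session=True)


def kill_group(proc, grace_seconds=5.0):
    """Send SIGTERM to the whole group, then SIGKILL whatever may be left."""
    pgid = proc.pid  # equal to the PID because the child leads its own session
    with contextlib.suppress(ProcessLookupError):
        os.killpg(pgid, signal.SIGTERM)
    try:
        proc.wait(timeout=grace_seconds)
    except subprocess.TimeoutExpired:
        pass
    # Workers that ignored SIGTERM are force-killed; a missing group is fine.
    with contextlib.suppress(ProcessLookupError):
        os.killpg(pgid, signal.SIGKILL)
    proc.wait()


def run_and_cleanup(cmd):
    """Run cmd to completion; on Ctrl-C, tear down the main process and workers."""
    proc = launch_in_own_group(cmd)
    try:
        return proc.wait()
    except KeyboardInterrupt:
        kill_group(proc)
        return proc.returncode
```

Calling `run_and_cleanup` with your usual multi-GPU launch command as an argument list would run it as before. Because the child sits in a separate session, Ctrl-C in the terminal reaches only the wrapper, which then signals the main process and every worker still in its group. Workers that move themselves into a different process group would not be covered by this sketch.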