FastMaskRCNN

Has anyone experienced a memory leak while training?

Open realwecan opened this issue 7 years ago • 5 comments

It seems to me that there may be a memory leak during training: it eventually fills up all available system RAM (not GPU memory) and causes the system to hang.
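To confirm that host RAM (rather than GPU memory) is what grows, one option is to log the resident set size of the training process every few hundred steps. The following is only a minimal sketch, assuming Linux (it reads `/proc/self/statm` and otherwise falls back to the peak RSS reported by `resource`). The helper names `current_rss_mb` and `looks_like_leak` and the 100 MiB threshold are illustrative assumptions, not part of FastMaskRCNN.

```python
import os
import resource


def current_rss_mb():
    """Return this process's resident set size in MiB.

    Reads /proc/self/statm on Linux. If that is unavailable, falls back to
    the peak RSS from getrusage (reported in KiB on Linux).
    """
    try:
        with open("/proc/self/statm") as f:
            resident_pages = int(f.read().split()[1])
        return resident_pages * os.sysconf("SC_PAGE_SIZE") / 2**20
    except (OSError, ValueError, IndexError):
        return resource.getrusage(resource.RUSAGE_SELF).ru_maxrss / 1024


def looks_like_leak(samples_mb, min_growth_mb=100.0):
    """Crude heuristic: did memory grow by at least min_growth_mb overall?"""
    if len(samples_mb) < 2:
        return False
    return samples_mb[-1] - samples_mb[0] >= min_growth_mb


# Hypothetical usage inside a training loop:
#
# samples = []
# for step in range(max_steps):
#     run_one_training_step()          # placeholder for the real step
#     if step % 500 == 0:
#         samples.append(current_rss_mb())
#         print("step %d: RSS %.1f MiB" % (step, samples[-1]))
# print("possible leak:", looks_like_leak(samples))
```

A steady climb across many samples, with no plateau after the first few hundred steps, would be consistent with the behavior described above. A single jump at startup usually is not.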

realwecan avatar Jul 19 '17 02:07 realwecan

Hello, check whether multiple Sessions are open. One more question: have you finished the evaluation code?

HuangBo-Terraloupe avatar Jul 19 '17 09:07 HuangBo-Terraloupe

@HuangBo-Terraloupe Check this one out: https://github.com/souryuu/FastMaskRCNN/tree/mask_visualization

realwecan avatar Jul 19 '17 13:07 realwecan

That is not proper evaluation code. I only used it to visualize the mask network output and the ground truth while training. It is not for evaluating the final masks at test time.

souryuu avatar Jul 20 '17 00:07 souryuu

For those who still have the memory leak problem, I found that using the tcmalloc allocator solved the issue. https://stackoverflow.com/documentation/tensorflow/3883/how-to-debug-a-memory-leak-in-tensorflow#t=201612280142239281993
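One way to try tcmalloc without touching the training code is to preload the library when launching the training process. The sketch below assumes Linux with tcmalloc (from gperftools) already installed. The library path and the `train.py` script name are assumptions that will differ between systems, and `env_with_preload` and `run_training` are hypothetical helper names.

```python
import os
import subprocess
import sys

# Assumed install location; check your distribution for the actual path.
TCMALLOC_PATH = "/usr/lib/libtcmalloc.so.4"


def env_with_preload(lib_path, base_env=None):
    """Return a copy of the environment with lib_path prepended to LD_PRELOAD."""
    env = dict(os.environ if base_env is None else base_env)
    existing = env.get("LD_PRELOAD", "")
    # LD_PRELOAD accepts a colon-separated list of shared objects.
    env["LD_PRELOAD"] = lib_path if not existing else lib_path + ":" + existing
    return env


def run_training(script="train.py", lib_path=TCMALLOC_PATH):
    """Launch the (placeholder) training script with tcmalloc preloaded."""
    if not os.path.exists(lib_path):
        raise FileNotFoundError("tcmalloc not found at " + lib_path)
    return subprocess.call([sys.executable, script],
                           env=env_with_preload(lib_path))


# Hypothetical usage:
# exit_code = run_training("train.py")
```

This has the same effect as setting `LD_PRELOAD` in the shell before starting Python. Wrapping it in a launcher just keeps the setting next to the command it applies to.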

souryuu avatar Sep 26 '17 08:09 souryuu

Thanks @souryuu, could you please share your progress on the experiments? I see that you have obtained better performance with your code. Any chance we are getting closer to the results in the Mask RCNN paper? Thanks!

realwecan avatar Sep 26 '17 10:09 realwecan