tensorflow-wavenet icon indicating copy to clipboard operation
tensorflow-wavenet copied to clipboard

train.py keeps being killed

Open Arlen22 opened this issue 8 years ago • 8 comments

When running python train.py on docker tensorflow:1.0.1, the process keeps ending with "Killed".

Arlen22 avatar Jul 15 '17 21:07 Arlen22

Same problem here. tensorflow: 1.2.1; Python 3.6.1; Anaconda 4.4.0. Run on google cloud platform using screen.

allenlu2009 avatar Aug 17 '17 00:08 allenlu2009

I don't have this problem. Tried with tensorflow 1.3 (pip3 install...) and 1.3.1 (compiled from github) always with Python 3.4.2 and no GPU support. Also no Docker or Anaconda.

andimarafioti avatar Sep 28 '17 11:09 andimarafioti

got "Killed" with 2GB DRAM, after I tightly connect my 3 DRAM modules, python3 train.py keeps running with 6GB DRAM.
free -m shows it took ~2.3GB to run wavenet, so it should be safe to run with >3GB DRAM system.

rkuo2000 avatar Nov 08 '17 07:11 rkuo2000

Mmm, the computer I tested this on has a huge amount of ram. It could be the case that you're just trying this without enough ram and that's why it's getting killed.

andimarafioti avatar Nov 08 '17 10:11 andimarafioti

same problem here. any ideas how to solve this?

philszalay avatar Dec 20 '17 13:12 philszalay

make sure you have enough memory (>3GB)

rkuo2000 avatar Dec 20 '17 13:12 rkuo2000

I have expanded the memory in docker settings. Now everything works. Thank you!

philszalay avatar Dec 20 '17 13:12 philszalay

Hi, pleased could you tell me how?

sakulh avatar May 05 '18 16:05 sakulh