Errors when I run train.py

Open phchtl opened this issue 1 year ago • 0 comments

Hello, I've been studying your DL2 paper and code recently, and I have a few questions. My environment is Python 2.7 with TensorFlow-GPU 1.15, and I've installed the Python packages needed to run the code. In parameters.py I changed the training mode to "sl" and set JOB_LEN_PATTERN to "Normal" and JOB_ARRIVAL_PATTERN to "Google_Trace".

When I try to pre-train the model in sl mode, the run appears to deadlock: the code seems to occupy GPU resources but never actually uses the graphics card. I haven't loaded any other datasets, such as the Google cluster trace, and I haven't installed any modified version of MXNet. What should I do to get the code running properly? Thank you very much.
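For reference, here is a rough sketch of the settings described above as they might look in parameters.py. Only JOB_LEN_PATTERN and JOB_ARRIVAL_PATTERN are names taken from this post; TRAINING_MODE is an assumed name for the training-mode setting, and the exact casing of the values may differ in the repository.

```python
# Sketch of the parameters.py edits described in this issue.
# TRAINING_MODE is an ASSUMED variable name; check parameters.py for the real one.
TRAINING_MODE = "sl"                  # supervised pre-training mode, as described above

# These two names come from the issue text itself.
JOB_LEN_PATTERN = "Normal"            # job length pattern
JOB_ARRIVAL_PATTERN = "Google_Trace"  # job arrival pattern
```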

phchtl • Oct 19 '23 02:10