deep-learning-with-python-notebooks icon indicating copy to clipboard operation
deep-learning-with-python-notebooks copied to clipboard

[Chapter 6.3] Basic machine-learning approach freezing at end of first epoch

Open N1ck95 opened this issue 5 years ago • 2 comments

I've checked out the code you propose. However I can't figure out why at the end of the first epoch the training freezes and the notebook keeps running endlessly.

Epoch 1/20 499/500 [============================>.] - ETA: 0s - loss: 53.5993

It's stuck in this position almost from 15 minutes. I've tried to run the code several times both with Tensorflow 1 and 2, however nothing changes.

N1ck95 avatar Jan 31 '20 19:01 N1ck95

I ran into the same issue when using two GPUs. Not sure why, but after using only 1 GPU, it moved forward.

RoseString avatar Feb 14 '20 19:02 RoseString

I had the same problem and it was due to the version of the book I have, at least I assume! In the book where val_steps is defined, there is: val_steps = (300000 - 200001 - lookback) which must be: val_steps = (300000 - 200001 - lookback) // batch_size and also the same change for test_steps

when you do not use " // batch_size", val_steps will be much larger and it will take a lot of time to evaluate.

ghost avatar May 03 '20 13:05 ghost