show_attend_and_tell.tensorflow
show_attend_and_tell.tensorflow copied to clipboard
Why is the gradient vanishing?
Thank you very much for sharing. I want to use the attention model to do video classification, but there is always a gradient vanishing during the training. Have you had a similar problem before?