clstm icon indicating copy to clipboard operation
clstm copied to clipboard

Training two layer networks fails with gradient errors

Open mihaelacr-google opened this issue 8 years ago • 2 comments

I have added a createBidi2 method in the CLSTMText class function, but training 2 layer networks fails with a gradient assertion error, regardless of the used learning rate and hidden layer sizes.

My code to create two layer here.

The error I am getting is clstmfiltertrain: clstm.cc:642: void ocropus::GenericNPLSTM<F, G, H>::backward() [with int F = 1; int G = 2; int H = 2]: Assertion `!anynan(out)' failed.

Looks like there is an error when computing the gradient of two layer networks.

mihaelacr-google avatar Nov 13 '16 11:11 mihaelacr-google

I am facing the same problem.

abhigarg avatar Apr 19 '17 09:04 abhigarg

If this can be of any help to someone, I got the exact same error when I run the clstmocrtrain without the corresponding *.gt.txt files in the image folder.

lorenzob avatar Oct 04 '17 13:10 lorenzob