crnn icon indicating copy to clipboard operation
crnn copied to clipboard

Add STN to crnn

Open hn18001 opened this issue 7 years ago • 2 comments

@bgshih crnn is a very great project, thanks for your open. Have you ever added stn to crnn? I tried to add the stn layer(https://github.com/qassemoquab/stnbhwd) to crnn, but the train loss is always very big, I've already set the transform matrix to identity matrix, but it looks like the stn layer learns nothing in the training procedure, should I try sgd optimization method instead of adadelta or others?

hn18001 avatar Jan 24 '17 01:01 hn18001

That's a nice idea actually @hn18001 , ill try to add too, have you gone through this http://torch.ch/blog/2015/09/07/spatial_transformers.html, they have added stn layer for recognising traffic signs.

rremani avatar Jan 24 '17 13:01 rremani

@hn18001 @rremani We have another paper that has done that, but on attention-based generator rather than CRNN. But I believe the same idea would work on CRNN.

In our experience, adding STN makes the network much harder to train. Also, identity initialization doesn't work for us. We used a slightly disturbed initialization (see the paper) to encourage STN to be optimized.

bgshih avatar Jan 24 '17 19:01 bgshih