deep-text-recognition-benchmark
deep-text-recognition-benchmark copied to clipboard
spatial transformer networks with variable length of inputs with fixed height variable width?
I have a recognizer which take a variable length of width but fixed in height, does spatial transformer networks able to train with this kind of network?
Maybe we should find a way to make the TPS's GridGenerator to create a variable create.