sightseq
sightseq copied to clipboard
How is the picture processed in sequence_generate?
The shape of the picture is (Batch, Channel, H, W) The data shape that the sequence_generate can receive is (batch, seq_len,...) I did not find a solution in your code, how did you deal with this problem? Thank you
Hi @xiaoxiaokuye , I haven't followed this code for a while. I guess this is done in
https://github.com/zhiqwang/sightseq/blob/b1eac1de47020f9ab714a807e6c7553c8324ade5/sightseq/models/text_recognition_encoder.py#L74