surya
surya copied to clipboard
about training data set
Thank you very much for the open source project. After I tried it, it worked very well. Can you please give me some details about your training data set。
looks like DocLaynet dataset for text lines and layout detection. (not sure for ocr, but doclaynet contains machine-generated ocr annotations)
how about the ordering model?