surya about training data set

Open wonders7796 opened this issue 1 year ago • 2 comments

Thank you very much for this open-source project. I tried it and it works very well. Could you please share some details about your training dataset?

Mar 29 '24 07:03 wonders7796

It looks like the DocLayNet dataset was used for text line and layout detection. (I'm not sure about OCR, but DocLayNet contains machine-generated OCR annotations.)

Apr 02 '24 10:04 sralvins

How about the ordering model?

Apr 23 '24 09:04 vbonnivardprobayes
