PICK-pytorch icon indicating copy to clipboard operation
PICK-pytorch copied to clipboard

tool used in preparing training data ?

Open ziodos opened this issue 3 years ago • 3 comments

can you provide some details about the method you used to prepare training data , I think you didn't use a classic ocr tool, thanks.

ziodos avatar Apr 01 '21 13:04 ziodos

you have to use an ocr tool. just mapping the text with corresponding label is an issue i would suggest you use labelImg to get the region and then use overlapping text region to make corresponding labels.

knitemblazor avatar Apr 21 '21 07:04 knitemblazor

I am using tesseract as text detection and text recognition tool , the author said that it wasn't good for result accuracy , I still don't know why

ziodos avatar Apr 21 '21 22:04 ziodos

There are issues in tesseract , it does not work with complex document structure and ocr also fails some time.

NeerajAI avatar May 20 '21 18:05 NeerajAI