donut icon indicating copy to clipboard operation
donut copied to clipboard

Erroneous Text output for IE task

Open riteshKumarUMass opened this issue 2 years ago • 0 comments

Hi, I tried fine tuning the model with custom receipt dataset for IE task and noticed issues with the output text extracted for given set of keys. It either misses out or add extra 1-2 characters to the actual text present in the document and this pattern is very frequent. I am using the default input_size: [1280, 960]. The images are really clear where any other off the shelf OCR model is able to extract text with no errors. I fine-tuned the model with 400 images with 15 keys and tested it on 100 samples. Has anyone encountered such issue?

riteshKumarUMass avatar Aug 12 '22 15:08 riteshKumarUMass