donut Integrate a customized internal OCR engine to Donut

Integrate a customized internal OCR engine to Donut

Open Altimis opened this issue 1 year ago • 2 comments

Hello guys. Thank you so much for this brilliant Model. I'm aware that Donut is an OCR-free model which does not rely on an OCR input. When I performed some tests (fine-tuning the model), I realized that the internal OCR-engine performance is not as good as Google Cloud Vision OCR. Is is possible to change the OCR engine by this one ? Thanks you !

Feb 02 '24 15:02 Altimis

Donut is not made to compete with OCR engines, it is pre-trained on generating OCR to give it a general understanding about characters and language that can be leveraged in fine tuning tasks, like extracting a specific information from an input image. If you want good OCR, I would recommend sticking to tesseract or cloud solutions like the one you suggested.

Feb 02 '24 20:02 felixvor

donut donut copied to clipboard

Integrate a customized internal OCR engine to Donut

donut
donut copied to clipboard