Imene KOLLI

Results 5 comments of Imene KOLLI

I'm having the same issue

any updates on this?

@cau-git It could be beneficial to save the pdf pages as images and then trigger the OCR in such cases. Also, I tried `PyPdfiumDocumentBackend` and the results are the same.

@PeterStaar-IBM here's an example: https://content.influencemap.org//site/data/000/982/Enel_corporate_website_energy_mix_June_2022_June_2022.pdf

@PeterStaar-IBM As I've mentioned in my issue description, the main issue seems to be the OCR not being applied even if I specifiy `do_ocr=True`. @cau-git mentioned that OCR is not...