Imene KOLLI
I'm having the same issue
Any updates on this?
@cau-git It could be beneficial to save the PDF pages as images and then run OCR on them in such cases. I also tried `PyPdfiumDocumentBackend`, and the results are the same.
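
To make the suggestion concrete, here is a rough sketch of the fallback logic I have in mind. It is not tied to any library API: `extract_text` and `ocr_page` are hypothetical callables you would supply yourself (for example, one that reads the embedded text layer and one that renders the page to an image and runs an OCR engine on it), and `min_chars` is an arbitrary threshold I picked for illustration.

```python
from typing import Callable, List


def text_with_ocr_fallback(
    num_pages: int,
    extract_text: Callable[[int], str],  # hypothetical: reads the embedded text layer of a page
    ocr_page: Callable[[int], str],      # hypothetical: renders the page to an image and OCRs it
    min_chars: int = 20,                 # assumed threshold, tune for your documents
) -> List[str]:
    """Return the text of each page, using OCR when the text layer looks empty."""
    pages: List[str] = []
    for index in range(num_pages):
        text = extract_text(index).strip()
        if len(text) < min_chars:
            # The text layer is missing or nearly empty, so fall back to OCR.
            text = ocr_page(index).strip()
        pages.append(text)
    return pages


if __name__ == "__main__":
    # Stand-in data: page 0 has a usable text layer, page 1 has none.
    native = {0: "A page with a perfectly usable embedded text layer.", 1: ""}
    result = text_with_ocr_fallback(
        2,
        extract_text=lambda i: native[i],
        ocr_page=lambda i: f"ocr text {i}",
    )
    for page_text in result:
        print(page_text)
```

The length check is only a placeholder heuristic. A page whose text layer is present but garbled would need a different test, such as checking the share of unprintable or replacement characters.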
@PeterStaar-IBM here's an example: https://content.influencemap.org//site/data/000/982/Enel_corporate_website_energy_mix_June_2022_June_2022.pdf
@PeterStaar-IBM As I mentioned in my issue description, the main issue seems to be that OCR is not applied even if I specify `do_ocr=True`. @cau-git mentioned that OCR is not...