Imene KOLLI comments


Results: 5 comments by Imene KOLLI

python ingest.py fail

I'm having the same issue

Python 3.13 support

any updates on this?

Docling Produces Unreadable Text Output for PDF with non-standard Font Encoding, OCR Appears Not to be Applied

@cau-git It could be beneficial to save the PDF pages as images and then trigger OCR in such cases. Also, I tried `PyPdfiumDocumentBackend` and the results are the same.
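One way to decide when "such cases" apply is to score the extracted text before falling back to OCR. The sketch below is only an illustration, not part of Docling: `garbled_ratio`, `needs_ocr_fallback`, and the 0.3 threshold are hypothetical names and values. It flags text dominated by private-use, unassigned, or control characters and the U+FFFD replacement character. Glyphs that are mis-mapped onto ordinary letters would slip past this check and would need something else, such as a dictionary lookup.

```python
import unicodedata


def garbled_ratio(text: str) -> float:
    """Return the fraction of non-whitespace characters that look like extraction debris."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        # Empty text gives no signal either way; the caller decides what to do with it.
        return 0.0
    suspicious = 0
    for c in chars:
        category = unicodedata.category(c)
        # Co = private use, Cn = unassigned, Cc = control; U+FFFD is the replacement character.
        if category in {"Co", "Cn", "Cc"} or c == "\ufffd":
            suspicious += 1
    return suspicious / len(chars)


def needs_ocr_fallback(text: str, threshold: float = 0.3) -> bool:
    """Hypothetical rule: suggest rendering the page as an image and running OCR on it."""
    return garbled_ratio(text) >= threshold
```

A page whose text scores above the threshold could then be rendered to an image and passed to whichever OCR engine is configured; the threshold itself would need tuning on real documents.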

Docling Produces Unreadable Text Output for PDF with non-standard Font Encoding, OCR Appears Not to be Applied

@PeterStaar-IBM here's an example: https://content.influencemap.org//site/data/000/982/Enel_corporate_website_energy_mix_June_2022_June_2022.pdf

Docling Produces Unreadable Text Output for PDF with non-standard Font Encoding, OCR Appears Not to be Applied

@PeterStaar-IBM As I mentioned in my issue description, the main issue seems to be that OCR is not applied even when I specify `do_ocr=True`. @cau-git mentioned that OCR is not...