Christoph Auer
@langzichai Several improvements, especially for GPU acceleration and layout processing, have been released since your last report. Would you mind checking again with docling==2.14.0?
I am concluding that this is no longer a concern with the recent docling versions. Please feel free to re-open if you have new evidence of slow-downs. Thanks.
@hisan-ideamaker this is likely a limitation of RapidOCR's performance on English/Latin material with the PP-OCR v5 models. You have the choice of going back to EasyOCR, which was the previous...
@sebihoefle The bounding boxes docling infers for elements on a page are paragraph-scoped for text. If a chunk is created with a subset of a paragraph (e.g. sentence level), it...
You can test the docling extraction pipeline for this: https://docling-project.github.io/docling/examples/extraction/
@JViktoRArtola the main reason you see this is that "full-page pictures" are mostly classified as background art. The picture description works if the picture is embedded in a natural context...
@dghoffra can you please provide more details to reproduce this? I would like to know the exact settings and to have an input PDF that exposes the problem.
@simjak I agree we need a fix for RapidOcr, but I would like to have `RapidOcrOptions` in the Union instead. I think it is necessary for discovery of legal CLI...
@simjak we will close this in favour of https://github.com/DS4SD/docling/pull/544 which includes a fix.
@mudassir206 So far, we have taken care of the correct representation of Arabic script from _digital PDF text_. For embedded bitmaps (e.g. scanned pages) we currently depend on the capabilities of...