docling icon indicating copy to clipboard operation
docling copied to clipboard

RapidOCR fails for NoneType

Open simjak opened this issue 11 months ago • 0 comments
trafficstars

File "/usr/local/lib/python3.12/site-packages/docling/pipeline/base_pipeline.py", line 52, in execute
raise e
File "/usr/local/lib/python3.12/site-packages/docling/pipeline/base_pipeline.py", line 44, in execute
conv_res = self._build_document(conv_res)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/usr/local/lib/python3.12/site-packages/docling/pipeline/base_pipeline.py", line 162, in _build_document
raise e
File "/usr/local/lib/python3.12/site-packages/docling/pipeline/base_pipeline.py", line 149, in _build_document
for p in pipeline_pages: # Must exhaust!
^^^^^^^^^^^^^^
File "/usr/local/lib/python3.12/site-packages/docling/pipeline/base_pipeline.py", line 116, in _apply_on_pages
yield from page_batch
File "/usr/local/lib/python3.12/site-packages/docling/models/page_assemble_model.py", line 59, in __call__
for page in page_batch:
^^^^^^^^^^
File "/usr/local/lib/python3.12/site-packages/docling/models/table_structure_model.py", line 93, in __call__
for page in page_batch:
^^^^^^^^^^
File "/usr/local/lib/python3.12/site-packages/docling/models/layout_model.py", line 281, in __call__
for page in page_batch:
^^^^^^^^^^
File "/usr/local/lib/python3.12/site-packages/docling/models/rapid_ocr_model.py", line 136, in __call__
for ix, line in enumerate(result)
^^^^^^^^^^^^^^^^^
TypeError: 'NoneType' object is not iterable

For this document verdaus img2img.pdf

simjak avatar Dec 09 '24 10:12 simjak