IndexError in lattice
Describe the bug
An IndexError is raised in the _reduce_index method in lattice.py. It happens when a piece of text starts inside the table but ends outside it.
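To make the failure mode concrete, here is a minimal, self-contained sketch of how text that overruns the table edge can produce an out-of-range index. This is not Camelot's actual `_reduce_index` code. The names `COL_BOUNDS`, `naive_end_column` and `split_span` are hypothetical, and the equal-width column layout is an assumption made only for illustration.

```python
# Hypothetical illustration only: none of these names are part of Camelot.
COL_BOUNDS = [(0, 50), (50, 100)]  # assumed two-column table, x-coordinates


def naive_end_column(x1, col_bounds):
    """Look up the column containing x1, assuming x1 is always inside the table."""
    width = col_bounds[0][1] - col_bounds[0][0]
    idx = int(x1 // width)
    # Raises IndexError when x1 lies to the right of the last column.
    return idx, col_bounds[idx]


def split_span(x0, x1, col_bounds):
    """Split the span [x0, x1] into per-column pieces, dropping any overhang."""
    pieces = []
    for idx, (left, right) in enumerate(col_bounds):
        lo, hi = max(x0, left), min(x1, right)
        if lo < hi:  # keep only the part that overlaps this column
            pieces.append((idx, lo, hi))
    return pieces
```

With text spanning x = 80 to x = 130, `naive_end_column(130, COL_BOUNDS)` raises an IndexError, while `split_span(80, 130, COL_BOUNDS)` keeps only the part that lies inside the last column. Whether clipping the overhang is the right behaviour for Camelot is a separate question; the sketch only shows one way to keep the index in range.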
Steps to reproduce the bug
Call read_pdf on the attached file.
Expected behavior
No error should be raised.
Code
import camelot
myfile = "camelot.pdf"  # assumed local path to the attached PDF
camelot.read_pdf(myfile, flavor='lattice', split_text=True)
PDF camelot.pdf
Environment
- OS: Windows
- Python version: 3.9.18
- Numpy version: 1.26.0
- OpenCV version: 4.6.0
- Ghostscript version: 10.02.1
- Camelot version: 0.11.0
Additional context
N/A
Hey!
As noted in https://github.com/camelot-dev/camelot/issues/343, this repo is no longer maintained, so we are trying to build a maintained fork at pypdf_table_extraction.
Do you want to check out the code there to see if the issue still persists? If so, please open an issue there.
Could not replicate this bug with the current release. Not sure whether a specific change addressed it, but it no longer seems to be an issue, so closing.