xavctn
xavctn
Hello, Thanks for the feedback. I am aware of this issue whenever columns are not greatly separated. I will try to work on it in upcoming updates.
Hello, As of now, I do not plan to create a lighter version. Considering what is being used in the processing of the algorithm, only beautifulsoup and xlsxwriter would be...
Hello, Right now, the cells height/width are supposed to be autofitted to their content but it might be possible to do it. I will check if I can do it...
Hello, That should not be happening. What version of the library have you installed ?
Hello, I made some modifications to the algorithm that are going to be included in the next release. I am not sure if it is going to handle those tables...
Hello, Can you provide the numba version installed in your environment ? I will try to replicate the issue.
Hi, I tried to replicate the issue on Windows using Python 3.11 but I was not able to. I do not know what is happening on your end TBH. I...
Hello, This is not really supposed to happen. Can you apply the extraction **without** any OCR and check the number of columns in your table (using the `extract_tables` method) ?...
Hello, I took a look at it and this is due to the poor quality of the table header that messes up the table detection. As of now, I won't...
Hello, As mentionned in the documentation, when processing PDFs, all pages are converted to images using a DPI of 200. The table coordinates returned by the library correspond to this...