Tabula App and tabula-java Command line Utility giving different Outputs
I have attached a PDF that gives different output in the tabula-java command-line utility and in the Tabula app. In the app, clicking "autodetect tables" finds the tables on every page. With the command-line utility, I only get the tables on the last two pages.
This is the command that I used: java -jar tabula-1.0.1-jar-with-dependencies.jar --guess -d -p all -t FT801167430603.pdf
Thanks, @Kanz95
Which version of the Tabula app are you using? The current one (1.2.0) only detects tables in the last two pages as well.
Thanks @jazzido. I used the latest Tabula 1.2.0 app (the tabula-jar.zip version for Linux), and I have attached a screenshot and the CSV output from the app (Tabula_App_Output.zip). It's detecting the tables on all 4 pages.
@jazzido Can you please look into this issue as soon as possible? We are building an app on top of it, and it has a very close deadline.
Hi @Kanz95,
No.
I don't know when I'm going to be able to look at this (Tabula is a side project, I don't get paid for it). If you're building an app, you might want to devote some of your resources to fix the issue yourself.
…and contribute back the fix, if you're so inclined.
The bug is in NurminenDetectionAlgorithm.java, line 488: https://github.com/tabulapdf/tabula-java/blob/master/src/main/java/technology/tabula/detectors/NurminenDetectionAlgorithm.java
if (edgeCountsPerLine[i][TextEdge.LEFT] > 2 &&
It should be:
if (edgeCountsPerLine[i][TextEdge.LEFT] > 1 &&
I tried this and it did not fix the problem; the results got worse.
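For anyone weighing the proposed change, here is a minimal, self-contained sketch of what moving the comparison from > 2 to > 1 does to a set of per-line edge counts. It is not the tabula-java code: the class EdgeThresholdSketch, the method indicesAbove, the LEFT constant and the sample counts are all made up for illustration, and the real condition at line 488 continues past the && shown above. The sketch only shows that a lower threshold admits more candidate edges, which may help on some pages and add false positives on others.

```java
import java.util.ArrayList;
import java.util.List;

public class EdgeThresholdSketch {
    // Hypothetical stand-in for TextEdge.LEFT; the real constant lives in tabula-java.
    static final int LEFT = 0;

    // Returns every index whose LEFT edge count is strictly greater than minCount.
    static List<Integer> indicesAbove(int[][] edgeCountsPerLine, int minCount) {
        List<Integer> hits = new ArrayList<>();
        for (int i = 0; i < edgeCountsPerLine.length; i++) {
            if (edgeCountsPerLine[i][LEFT] > minCount) {
                hits.add(i);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        // Made-up counts: one row per index, one column (LEFT) per row.
        int[][] counts = { {0}, {1}, {2}, {3}, {2} };
        System.out.println("> 2 keeps " + indicesAbove(counts, 2)); // [3]
        System.out.println("> 1 keeps " + indicesAbove(counts, 1)); // [2, 3, 4]
    }
}
```

With these sample counts the stricter test keeps one index and the looser test keeps three, so the change widens what the detector considers rather than targeting a specific page.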