tabula-java

Extract tables from PDF files

Results: 151 tabula-java issues

It took a bit of troubleshooting to determine my issue, but when I tried to use cmd for batch processing, it kept erroring out with "Invalid directory or filename"....

[FT801167430603.pdf](https://github.com/tabulapdf/tabula-java/files/1802272/FT801167430603.pdf) I have attached a PDF that gives me different output in the tabula-java command-line utility and in the Tabula app. Using the app, by clicking autodetect tables,...

Hello, first of all, a word of praise for this tool. Would it be possible to add a CLI option to use a different separator for CSV export? (see...

Hi, first of all, thanks for the great tool. I was trying to extract the data from the following PDF: [pdf](http://www.isuresults.com/results/gpf0910/gpf0910_Men_SP_Scores.pdf) If I extract the following two tables like...

Hello, have you noticed any difference between using the 'all' pages option and calling each page separately? I had a case where the table extraction process delivered better results when...

I don't see commas when I test your file src/test/resources/technology/tabula/eu-002.pdf using Java. So I am pulling out the rectangle and trying to extract, but I get an UnsupportedOperationException. // the algorithm we're...

Hi, I found [this](https://github.com/tabulapdf/tabula) project, in which you can build/save templates to extract data from a pdf. Is there a way to use such a template to extract the data...

I'm getting some odd behaviour trying to extract names from [this pdf](http://brandshatchstages.co.uk/sites/brandshatchstages.co.uk/files/2019/Results/MGJE%20BHWS%202019%20Entry%20List.pdf). In particular, it looks like some characters are being extracted several times? ``` 0 | 1 | MaMrka...

Wrong ruling extracted on page 7. [1548063103893.pdf](https://github.com/tabulapdf/tabula-java/files/2786271/1548063103893.pdf) ![1548063103893-7](https://user-images.githubusercontent.com/18567513/51592308-3cc2d600-1f2a-11e9-96c9-cd37a71e49b8.jpg)

Hi, I am using tabula-java from Python like this: `os.system(" ".join(["java -jar", jar, "-b ", nFolder, " -n -p all"]))`, where nFolder is a directory. I have 36 PDF files, which are looking...
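
As an aside on the call quoted in the last issue: joining arguments into one string for `os.system` can break when the jar path or folder name contains spaces. A minimal sketch of the same invocation using `subprocess` with an argument list follows. The flags (`-b`, `-n`, `-p all`) are copied from the snippet above; the function names `build_tabula_command` and `run_tabula_batch` are hypothetical, and this is not presented as the fix for the issue, whose details are cut off.

```python
import subprocess


def build_tabula_command(jar, folder):
    """Build the argument list for a tabula-java batch run.

    The flags are taken verbatim from the issue snippet; jar and folder
    are whatever paths the caller supplies (assumed to exist).
    """
    # Each argument is its own list element, so no shell quoting is needed.
    return ["java", "-jar", jar, "-b", folder, "-n", "-p", "all"]


def run_tabula_batch(jar, folder):
    """Run the batch command; raises CalledProcessError on a non-zero exit."""
    return subprocess.run(build_tabula_command(jar, folder), check=True)
```

Calling `run_tabula_batch(jar, nFolder)` would then replace the `os.system` line, with paths passed through unchanged even if they contain spaces.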