tabula-java icon indicating copy to clipboard operation
tabula-java copied to clipboard

Extract tables from PDF files

Results 151 tabula-java issues
Sort by recently updated
recently updated
newest added

Hello All, Firstly, thank you VERY much for publishing this amazing library! I am working on an integration with Apache Drill which enables users to query PDF files directly using...

Hello I try to extract table from PDF that contains Arabic latter but when I extract the table I get **???** for all Arabic letter this issue happens only when...

We have been able to extract PDF with ANSI encoding. However, we started seeing some PDFs using Identity-H encoding with TrueType CID Font. Does Tabula support this scenario?

I am trying to extract tables from a PDF with no lines, i.e., I am using the `stream` option. The table stretches over several pages with the header being repeated...

In both java based tabula and its python wrapper tabula-py , even when all pages option is given only 1st page is converted. Currently to overcome this i need to...

Add option to specify specific delimeter as field seperator

For each page I would like to specify the area to extract the table. `-p 1 -a y1, x1, y2, x2 -p 2 -a Y1, X1, Y2, X2` Could that...

The "scratch file already closed" message when building is related to the premature closing of the document in Utils.pageConvertToImage(). Please remove "document.close();". However even that isn't really enough. The PDDocument...

### Issue summary: when use area options, empty cells in the table is removed and the cells below are shifted up automatically. but if not use area option, output remains...

Hi Team, I have been working with Tabula and pdfbox for quite some time, and my issue here is nurminen detection alogirithm is not ignoring page headers and footers while...