tabula-java icon indicating copy to clipboard operation
tabula-java copied to clipboard

Extract tables from PDF files

Results 151 tabula-java issues
Sort by recently updated
recently updated
newest added

Some fonts from PDF is not converting properly u1e5 f0o.r0 r0ef -- u1e5 f0o.r0 r0ef u1e5 f0o.r0 r0ef u1e5 f0o.r0 r0ef u1e5 f0o.r0 r0ef u1e5 f0o.r0 r0ef u1e5 f0o.r0 r0ef...

PDF: 读取内容: FMD-2016 Failure Distribution Data 2- __________________________________________________________________________________________________________________________________________ Part Description Norm Fail Failure Mode/Mechanism Dist Dist Data Details Source Quantit __________________________________________________________________________________________________________________________________________ Absorber,Overvoltage 1 Sourc Failed To Operate 100.0% 100.0% Failed...

Has anyone tried to compile this using GraalVM to make a static binary? For calling from bindings such as python this would massively improve startup times and reduce the need...

## Test failure Reproduction ``` mvn install -pl . -am -DskipTests -Dsign.skip mvn -pl . edu.illinois:nondex-maven-plugin:2.1.1:nondex -Dtest=technology.tabula.TestSpreadsheetExtractor#testRTL ``` [Non-Dex](https://github.com/TestingResearchIllinois/NonDex) detected flakiness and got the error message. More precisely as shown...

I'm having issues with extracting tables. The document is a 2+ page credit card statement. Page 1 always works find but the subsequent pages do not. I have tried the...

Hey, thank you for maintaining this useful library! I'm currently working with pdf table parsing and I am expecting `page_number` in the output for the extracted tables. I found [PR...

I am working on a PDF file which might include watermark when extracting the table. The watermark might occur at different locations. 2 approaches I am thinking but I am...

Summary of your issue Refer: https://github.com/chezou/tabula-py/issues/349 I encountered an issue while processing a PDF file where a specific page consistently triggers a "CalledProcessError" with the following command: ['java', '-Dfile.encoding=UTF8', '-jar']....