No output from convert_into, No error raised in terminal

Open jksshack opened this issue 9 months ago • 0 comments

When I run this script in cmd, it pauses after the print statement that precedes tabula.convert_into, then quits without creating the output CSV file or printing an error message.

I don't really know where to go from here, given the lack of information about what is going on behind the scenes. I'm not super familiar with wrappers or Java, so any help here is appreciated!
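
One way to get at least an exit code out of a silent quit is to run the failing code in a separate Python process and inspect what comes back. The sketch below uses only the standard library; `run_in_child` is a hypothetical helper name, and the snippet passed to it is a stand-in for the real conversion script.

```python
import subprocess
import sys


def run_in_child(code: str, timeout: float = 300.0) -> subprocess.CompletedProcess:
    """Run a Python snippet in a separate interpreter and capture its output.

    If the child dies abruptly, the parent survives and can still read
    the return code and whatever was written to stdout/stderr.
    """
    return subprocess.run(
        [sys.executable, "-c", code],
        capture_output=True,
        text=True,
        timeout=timeout,
    )


# Stand-in snippet; replace it with the body of the real script.
result = run_in_child("print('before convert_into')")
print("exit code:", result.returncode)
print("stdout:", result.stdout)
print("stderr:", result.stderr)
```

A non-zero exit code with empty stderr would suggest the process was terminated rather than raising a Python exception.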

Script:

    import tabula

    input_folder = "<path to file>"
    input_file_name = "<file name.pdf>"
    input_path_name = input_folder + "/" + input_file_name
    output_path_name = input_path_name[:-3] + "csv"

    print("before convert_into")  # stops running after printing this statement
    tabula.convert_into(input_path=input_path_name, output_path=output_path_name, output_format="csv", pages="all")
    print("finished")
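
As a side note on the path handling, slicing off the last three characters works for a ".pdf" name but is easy to get wrong for other extensions. A minimal sketch with pathlib follows; `csv_path_for` is a hypothetical helper, and nothing here is claimed to be the cause of the crash.

```python
from pathlib import Path


def csv_path_for(pdf_path: str) -> Path:
    """Return the same path with its final extension replaced by .csv."""
    return Path(pdf_path).with_suffix(".csv")


# Placeholder path, mirroring the placeholders used in the script.
input_path = Path("<path to file>") / "<file name.pdf>"
print("input exists:", input_path.is_file())
print("output path:", csv_path_for(str(input_path)))
```

Checking `is_file()` before calling the converter rules out a mistyped folder or file name as the reason nothing is written.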

Already Attempted Troubleshooting:

Output from running tabula.environment_info():

    Python version: 3.12.8 (tags/v3.12.8:2dc476b, Dec 3 2024, 19:30:04) [MSC v.1942 64 bit (AMD64)]
    Java version: java version "24" 2025-03-18
    Java(TM) SE Runtime Environment (build 24+36-3646)
    Java HotSpot(TM) 64-Bit Server VM (build 24+36-3646, mixed mode, sharing)
    tabula-py version: 2.10.0
    platform: Windows-11-10.0.22631-SP0

The JDK has also been installed and is in the Java folder alongside the JRE.

In Environment Variables (User variables), Java is on my Path as both:

    C:\Program Files\Java\jre1.8.0_441\bin
    C:\Program Files\Java\jdk-24

In Environment Variables (System variables), Path contains:

    C:\Program Files\Common Files\Oracle\Java\javapath
    C:\Program Files (x86)\Common Files\Oracle\Java\java8path

where java in cmd outputs:

    C:\Program Files\Common Files\Oracle\Java\javapath\java.exe
    C:\Program Files (x86)\Common Files\Oracle\Java\java8path\java.exe
    C:\Program Files\Java\jre1.8.0_441\bin\java.exe
    C:\Program Files\Java\jdk-24\bin\java.exe

Are the first two results the cause of the problems I'm running into? Should they be reordered? Should the last two be moved from Path in User variables to Path in System variables?
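
For context on the ordering question: Windows builds the effective Path from the System entries followed by the User entries, and the first match wins, which fits the order `where java` reports. The sketch below reproduces that lookup from inside Python, so the result reflects the Path the Python process actually sees. `find_all_on_path` is a hypothetical helper, not part of tabula-py.

```python
import os
from typing import List


def find_all_on_path(name: str, path_value: str, sep: str = os.pathsep) -> List[str]:
    """Return every existing file called `name`, in PATH order (first one wins)."""
    hits = []
    for directory in path_value.split(sep):
        directory = directory.strip().strip('"')  # tolerate quoted or padded entries
        if not directory:
            continue
        candidate = os.path.join(directory, name)
        if os.path.isfile(candidate):
            hits.append(candidate)
    return hits


# On Windows the executable is java.exe; elsewhere it is plain java.
exe_name = "java.exe" if os.name == "nt" else "java"
for position, hit in enumerate(find_all_on_path(exe_name, os.environ.get("PATH", "")), start=1):
    print(position, hit)
```

If the first hit is not the runtime shown by tabula.environment_info(), that mismatch would be worth reporting alongside the rest of the output.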

tabula-py was installed with pip install tabula-py, NOT pip install tabula.

The tabula module imports successfully.

Typing java in cmd is recognized and prints a list of options.

Command prompt has been restarted multiple times.

When running the code in Jupyter, this message is output:

    The Kernel crashed while executing code in the current cell or a previous cell. Please review the code in the cell(s) to identify a possible cause of the failure.
    [error] Disposing session as kernel process died ExitCode: 3221226356, Reason:
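
The exit code is easier to search for in hexadecimal. The sketch below converts it and looks it up in a small table of well-known Windows NTSTATUS values. Treating the number as an NTSTATUS code is an assumption, since the message gives no reason, and `describe_exit_code` is a hypothetical helper.

```python
# A few well-known NTSTATUS values; this table is illustrative, not exhaustive.
KNOWN_NTSTATUS = {
    0xC0000005: "STATUS_ACCESS_VIOLATION",
    0xC0000374: "STATUS_HEAP_CORRUPTION",
    0xC0000409: "STATUS_STACK_BUFFER_OVERRUN",
}


def describe_exit_code(exit_code: int) -> str:
    """Format an exit code as unsigned 32-bit hex, with a name if it is known."""
    value = exit_code & 0xFFFFFFFF  # also handles codes reported as negative ints
    name = KNOWN_NTSTATUS.get(value, "unknown")
    return f"0x{value:08X} ({name})"


print(describe_exit_code(3221226356))  # 0xC0000374 (STATUS_HEAP_CORRUPTION)
```

Read that way, the kernel died from a crash in native code rather than from a Python exception, which would be consistent with no traceback appearing in cmd.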

jksshack, Mar 20 '25 16:03