Using Tabula to export values from multiple PDFs to single output csv

Open viko188 opened this issue 5 years ago • 0 comments

#!/bin/bash for f in /path/to/dir/*.pdf; do java -jar /path/to/tabula/tabula-0.9.0-jar-with-dependencies.jar -p all -a 49.5,52.3285714,599.6571428,743.91428571 -o $f.csv $f done

So i know to put my own paths to my tabula.jar's location, I understand the -pages, have pulled my -areas, designating my -output csv etc. Ive successfully pulled from one PDF using CMD prompt on windows.

However, I want to be able to take all PDF files in one folder and output the values in the given areas into one .csv file as an iterative process until the directory runs out of PDFs. To do that you allegedly use the formula above.

However, I do not understand what the F is representing in the beginning, and shouldn't I just put my regular output csv at the end?

also, every time I input #!/bin/bash and then continue, it tells me that #! is not a recognized file, command, etc.

I'm missing something...can someone please break this code block down for me?

Thank you for understanding my novice level, and helping.

Jan 09 '20 07:01 viko188