Thomas J. Leeper

Results 206 comments of Thomas J. Leeper
trafficstars

@leopoldinho A couple of general suggestions: - `ghit::install_github(c("leeper/tabulizerjars", "leeper/tabulizer"), INSTALL_opts = "--no-multiarch", verbose = TRUE)` will give you some more details on what is failing. - Make sure **rJava** is...

It might be your `JAVA_HOME` environment variable. Try something like `Sys.setenv(JAVA_HOME = "C:/Program Files/Java/jdk1.8.0_92")` maybe?

I think this is an inherent limitation of PDF format. As I understand it, white space is not represented as actual "space" characters but rather as horizontal offsets for the...

Oh actually `extract_text()` isn't a tabula feature. It just uses pdfbox. If it looks like it possible directly with PDFbox, I can try to implement it but I don't think...

Thanks. I will take a look as soon as I can.

This works on Windows, but is screwy on Linux. Must investigate more.

I don't think I understand PDF well enough to know whether that makes any sense.

@soedr @jrcunning Are you also on Mac? Versions?

@jrcunning Thanks. Can you tell me what version of Java you installed? (The underlying java library - tabula - has some updates and it looks like they're causing Mac-specific issues.)

It seems the latest version of the tabula library is likely the issue. Consider re-installing an older version of tabulizerjars with, for example: ```R ghit::install_github("ropensci/[email protected]", verbose = TRUE) ```