tabulapdf
Row with multiple lines
Hi, this is a question about package functionality. Are there recommendations for how to extract tables in which rows span multiple lines, as in the tabula example here [https://github.com/tabulapdf/tabula-extractor/issues/23]? My own data are somewhat simpler in that there are no subheaders, but the same issue arises: rows spanning multiple lines. Thank you for a great package!
library("tabulizer")
# specify filename
file <- '~/Desktop/atipp_coordinators_govt_agencies.pdf'
# read file
out1 <- extract_tables(file, output = "data.frame", method = 'lattice', pages=2)
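One possible workaround is to post-process the extracted data frame and fold continuation lines back into the row they belong to. The sketch below is not a tabulizer feature. `merge_continuation_rows` and its `key_col` argument are hypothetical names, and the code assumes that a wrapped line comes back from `extract_tables()` as a separate row whose first column is blank or NA. That assumption may not hold for every PDF, and the sample data is made up for illustration.

```r
# Hypothetical helper (not part of tabulizer): merge continuation rows
# into the record above them.
# Assumption: a continuation row has a blank or NA value in `key_col`.
merge_continuation_rows <- function(df, key_col = 1) {
  if (nrow(df) == 0) return(df)
  key <- as.character(df[[key_col]])
  # A row starts a new record when its key cell is non-empty
  is_start <- !(is.na(key) | trimws(key) == "")
  # Treat the first row as a record start even if its key is blank
  is_start[1] <- TRUE
  # Running count of record starts gives each row a record id
  group <- cumsum(is_start)
  merged <- lapply(split(df, group), function(block) {
    as.data.frame(
      lapply(block, function(col) {
        col <- trimws(as.character(col))
        # Join the non-empty pieces of each cell with a single space
        paste(col[!is.na(col) & col != ""], collapse = " ")
      }),
      stringsAsFactors = FALSE,
      check.names = FALSE
    )
  })
  out <- do.call(rbind, merged)
  rownames(out) <- NULL
  out
}

# Made-up data standing in for one extracted table
raw <- data.frame(
  Agency  = c("Dept. of Finance", "", "Dept. of Health", "", ""),
  Contact = c("Jane Doe", "ATIPP Coordinator",
              "John Roe", "Access and", "Privacy Office"),
  stringsAsFactors = FALSE
)
clean <- merge_continuation_rows(raw)
print(clean)
```

All columns come back as character, so numeric columns would need converting afterwards. If the first column itself wraps across lines, a different column that is filled only on the first line of each record would have to serve as `key_col`.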
Same
I have the same issue