pdftools
pdftools copied to clipboard
Suggestion of table to data.table (or data.frame) toolbox
I love everything this package wants to be. I have a suggestion. I read that turning pdf tables into R objects is not easy since there are many ways one can setup a table in a PDF. But surely there are several common ways or several variations on a theme. Would it be possible to include (in the package or outside of it) a few tools or suggestions to get the naïve or very part-time coder started? Or how about a blog post tutorial? I am sure if I saw 3 worked examples it would help me tremendously.
I have just released a blog post based around the use of pdftools including code for creating a data.frame from it Not the most elegant - but may get you started
See also http://www.brodrigues.co/blog/2018-06-10-scraping_pdfs/