Twitter-Source-Bot icon indicating copy to clipboard operation
Twitter-Source-Bot copied to clipboard

Processing PDF on Scraper

Open CryogenicPlanet opened this issue 4 years ago • 1 comments

  • How to process pdf files or files were the text is not in the document body

  • First how do we identify these files

  • Second how do we process them

Note this also includes books from books.google.com

CryogenicPlanet avatar May 10 '20 21:05 CryogenicPlanet

This solution depends on #50 Resolving itself first otherwise no point

CryogenicPlanet avatar May 10 '20 21:05 CryogenicPlanet