Twitter-Source-Bot
Twitter-Source-Bot copied to clipboard

Published 20 hours ago •

Reame
Issues

Processing PDF on Scraper

Open CryogenicPlanet opened this issue 4 years ago • 1 comments

How to process pdf files or files were the text is not in the document body
First how do we identify these files
Second how do we process them

Note this also includes books from books.google.com

May 10 '20 21:05 CryogenicPlanet

This solution depends on #50 Resolving itself first otherwise no point

May 10 '20 21:05 CryogenicPlanet