headless-chrome-crawler Question: Handling of pdf files

Open eladbitton opened this issue 5 years ago • 1 comment

I want to create a general-purpose crawler with this project. By general-purpose I mean: if the URL leads to a PDF, I want it to render the PDF, and if it's HTML, I want it to render the HTML.

How does this project handle files like PDFs? Is there an example I can take a look at? Is there a Docker example for this project?

Sep 21 '19 08:09 eladbitton

Hey @eladbitton! At the moment this project does not handle PDFs well. Actually, it simply crashes, so this is a valid point to improve. Did you figure out how to achieve what you want? If not, please elaborate on what your final goal is.
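Until that is fixed, one possible stopgap is to decide up front whether a URL points to a PDF and route it away from the browser-based crawl. Below is a minimal sketch of such a check, not something this project ships: `classifyResource` is a hypothetical helper, and it assumes you already have the `Content-Type` header (for example from a separate HEAD request) or are willing to fall back on the file extension.

```javascript
// Hypothetical helper, NOT part of headless-chrome-crawler.
// Classifies a resource as 'pdf', 'html', 'other' or 'unknown' using the
// Content-Type header when available, and the URL path extension otherwise.
function classifyResource(url, contentType) {
  if (typeof contentType === 'string' && contentType.trim() !== '') {
    // Drop parameters such as "; charset=utf-8" and normalise case.
    const mime = contentType.split(';')[0].trim().toLowerCase();
    if (mime === 'application/pdf') return 'pdf';
    if (mime === 'text/html' || mime === 'application/xhtml+xml') return 'html';
    return 'other';
  }

  // No header available: fall back to the extension in the URL path.
  let pathname;
  try {
    pathname = new URL(url).pathname.toLowerCase();
  } catch (err) {
    return 'unknown'; // not a parseable absolute URL
  }
  if (pathname.endsWith('.pdf')) return 'pdf';
  if (pathname.endsWith('.html') || pathname.endsWith('.htm')) return 'html';
  return 'unknown'; // the extension alone is not enough to decide
}

// Example: only URLs classified as 'html' would be handed to the crawler;
// 'pdf' URLs could be downloaded separately, and 'unknown' ones probed first.
const kind = classifyResource('https://example.com/report.pdf');
console.log(kind);
```

The extension check is only a heuristic, since many PDFs are served from URLs without a `.pdf` suffix, so the header should win whenever it is known. Where exactly to call this depends on your setup; whether skipping such URLs before they are queued is enough to avoid the crash has not been verified here.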

Oct 17 '20 06:10 kulikalov
