headless-chrome-crawler
Question: Handling of PDF files
I want to create a general-purpose crawler with this project. By general-purpose I mean: if the URL leads to a PDF, I want it to render the PDF, and if it's HTML, I want it to render the HTML.
How does this project handle files like PDFs? Is there an example I can take a look at? Is there a Docker example for this project?
Hey @eladbitton! At the moment this project does not handle PDFs well. In fact, it simply crashes on them, so this is a valid point to improve. Did you figure out how to achieve what you want? If not, please elaborate on what your final goal is.
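Until PDFs are handled properly, one possible workaround is to look at a URL's `Content-Type` header (for example from a `HEAD` request) before queueing it, and route PDFs somewhere else. The sketch below only covers the classification step. `classifyContentType` and `ResourceKind` are hypothetical names, not part of headless-chrome-crawler, and this is an assumption about how one might approach it rather than a supported feature.

```typescript
// Hypothetical helper, not part of headless-chrome-crawler.
type ResourceKind = "pdf" | "html" | "other";

// Classify a raw Content-Type header value.
function classifyContentType(contentType: string | null): ResourceKind {
  if (!contentType) return "other";
  // Drop parameters such as "; charset=utf-8" and normalize case.
  const mime = contentType.split(";")[0].trim().toLowerCase();
  if (mime === "application/pdf") return "pdf";
  if (mime === "text/html" || mime === "application/xhtml+xml") return "html";
  return "other";
}

// Usage: decide what to do with a URL based on its header value.
console.log(classifyContentType("application/pdf"));          // "pdf"
console.log(classifyContentType("Text/HTML; charset=UTF-8")); // "html"
console.log(classifyContentType(null));                       // "other"
```

Servers sometimes send a wrong or missing `Content-Type`, so a check like this is best treated as a hint, not a guarantee.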