scrapper
scrapper copied to clipboard
Web scraper with a simple REST API living in Docker and using a Headless browser and Readability.js for parsing.
Thx for your little project, its very nice and powerfull i'm using it to get article from my medium account i have also made a recent image with all dependency...
Currently, when i make a request, the response fields when using `/api/articles` is a huge list of fields. This is especially not great when using the `full-content=true` flag. I get...
Can you please add the markdown support as output format?
Hi, Is there a way to access shadowDOM elements? Thanks
Most of the news article use different byline from Author so it is good idea to have these both.
Is it possible to add a URL type field in the return result, which indicates whether this URL is for the article list page or the article content page? Just...