puppeteer-daily-menu-scraper
puppeteer-daily-menu-scraper copied to clipboard
Puppeteer daily menu scraper from downtown Budapest.
currently it look like this: ![image](https://user-images.githubusercontent.com/43063460/78241978-8e2d0480-74e1-11ea-93ba-46aeb52ebcf6.png) it is confusing as the GitHub Actions cron uses UTC timezone without respecting summer time/winter time clock turns. aim is to display only one...
...or introduce regex, now for some days we are harvesting empty paragraphs: ![image](https://user-images.githubusercontent.com/43063460/67945199-a28d6e00-fbde-11e9-94e9-8081298aa9a8.png)
currently we cannot tell what is the problem if the js stops at one point, then we need to detect manually if the issue is on the free endpoint by...
currently many of the reataurant menus contain: - day names, - multiple spaces (using `\\s\\s+` vas not an option, causes trim error!), - no spaces between words, - incostistent upper...
currently if someone forks the repo is forced to get lots of settings on Hungarian (e.g. MomentJs format, replacementMap.json) the aim is to have a global config to be exported....
the aim is to have two (or three) templates which reduces boilerplates and makes the scrapers maintanable in case of the architecture changes