scrapy-training
Reddit spider in unit 1 is no longer valid
Hi,
The spider supplied in unit 1 for scraping reddit.com no longer appears to be valid.
Their newer site appears to use React styled components, which generate the CSS class names automatically. Relying on those class names in selectors is probably unwise, since they would make the spider hard to maintain.
On the plus side, appending .json to the URLs returns a JSON document: https://www.reddit.com/r/programming.json
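For what it's worth, here is a rough sketch of how the JSON document could be parsed instead of relying on CSS selectors. It assumes the listing has the shape `data -> children -> data` with `title` and `url` fields on each post, which is what I believe the endpoint returns, but I have not checked it against the unit 1 material. The function name `extract_posts` and the `SAMPLE` payload are made up for illustration.

```python
import json


def extract_posts(payload):
    """Return a list of {'title', 'url'} dicts from a Reddit listing JSON string.

    Assumes the layout {"data": {"children": [{"data": {...}}, ...]}};
    missing keys fall back to empty containers or None instead of raising.
    """
    listing = json.loads(payload)
    children = listing.get("data", {}).get("children", [])
    posts = []
    for child in children:
        post = child.get("data", {})
        posts.append({"title": post.get("title"), "url": post.get("url")})
    return posts


# Hypothetical, hand-written payload that mimics the assumed layout.
SAMPLE = json.dumps({
    "kind": "Listing",
    "data": {
        "children": [
            {"kind": "t3", "data": {"title": "First post", "url": "https://example.com/1"}},
            {"kind": "t3", "data": {"title": "Second post", "url": "https://example.com/2"}},
        ]
    },
})

if __name__ == "__main__":
    for post in extract_posts(SAMPLE):
        print(post["title"], post["url"])
```

In a spider, the `parse` callback could presumably pass `response.text` to a helper like this and yield each resulting dict as an item, with `start_urls` pointing at the `.json` URL.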