crawler-python
The Yelp crawler is too slow.
We need to optimize the review-crawling part (i.e., the reviews_in_page function).
Options
- Use lxml instead of BeautifulSoup to extract the information we want?
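
A rough sketch of what the lxml option might look like. The current signature of reviews_in_page is not shown here, so this assumes it takes the page's HTML as a string and returns a list of dicts. The sample markup and the class names ("review", "rating", "text") are made-up placeholders, not Yelp's real ones. Whether this is actually faster would still need to be measured on real pages.

```python
from lxml import html

# Hypothetical markup: these class names are placeholders, not Yelp's real ones.
SAMPLE_PAGE = """
<html><body>
  <div class="review"><span class="rating">5</span><p class="text">Great food.</p></div>
  <div class="review"><span class="rating">2</span><p class="text">Slow service.</p></div>
</body></html>
"""


def reviews_in_page(page_source):
    """Parse the page once with lxml and return one dict per review block.

    The input (an HTML string) and the output shape are assumptions;
    adapt both to what the crawler actually passes around.
    """
    tree = html.fromstring(page_source)
    reviews = []
    # One XPath query selects all review containers; fields are then read
    # relative to each container, so the page is not searched repeatedly.
    for node in tree.xpath('//div[@class="review"]'):
        rating = node.xpath('string(.//span[@class="rating"])').strip()
        text = node.xpath('string(.//p[@class="text"])').strip()
        reviews.append({"rating": rating, "text": text})
    return reviews


if __name__ == "__main__":
    for review in reviews_in_page(SAMPLE_PAGE):
        print(review)
```

If rewriting the extraction code is too invasive, a smaller step might be to keep BeautifulSoup and pass "lxml" as its parser, e.g. BeautifulSoup(page_source, "lxml"), then compare timings of both variants.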