scrapelib icon indicating copy to clipboard operation
scrapelib copied to clipboard

Add ability to cache via redis or memcache

Open patcon opened this issue 9 years ago • 2 comments

Any preference? My pupa scraper is running on heroku, so filesystem-based caches are non-ideal :)

Would you accept a PR for one of these options?

patcon avatar Aug 08 '16 22:08 patcon

if you're running in production it might be best to disable as you likely won't use the cache as it isn't intended for actual HTTP caching but development. but if you want it, either is fine by me! much appreciated

On Aug 8, 2016 6:10 PM, "Patrick Connolly" [email protected] wrote:

Any preference? My pupa scraper is running on heroku, so filesystem-based caches are non-ideal :)

Would you accept a PR for one of these options?

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/jamesturk/scrapelib/issues/34, or mute the thread https://github.com/notifications/unsubscribe-auth/AAAfYvMc90WBO_1rqcQNBmqCVTUgiSDkks5qd6lAgaJpZM4Jfhth .

jamesturk avatar Aug 08 '16 22:08 jamesturk

Thanks! I see what you're saying, but the city of toronto started throttling the heck out of my scrapers, so I'm trying to sort out how to not restart each time I figure out the limit ;)

patcon avatar Aug 09 '16 19:08 patcon