crawly
crawly copied to clipboard
Feature request: make request storage configurable and pluggable
As far as I understand, requests are necessarily stored in a GenServer's state (Crawly.RequestStorageWorker
) and it is not possible to plug its own storage module.
This has some disadvantages:
- it doesn't support restart of the server
- it cannot be distributed on several nodes
It would be nice if this module was configurable. That would make backends such as Mnesia or SQL databases possible, enabling distributed crawling.
Yes, it's absolutely true. We had plans both for Request/Items storages to be able to share and recover states... however it turned out that for now, we did not have that demand. At least not yet, so this part of the work was abandoned for a while.