[WIP] Added Cassandra-Backend
Added Cassandra as Backend on base of SQLAlchemy Code.
In general looks awesome! It definitely requires some small changes, but this is already a great contribution!
So tests are broken, we either have to find a way how to test it with Travis CI, or disable this test for now.
How can i activate an re-test in travis after i've made changes?
OK, my fork now runs without errors through travis: https://travis-ci.org/wpxgit/frontera/builds/120262078
I'm marking these PR as WIP. Meaning it's Work in Progress, and we shouldn't merge it. OK?
Hey @wpxgit what do you think of all that? Do you plan to contribute more?
Hello @sibiryakov @wpxgit , is there any plan to continue development?
I haven't heard anything @maisumbruno . At Scrapinghub we're fine with HBase so far.
We are comfortable with how Cassandra works. If there are no plans to implement, @sibiryakov would there be any hints on how I can do this myself?
@maisumbruno Definitely. I would recommend to inspire from HBaseBackend, where you can find a queue suitable for large scale crawling. Also you can start implementing it by parts, say first States, then Queue and Metadata if needed. You can send a PR any time and I'll have a look.
But you know, the most important part is battle testing, on a large volume storages are starting to work slower and this often require refactoring, schema change or various optimizations.
Thanks @sibiryakov