scrapy-redis icon indicating copy to clipboard operation
scrapy-redis copied to clipboard

Is possible to use bloom filter on scrapy-redis?

Open zhangslob opened this issue 6 years ago • 4 comments

I offen come up a situation using scrapy-redis, with too many dupefilters in redis, may be 1 trillions or more.My redis has only 16GB...

Others has made ScrapyRedisBloomFilter which is only for scrapy but not for scrapy-redis. So how can I use BloomFilter in scrapy-redis?

Looking forward to your reply~

zhangslob avatar Mar 26 '19 07:03 zhangslob

可以的,大佬,我好像读了你的知乎 链接:https://pan.baidu.com/s/1hryRHoe8GM1kpy4aBrq7nw 提取码:j5iu ,这个好像讲了

LIpengcccc avatar May 23 '19 02:05 LIpengcccc

@LIpengcccc Thanks!

zhangslob avatar May 23 '19 09:05 zhangslob

The bloom filter requires your own implementation. It only provides redis connections and there is no way to use rebloom directly, 布隆过滤器需要你自己实现,它只是提供redis连接,也没有办法直接使用rebloom

bqlwx111 avatar Aug 06 '19 10:08 bqlwx111

https://github.com/leffss/ScrapyRedisBloomFilterBlockCluster

leffss avatar Dec 19 '19 09:12 leffss