weibo_terminater icon indicating copy to clipboard operation
weibo_terminater copied to clipboard

Final Weibo Crawler Scrap Anything From Weibo, comments, weibo contents, followers, anything. The Terminator

Results 13 weibo_terminater issues
Sort by recently updated
recently updated
newest added

all_comment_pages 这个如果没有评论的话,会报出out of index

bug

作者你好!关注到您4年前发起的微博终结者项目,想请问,项目爬取的语料是否保存,开源?应该如何获取?谢谢您!

类似这样的: ![微信图片_20200806164920](https://user-images.githubusercontent.com/1441981/89511695-d5673100-d804-11ea-97c2-ed663c2abab9.jpg) 自有资金创业,目前有3家公司很多项目为这个新项目注入现金流。 感兴趣的话加我wechat: pinball1973 谢啦!

"身陷囹圄"是坐牢的意思吧?

报错内容: error, account id [email protected] is not valid, pass this account, you can edit it and then update cookies. PhantomJS 也装了 MAC\win\云上的linux都是这条错误,上述三台主机IP也不在一个网段 Any suggestions?

出现错误:error, account id 5403168675 is not valid, pass this account, you can edit it and then update cookies. 请问这是什么原因啊,微博账号在网页能正常登陆,

由于存在微博账号登录在不同环境下表现不同的问题(#47),建议添加ip代理, 以下代码仅供参考: ``` import re import requests import pymysql import time import random class SpiderProxy(object): def __init__(self): self.req = requests.Session() self.headers = { 'Accept-Encoding': 'gzip, deflate, br', 'Accept-Language': 'zh-CN,zh;q=0.8', 'Referer':...

查看了一下代码,没有找到读取文件多id的代码实现。还请指教~ ``` def _init_multi_mode(self): pass ```

![screenshot from 2017-10-18 17-24-54-a](https://user-images.githubusercontent.com/14347369/31711103-a8f5f8f6-b429-11e7-8ed4-5faa78a9d8fc.png) 我这爱豆发博比较长些…… 顺便,可以在哪改每爬一页休息五分钟这个设定? 谢谢