weibo-crawler

A Sina Weibo crawler: uses Python to crawl Sina Weibo data and download Weibo images and videos.

Results 248 weibo-crawler issues

'since_date'
Traceback (most recent call last):
  File "C:\Users\cnghw\Desktop\weibo-crawler-masterN\weibo.py", line 1849, in get_pages
    if self.get_user_info() != 0:
  File "C:\Users\cnghw\Desktop\weibo-crawler-masterN\weibo.py", line 388, in get_user_info
    self.user_to_database()
  File "C:\Users\cnghw\Desktop\weibo-crawler-masterN\weibo.py", line 320, in user_to_database
    self.user_to_csv()...

First, I can only crawl 66 pages. Second, what should I do when my cookie expires?

A single ID works without problems, but this error occurs when the IDs are read from a txt file. ![image](https://user-images.githubusercontent.com/126164609/236419926-e3520615-4003-462e-8b2b-fdf8f0088e43.png)

Could you elaborate on crawling Weibo automatically on a schedule? Could it even run without being opened, like an RSS subscription?

Could the user's registration date be added to the user information?

Is it possible to crawl only the Weibo posts, without user information, and simply remove the user key and its value from the output JSON file?
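As a possible workaround for the request above, the user entry can be stripped from the output after the crawl finishes. The following is only a minimal post-processing sketch, not part of weibo-crawler: it assumes the output is a JSON object with a top-level `user` key, and the helper names `strip_user` and `strip_user_from_file` are hypothetical.

```python
import json
from pathlib import Path


def strip_user(data: dict) -> dict:
    """Return a shallow copy of the parsed JSON without the top-level "user" key.

    Assumption: the crawler's output is a JSON object whose user information
    sits under a top-level key named "user".
    """
    return {key: value for key, value in data.items() if key != "user"}


def strip_user_from_file(src: str, dst: str) -> None:
    """Read the JSON file at src, drop the "user" key, and write the result to dst."""
    data = json.loads(Path(src).read_text(encoding="utf-8"))
    Path(dst).write_text(
        # ensure_ascii=False keeps Chinese text readable in the output file.
        json.dumps(strip_user(data), ensure_ascii=False, indent=2),
        encoding="utf-8",
    )
```

Calling `strip_user_from_file("in.json", "out.json")` with placeholder paths leaves every other key untouched, so the posts themselves are preserved.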

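After downloading the latest version and running weibo.py, it automatically crawls the posts of "Lo娘嘀咕嘀咕" (6074526225) every time. Even though the target to crawl is specified in config.json, the program still crawls only "Lo娘嘀咕嘀咕" (6074526225). On inspection, the information for "Lo娘嘀咕嘀咕" (6074526225) appears in js.json. I tried deleting it, but the result is the same.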

stat: path should be string, bytes, os.PathLike or integer, not NoneType
Traceback (most recent call last):
  File "D:\weibo-crawler-master4\weibo-crawler-master\weibo.py", line 1883, in get_pages
    self.write_data(wrote_count)
  File "D:\weibo-crawler-master4\weibo-crawler-master\weibo.py", line 1836, in write_data
    self.write_csv(wrote_count)...

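With a proxy enabled, Weibo cannot be crawled: no error is reported, but no Weibo content is fetched and no file is generated. Crawling only works with the proxy turned off.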