weibo-crawler issues

Date problem when reading the user_id_list file

2

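When the date in the user_id_list file is only one day in the past, new posts are fetched for some users but not for others (a full crawl works fine), and I don't know why. The date in the user_id_list file is 2022-04-17, today is 2022-04-18, and the accounts have posted since then, yet updates are picked up for some users and not for others.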

cnghw

Headline article URL is empty in crawled Weibo posts

3

I crawled uid 5044429589 (with a cookie set), but the headline article URL column in the resulting Excel file is empty for every row. What could be wrong? ![image](https://user-images.githubusercontent.com/23287853/163302036-cb6ccfbe-399b-4d00-b048-f27e4baa6cc0.png)

sushengbuhuo

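Error writing to the MySQL database

![image](https://user-images.githubusercontent.com/63344963/163291254-9c6615e6-d5f3-42ac-a782-0f8c969873b3.png) The error is shown in the screenshot above. The data is written to the user table correctly, but writing the Weibo posts to the weibo table fails, and the weibo table in the MySQL database is currently empty. ![image](https://user-images.githubusercontent.com/63344963/163291340-748b9cd1-c81b-4a78-95e5-9656595c674c.png) How can this be resolved?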

banlangen1111

list index out of range error

1

2022-04-06 16:38:54,226 - ERROR - weibo.py[:1621] - list index out of range Traceback (most recent call last): File "weibo.py", line 1585, in get_pages if(self.get_user_info() != 0): File "weibo.py", line 334,...

LY12emp

Question about the error KeyError: 'mblog'

9

Hello! I'd like to ask about a problem: when crawling Sina Weibo data, what causes an error like the one below? ![image](https://user-images.githubusercontent.com/61675148/103048707-7004b880-45ca-11eb-8c5e-3ab5a2de44d3.png)

konglq1025
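In general, `KeyError: 'mblog'` means a dictionary was indexed with a key it does not contain. The screenshot does not show which line raised it, so the following is only a minimal sketch of defensive access, assuming the crawler iterates over a list of dictionaries in which some entries lack an `mblog` key. The function `extract_mblogs` and the sample data are hypothetical and are not taken from weibo.py.

```python
def extract_mblogs(cards):
    """Return the 'mblog' payload of every card that has one.

    Hypothetical helper: cards without an 'mblog' key are skipped
    instead of raising KeyError.
    """
    mblogs = []
    for card in cards:
        mblog = card.get("mblog")  # None when the key is missing
        if mblog is None:
            continue
        mblogs.append(mblog)
    return mblogs


# Made-up sample data for illustration only.
sample_cards = [
    {"mblog": {"id": "1"}},
    {"note": "entry without an mblog key"},
    {"mblog": {"id": "2"}},
]
found = extract_mblogs(sample_cards)
```

Skipping entries silently can hide a real problem (for example, a response that contains no posts at all), so logging the skipped entries may be worthwhile.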

Cannot crawl when start_page is too large

1

I set start_page to 200, but nothing gets crawled (the run ends with "Process finished" and no output at all). How can I fix this? Thank you very much!

selinqun

Suddenly unable to fetch Weibo posts from user pages

5

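Hello. After running for two days without problems, my program suddenly started raising errors and can no longer fetch Weibo posts from any user page. I don't think the account was banned: I use an IP pool and have switched secondary accounts several times, and the error persists. [Initial error] ------------------------------已获取寰亚SYHP(5393135816)的第1页微博------------------------------ Traceback (most recent call last): File "E:/code/微博爬虫/weibo评论用户信息爬取/weibo.py", line 747, in get_one_weibo is_long = True if weibo_info.get('pic_num') > 9 else weibo_info.get('isLongText') TypeError: '>' not supported between instances...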

iyiniyin
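The traceback above fails on the comparison `weibo_info.get('pic_num') > 9`. The error message is truncated, so the cause is not confirmed, but one plausible explanation is that `get('pic_num')` returned `None` because the key was missing from the response. A minimal sketch of a guarded version of that check, assuming this is the cause; the function name `is_long_weibo` is hypothetical and not from weibo.py:

```python
def is_long_weibo(weibo_info):
    """Guarded version of the check on the failing line.

    Assumption: 'pic_num' may be missing or None, in which case it is
    treated as 0 pictures instead of being compared directly with 9.
    """
    pic_num = weibo_info.get("pic_num") or 0
    if pic_num > 9:
        return True
    # Unlike the original line, this always returns a bool,
    # even when 'isLongText' is absent.
    return bool(weibo_info.get("isLongText"))


no_fields = is_long_weibo({})
many_pics = is_long_weibo({"pic_num": 12})
long_text = is_long_weibo({"pic_num": None, "isLongText": True})
```

This only avoids the crash. If every post suddenly lacks `pic_num`, the response format itself may have changed, which would need to be checked separately.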

Even with a cookie added, I can only crawl 200 pages; after that, requests time out

9

00exia00

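I've implemented an extension that crawls coordinates. How can I save them to the CSV along with the other data? Any guidance would be appreciated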

2

![微信图片_20220226045245](https://user-images.githubusercontent.com/75606302/155801444-a0f724f0-dd8e-4d4a-ace0-19de627e82f3.png) ![微信图片_20220226045256](https://user-images.githubusercontent.com/75606302/155801448-bef1ff49-7af9-41f1-9da2-e04547769654.png)

lqz19980404
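One generic way to write extra fields such as coordinates next to the existing columns is Python's standard `csv.DictWriter`, with the new column names appended to the header. The sketch below is not how weibo.py writes its CSV; the column names `longitude` and `latitude`, the helper `write_rows`, and the sample rows are all assumptions made for illustration.

```python
import csv
import io


def write_rows(rows, fieldnames, out):
    """Write dict rows as CSV; keys missing from a row become empty cells."""
    writer = csv.DictWriter(out, fieldnames=fieldnames, extrasaction="ignore")
    writer.writeheader()
    for row in rows:
        writer.writerow(row)


# Made-up rows: the second post has no location data.
rows = [
    {"id": "1", "text": "hello", "longitude": 116.40, "latitude": 39.90},
    {"id": "2", "text": "no location"},
]

# An in-memory buffer stands in for a real file opened with newline="".
buffer = io.StringIO()
write_rows(rows, ["id", "text", "longitude", "latitude"], buffer)
csv_text = buffer.getvalue()
```

When writing to a real file, opening it with `newline=""` (as the `csv` module documentation recommends) avoids blank lines between rows on Windows.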

What method does this use to crawl? An API?

1

tx-1011
