geek_crawler issues

可以下载指定课程

在原来的代码基础上简单的修改了一下，实现下载指定的课程修改点1.使用原来的exclude变量，存储想要下载的课程，大概在539行左右 ``` # 将exclude设置为指定要爬取的文章 exclude = ['快速上手C++数据结构与算法'] ``` 修改点2.将297行左右的 ``` if product.get('title', '') in self.exclude: 修改为 if product.get('title', '') not in self.exclude: ```

yunCrush

图片我看是超链接，会一直有效吗

1

L-Block-C

非法图形验证码

1

![image](https://user-images.githubusercontent.com/38487617/150719963-d1a47f0d-8083-4a84-aff9-6b2a1b25375c.png)

Brannua

抓取报错

1

大神来看下呀： /Users/bo/PycharmProjects/pythonProject/main.py[line:550] - ERROR: 请求过程中出错了，出错信息为：Traceback (most recent call last): File "/Users/bo/PycharmProjects/pythonProject/main.py", line 547, in run(cellphone, pwd, exclude=exclude, get_comments=get_comments) File "/Users/bo/PycharmProjects/pythonProject/main.py", line 513, in run geek._article(aid, pro, file_type=file_type, get_comments=get_comments) # 获取单个文章的信息...

noood

返回的文章列表不能大于100

2

在一个专栏里有大于100个的文章时，该脚本最大只能保存100个文章。查看代码后发现 _articles 方法中的 'data = res.json().get('data', {})' 返回值中的list最大只有100。如图： ![image](https://user-images.githubusercontent.com/58510192/125604313-0ad9f472-7c8a-464d-8318-e6865103341a.png)

sunset-x