WeReadScan

A crawler that scans the books you have purchased on WeRead (微信读书) and downloads them as local PDFs

Results 13 WeReadScan issues

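[竞争战略.pdf](https://github.com/Algebra-FUN/WeReadScan/files/9036206/default.pdf) After downgrading selenium or rewriting the S function, I got the demo running and downloaded a PDF of 《竞争战略》. However, the PDF contains no text. If this works like a screenshot-style scan, the text should still be visible. Is this caused by Tencent's anti-crawler measures? How should we fix it? Could the project owner please take a look when time allows. ![image](https://user-images.githubusercontent.com/48126062/177076504-a9f8ac08-5219-45d0-899c-88753ff9c12c.png)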

Wait for QRCode Scan...0/15turns Login Succeed. Task launching... navigate to https://weread.qq.com/web/reader/50532a905cde3050538da2b preparing to scan "竞争战略" scanning chapter "推荐序1" Traceback (most recent call last): File "/Users/zhigangyang/development/WXRead/WX.py", line 15, in weread.scan2pdf('https://weread.qq.com/web/reader/50532a905cde3050538da2b') File...

https://weread.qq.com/web/reader/9f332400723f409f9f3ffc4kc81322c012c81e728d9d180 Only the first two chapters are downloaded and merged; I am not sure why.

https://ftp.bmp.ovh/imgs/2021/03/3f1bda48705c27b8.png Setting quality makes no difference either: weread.scan2pdf('https://weread.qq.com/web/reader/77d32500721a485577d8eee',quality=300)

WeRead book https://weread.qq.com/web/reader/3d232ad0718487b83d2b2bb : the images for chapter 10 cannot be downloaded, which causes an error when the PDF is merged.
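A possible workaround for this kind of failure is to check the downloaded page images before merging, so that a missing or empty file is reported instead of crashing the merge step. The sketch below is only an illustration under assumptions: it supposes the scanned pages are stored as image files on disk, and `split_usable_images` is a hypothetical helper, not part of WeReadScan.

```python
from pathlib import Path
from typing import Iterable, List, Tuple


def split_usable_images(paths: Iterable[str]) -> Tuple[List[str], List[str]]:
    """Separate image paths into (usable, missing).

    A path counts as usable when it points to an existing, non-empty file.
    Hypothetical helper: how WeReadScan actually stores pages is an assumption.
    """
    usable: List[str] = []
    missing: List[str] = []
    for p in paths:
        path = Path(p)
        if path.is_file() and path.stat().st_size > 0:
            usable.append(p)
        else:
            missing.append(p)
    return usable, missing
```

The `missing` list could then be logged or passed to a re-download step, while only the `usable` list goes into the PDF merge.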

Error screenshot: https://ibb.co/F8C8hW5

Feature request: please add download-failure detection and automatic re-download. https://ftp.bmp.ovh/imgs/2021/03/6570828b6a3675da.png
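The requested behavior could look roughly like the following retry wrapper. This is a minimal sketch, not the project's implementation: `download_with_retry` and its `fetch` callback are hypothetical names, and treating an empty payload as a failed download is an assumption.

```python
import time
from typing import Callable, Optional


def download_with_retry(
    fetch: Callable[[], bytes],
    max_attempts: int = 3,
    delay_seconds: float = 1.0,
) -> Optional[bytes]:
    """Call fetch() until it returns non-empty data or attempts run out.

    Returns the downloaded bytes, or None if every attempt failed.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            data = fetch()
            if data:  # assumption: an empty payload means the download failed
                return data
            print(f"attempt {attempt} returned no data")
        except Exception as exc:  # in real code, narrow this to the expected errors
            print(f"attempt {attempt} failed: {exc}")
        if attempt < max_attempts:
            time.sleep(delay_seconds)  # brief pause before retrying
    return None
```

A caller would wrap whatever function fetches a single page image in `fetch` and treat a `None` result as a detected failure to report to the user.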

ElementClickInterceptedException Traceback (most recent call last) in 27 weread.login() #? login for grab the whole...

raise TimeoutException(message, screen, stacktrace) selenium.common.exceptions.TimeoutException: Message: Stacktrace: #0 0x55dbd32e1e23

![image](https://github.com/Algebra-FUN/WeReadScan/assets/46594083/59ccd0c8-6635-47d3-afe3-e0bf14c8baf4) I am not sure whether this is anti-crawling at work, but as the screenshot shows, I can no longer scrape the main content.