examples-of-web-crawlers icon indicating copy to clipboard operation
examples-of-web-crawlers copied to clipboard

一些非常有趣的python爬虫例子,对新手比较友好,主要爬取淘宝、天猫、微信、微信读书、豆瓣、QQ等网站。(Some interesting examples of python crawlers that are friendly to beginners. )

Results 17 examples-of-web-crawlers issues
Sort by recently updated
recently updated
newest added

背后贵u工业化

扫码登录后,终端显示如下信息: 请扫描二维码登录微信读书... 登录微信读书成功! 用户id:3************* ./导出资料/我的书架.xls【追加】写入【已读完的书籍】数据成功! ./导出资料/我的书架.xls【追加】写入【最近阅读的书籍】数据成功! ./导出资料/我的书架.xls【追加】写入【所有的书籍】数据成功! 0it [00:00, ?it/s] 在导出资料文件夹中:仅有一个我的书架.xls , 我的笔记文件夹为空。

xml.parsers.expat.ExpatError: mismatched tag: line 63, column 4

装了最新的3.9版本,执行第一条就显示以下内容,请问怎么才能运行啊,谢谢。 D:\WX>pip uninstall -y -r requirement.txt ERROR: Exception: Traceback (most recent call last): File "C:\Users\qwh\AppData\Local\Programs\Python\Python39\lib\site-packages\pip\_internal\cli\base_command.py", line 167, in exc_logging_wrapper status = run_func(*args) File "C:\Users\qwh\AppData\Local\Programs\Python\Python39\lib\site-packages\pip\_internal\commands\uninstall.py", line 78, in run for parsed_req...

C:\Users\Administrator\Downloads\examples-of-web-crawlers\2.天猫商品数据爬虫(已模拟登录)>tmall_crawler.py DevTools listening on ws://127.0.0.1:57658/devtools/browser/ce173f91-456f-46c5-b389-189dd26c187f [21908:20924:0630/184159.033:ERROR:device_event_log_impl.cc(208)] [18:41:59.033] Bluetooth: bluetooth_adapter_winrt.cc:1150 RequestRadioAccessAsync failed: RadioAccessStatus::DeniedByUserWill not be able to change radio power. Traceback (most recent call last): File "C:\Users\Administrator\Downloads\examples-of-web-crawlers\2.天猫商品数据爬虫(已模拟登录)\tmall_crawler.py", line 216, in a.login()...

验证码这关过不去啊/(ㄒoㄒ)/~~

登录成功之后,系统有时候会出现要求再次进行滑条,这个时候就滑不过去了。手动去滑也不行

ERROR: Could not find a version that satisfies the requirement PyQt5==5.13.0 (from versions: 5.12.3, 5.14.0, 5.14.1, 5.14.2, 5.15.0, 5.15.1, 5.15.2, 5.15.3, 5.15.4, 5.15.5, 5.15.6, 5.15.7) ERROR: No matching distribution found...

采用selenium4.9.0打开Chrome时报错 ![屏幕截图 2023-12-15 232845](https://github.com/shengqiangzhang/examples-of-web-crawlers/assets/117899886/4eeb7e9f-d780-443e-9e6f-20b3eb78d07a) 驱动路径设置为 ![image](https://github.com/shengqiangzhang/examples-of-web-crawlers/assets/117899886/d5d84142-00e0-4a69-93ea-7cbc09b96de3) 驱动版本正确

![image](https://github.com/shengqiangzhang/examples-of-web-crawlers/assets/78016475/0d63e9f0-c2a4-4e73-a7ad-3ef4d9b86219)