PythonSpiderNotes
PythonSpiderNotes copied to clipboard
Python入门网络爬虫之精华版
### Description of the Bug 1.Hi Developers im new to coding. i tried to make this video uploading app to my firebase storage but im not that good at coding...
网易新闻爬取项目输出的txt文档出现乱码, 当15,16行做出如下修改后,输出的txt文档正常了。 ``` with open(path, "w+") as fp: for s in slist: print(s[0], s[1]) fp.write("%s\t\t%s\n\n" %(s[0],s[1])) ```
r = session.post('http://www.zhihu.com/login/email', data=loginData, headers=header) json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0) 为什么不能转化为 json呢?是不是 知乎的登陆路径错了呢?
crd@crd-PC:~/Desktop/ccgp_crack/PythonSpiderNotes/Captcha1$ python tess_test.py PNG (200, 70) 1 Traceback (most recent call last): File "tess_test.py", line 224, in filter_enhance(midu_image_name, midu_image_name_pro1) File "tess_test.py", line 91, in filter_enhance im = enhancer.enhance(2) File "/usr/local/lib/python2.7/dist-packages/PIL/ImageEnhance.py",...
可以讲个cluster的专题,结合kafka或者redis之类的
我最近也在学习下Python的网络爬虫,非常感谢你的分享。 我今天在搭建好环境后尝试Spider_Python项目时遇到一个问题,无法连接mongodb,提示的错误是pymongo不存在Connection模块,然后我在网上找了下pymongo的用法,做了如下修改后可以正常运行并存入mongodb。 ` # 连接数据库,db和posts为数据库和集合的游标 def Connection(self): #connect to mongo(localhost:27017) mongoclient = pymongo.MongoClient() mongodb = mongoclient[self.database] posts = mongodb.posts return posts `