python_scripts issues

Zhihu auto-login: captcha problem

Hello, a question: Zhihu's captcha has now changed to upside-down Chinese characters. How should this be handled?

A small change

Save the obtained cookies to the file cookies.txt so they can be reused later. There were two `if __name__ == '__main__':` lines; one of them was redundant, so I removed it.
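As a rough illustration of saving cookies to cookies.txt and reading them back, here is a minimal sketch using only the standard library's `http.cookiejar`. The helper names (`make_cookie`, `save_cookies`, `load_cookies`) and the hand-built cookie are assumptions for the example, not the code from this change; in the real script the cookies would come from the login response.

```python
import http.cookiejar


def make_cookie(name, value, domain):
    """Build a session cookie by hand (stands in for cookies set by a real login)."""
    return http.cookiejar.Cookie(
        version=0, name=name, value=value,
        port=None, port_specified=False,
        domain=domain, domain_specified=True, domain_initial_dot=False,
        path="/", path_specified=True,
        secure=False, expires=None, discard=True,
        comment=None, comment_url=None, rest={},
    )


def save_cookies(jar, filename="cookies.txt"):
    # ignore_discard=True keeps session cookies, which are otherwise skipped on save.
    jar.save(filename, ignore_discard=True, ignore_expires=True)


def load_cookies(filename="cookies.txt"):
    jar = http.cookiejar.LWPCookieJar()
    jar.load(filename, ignore_discard=True, ignore_expires=True)
    return jar


if __name__ == "__main__":  # a single entry-point guard is enough
    jar = http.cookiejar.LWPCookieJar()
    jar.set_cookie(make_cookie("session", "abc123", "example.com"))
    save_cookies(jar)
    print([(c.name, c.value) for c in load_cookies()])
```

Passing `ignore_discard=True` matters here because login cookies are often session cookies with no expiry date, and those are dropped by default when the jar is written to disk.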

(unicode error) 'utf-8'

File "crawler.py", line 35
    """
SyntaxError: (unicode error) 'utf-8' codec can't decode byte 0xc5 in position 5: invalid continuation byte
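Python 3 reads source files as UTF-8 by default, so this error suggests that crawler.py was saved in a different encoding. One possible workaround is to re-save the file as UTF-8. The sketch below assumes the original encoding is GBK, which is only a guess based on the Chinese text in the file; the function name `reencode_to_utf8` and its `fallback` parameter are hypothetical.

```python
def reencode_to_utf8(src_path, dst_path, fallback="gbk"):
    """Rewrite a text file as UTF-8, decoding with `fallback` if it is not UTF-8 already."""
    with open(src_path, "rb") as f:
        raw = f.read()
    try:
        text = raw.decode("utf-8")
    except UnicodeDecodeError:
        # Assumption: the file was saved in the fallback encoding (GBK by default).
        text = raw.decode(fallback)
    # newline="" writes line endings exactly as they were read.
    with open(dst_path, "w", encoding="utf-8", newline="") as f:
        f.write(text)
    return text
```

For example, `reencode_to_utf8("crawler.py", "crawler_utf8.py")` would write a UTF-8 copy next to the original. If the guess about the encoding is wrong, the fallback decode may fail or produce garbled text, so it is worth checking the output before replacing the original file.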

xiaoqf96

About BeautifulSoup 3 not supporting Python 2

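Regarding the statement that BeautifulSoup 3 does not support Python 2: did the author make a mistake? Shouldn't it be Python 3 that is not supported?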

jiangsujww

Update runoob2pdf.py

fix bugs in ‘func’ function and improve its performance

SmithLiu95

Center the h1 tag: body.find('h1')['style'] = "text-align:center;"

1

```
# Set the centering style on the h1 tag
body.find('h1')['style'] = "text-align:center;"
```

For this to work, use

```
body = soup.find(class_="article-intro")
# body = soup.find_all(class_="article-intro")
# If you use find_all, you then need html = h[1:-1] afterwards to strip the leading [ and trailing ]
```

pendave

Correcting an error in the image regular expression

```
def func(m):
    if not m.group(3).startswith("http"):
        rtn = m.group(1) + get_domain(url) + "/" + m.group(2) + m.group(3)
        #rtn = m.group(1) + domain + m.group(2) + m.group(3)
        return rtn
    else:
        return...
```
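For illustration, here is a self-contained sketch of the same idea: a replacement callback passed to `re.sub` that turns relative image URLs into absolute ones and leaves `http` URLs untouched. The pattern `IMG_SRC`, the function name `absolutize_img_src`, and the use of `urllib.parse.urljoin` in place of `get_domain(url)` are assumptions made for the example; the pattern actually used in the script is not shown in this issue.

```python
import re
from urllib.parse import urljoin

# Hypothetical pattern: group 1 = everything up to the opening quote of src,
# group 2 = the URL itself, group 3 = the closing quote.
IMG_SRC = re.compile(r'(<img\b[^>]*?\bsrc=")([^"]*)(")')


def absolutize_img_src(html, page_url):
    """Rewrite relative <img src="..."> values against page_url."""
    def func(m):
        src = m.group(2)
        if src.startswith("http"):
            return m.group(0)  # already absolute: keep the match unchanged
        return m.group(1) + urljoin(page_url, src) + m.group(3)

    return IMG_SRC.sub(func, html)
```

Using `urljoin` avoids having to add or strip the `/` between the domain and the path by hand, since it resolves both root-relative paths such as `/images/a.png` and page-relative paths such as `img/c.png`.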

pendave

ImportError: No module named 'pdfkit'

2

root@raspberrypi:/home/pi/python/crawler_html2pdf/pdf# python3 crawler.py
Traceback (most recent call last):
  File "crawler.py", line 14, in <module>
    import pdfkit
ImportError: No module named 'pdfkit'

Why is this happening?
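This error usually means that pdfkit is not installed for the interpreter that runs the script (here `python3`); a package installed for Python 2 or for another environment is not visible to it. A minimal sketch for checking this from the same interpreter, using only the standard library (the helper name `is_installed` is hypothetical):

```python
import importlib.util
import sys


def is_installed(module_name):
    """Return True if the current interpreter can locate the top-level module."""
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    print("interpreter:", sys.executable)
    print("pdfkit available:", is_installed("pdfkit"))
```

If this reports that pdfkit is unavailable, running `python3 -m pip install pdfkit` installs it for that same interpreter. Note that pdfkit is a wrapper around wkhtmltopdf, which has to be installed separately.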

xpguan