newspaper4k
newspaper4k copied to clipboard
gibberish in Chinese web site
Issue by Jenson66
Thu Apr 19 13:46:56 2018
Originally opened as https://github.com/codelucas/newspaper/issues/555
url='http://news.dichan.sina.com.cn/2018/04/12/1257865.html'
article = Article(url, language='zh')
article.download()
article.parse()
print(article.title)
print(article.text)
It's not work, the output of Chinese is scrambled.