Jason Ward
It would not be difficult to create a custom parser that strips out all the tags. It's something we've wanted to include anyway, so if you end up using that...
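For what it's worth, a tag-stripping parser along those lines could look roughly like the sketch below. It assumes Python 3's standard `html.parser` module; the `TagStripper` class and `strip_tags` helper are hypothetical names made up for illustration, not anything that exists in the project.

```python
from html.parser import HTMLParser


class TagStripper(HTMLParser):
    """Hypothetical helper: keeps text content and drops every tag."""

    def __init__(self):
        # convert_charrefs=True turns entities such as &amp; into plain text.
        super().__init__(convert_charrefs=True)
        self._chunks = []

    def handle_data(self, data):
        # Called for the text between tags; the tags themselves are ignored
        # because no handle_starttag/handle_endtag override is defined.
        self._chunks.append(data)

    def get_text(self):
        return "".join(self._chunks)


def strip_tags(html):
    """Return the text of `html` with all tags removed."""
    parser = TagStripper()
    parser.feed(html)
    parser.close()  # flush any text still buffered by the parser
    return parser.get_text()
```

Calling `strip_tags("<p>Hello <b>world</b></p>")` gives back just the text. Note that this simple version also keeps the contents of `<script>` and `<style>` elements, so a real implementation would probably want to skip those.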
I saw this as well. The issue is fixed in Python 2.7 because of an update to HTMLParser, so you should be able to backport the 2.7 version to fix it.
I created a backport and put it on PyPI: https://pypi.python.org/pypi/HTMLParser
Have you tried encoding the contents, e.g. `f.read().encode('utf8')`?
Thanks for the pull request :)
@JeffSpies it seems I lost permission to merge pull requests in the move. It's also likely that our Travis CI setup will need to be hooked up again.
I just ran into this when updating from v0.10.8 (the version right before the bug).