python-boilerpipe
python-boilerpipe copied to clipboard
python-boilerpipe setup.py fails using Python3
setup.py fails due to changes in urllib package and unicode() function. With the following changes, build succeeds on Windows8.1/Cygwin and Mac OS X 10.9.5. Other software: Oracle JDK 8u20, latest build of JPype1-py3. Python3 changes to python-boilerpipe-master/src/boilerpipe/extract/init.py import urllib.request # line 2 request = urllib.request.Request(kwargs['url'], headers=self.headers) # line 35 connection = urllib.request.urlopen(request) # line 36 self.data = str(self.data, encoding) # line 41 self.data = str(self.data, charade.detect(self.data)['encoding']) # line 45
In case anyone is interested in an update since 2014... I was able to run setup.py on python 3.5 in virtualenv on macOS 10.12.5 today. I was able to extract some text from a website with the default extractor.