Google Code Exporter

Results: 21344 comments by Google Code Exporter

``` I think the problem is that they do not send a User-Agent header when requesting the HTML, which causes a 403 error on some websites, but you...
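The 403 described above can often be avoided by fetching the page yourself with an explicit User-Agent header and handing the resulting HTML string to the extractor. The following is a minimal sketch under stated assumptions, not the library's own fetch code: the class name `UserAgentFetch`, the helper names, the User-Agent value, and the UTF-8 decoding are all hypothetical choices made for illustration.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.net.HttpURLConnection;
import java.net.URI;
import java.nio.charset.StandardCharsets;

public class UserAgentFetch {
    // Hypothetical User-Agent value, chosen only for illustration.
    static final String USER_AGENT = "Mozilla/5.0 (compatible; ExampleFetcher/1.0)";

    // Prepares (but does not yet open) a connection that carries a User-Agent header.
    static HttpURLConnection open(String url, String userAgent) throws IOException {
        HttpURLConnection conn =
                (HttpURLConnection) URI.create(url).toURL().openConnection();
        conn.setRequestProperty("User-Agent", userAgent);
        conn.setConnectTimeout(10_000); // milliseconds
        conn.setReadTimeout(10_000);    // milliseconds
        return conn;
    }

    // Downloads the page body as a string. Assumes the page is UTF-8 encoded;
    // a real fetcher would inspect the Content-Type header instead.
    static String fetchHtml(String url) throws IOException {
        HttpURLConnection conn = open(url, USER_AGENT);
        try (BufferedReader reader = new BufferedReader(
                new InputStreamReader(conn.getInputStream(), StandardCharsets.UTF_8))) {
            StringBuilder html = new StringBuilder();
            String line;
            while ((line = reader.readLine()) != null) {
                html.append(line).append('\n');
            }
            return html.toString();
        } finally {
            conn.disconnect();
        }
    }
}
```

The string returned by `fetchHtml` can then be passed to whatever extraction call you already use on in-memory HTML, instead of letting the library request the URL itself.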

``` I'm wondering: as a workaround, can we have an unpatched boilerpipe jar and configure boilerpipe to use tagsoup in place of nekohtml? Struggling to find documentation on how this...

``` Also filed here: https://code.google.com/p/boilerpipe/issues/detail?id=81 ``` Original comment by `[email protected]` on 19 Nov 2014 at 9:38

``` The hot patch is a workaround for this issue: http://sourceforge.net/p/nekohtml/bugs/98/ ``` Original comment by `[email protected]` on 19 Nov 2014 at 9:39

``` What you should do when you want to hot patch a library is to repackage the entire library source using a different package name. There are build tools that...

``` Seems this library is no longer maintained... can we fix this by contributing to this project? ``` Original comment by `[email protected]` on 20 Nov 2014 at 1:58

``` Please raise this issue at nekohtml. The project is not dead. Given the limitations of Google Code and SVN (and the limitations of my spare time) I just can't/won't...

``` The above error got resolved. I did not select them in the Order and Export in Java Build Path. ``` Original comment by `[email protected]` on 3 Dec 2012 at...

``` I have selected them in the Order and Export in Java Build Path, but I still get the same errors. ``` Original comment by `[email protected]` on 10 Apr 2014 at...

``` Here is what I did:
ArticleExtractor EXTRACTOR = ArticleExtractor.getInstance();
HTMLHighlighter HH = HTMLHighlighter.newExtractingInstance();
InputSource inputSource = new InputSource(new StringInputStream(html));
TextDocument htmlDoc = new BoilerpipeSAXInput(inputSource).getTextDocument();
EXTRACTOR.process(htmlDoc);
html = HH.process(htmlDoc, html);...