Google Code Exporter
Google Code Exporter
``` Building from source will fix this issue. ``` Original comment by `[email protected]` on 24 Jan 2013 at 8:33
``` I have done a build from SVN. Still Japanese URL gives bad results. Same works gr8 on the Web App. Code snippet below: URL url = new URL("http://d.hatena.ne.jp/mkusunok/20130817/p1"); String...
``` Issue 45 has been merged into this issue. ``` Original comment by `ckkohl79` on 25 Mar 2012 at 2:12
``` Adding FORM as an ignorable element at the highlighter (but not at the Extractor itself) has two disadvantages: 1. The highlighted HTML will not be consistent with the TextDocument's...
``` Adding FORM as an ignorable element at the highlighter (but not at the Extractor itself) has two disadvantages: 1. The highlighted HTML will not be consistent with the TextDocument's...
``` Hi François, thanks for pointing this out. The addition of meta and base was a deliberate decision (it was just easier to append it in front of the highlighted...
``` NAV, FOOTER, and HEADER should also help eliminate chunks of unwanted text. ``` Original comment by `[email protected]` on 15 Mar 2012 at 8:13
``` Sample HTML5 article with appropriate use of some of the tags mentioned above: http://www.forbes.com/sites/forbestravelguide/2012/01/19/the-best-international- airports-for-layovers/ ``` Original comment by `[email protected]` on 22 Mar 2012 at 8:53
``` Thanks for this request. The source code of boilerpipe-web might indeed be released as Open-Source at some point in time. Right now, I don't see it as a high...
``` The results between demo at app engine and my local boilerpipe project are not same. The app engine demo gets better results for same urls. ``` Original comment by...