Readability4J icon indicating copy to clipboard operation
Readability4J copied to clipboard

Prune aside tags

Open lenzls opened this issue 4 years ago • 0 comments

Hello,

mozilla's readbility filters out <aside> tags before processing the html further, as can be seen in https://github.com/mozilla/readability/blob/master/Readability.js#L633.

Readbility4J however does not do this https://github.com/dankito/Readability4J/blob/master/src/main/kotlin/net/dankito/readability4j/processor/ArticleGrabber.kt#L753

I understood, that this library tries to be an exact copy of the one from mozilla, so I want to file this as a bug.

regards and thanks for the awesome software!

Simon

lenzls avatar Jul 30 '20 14:07 lenzls