article.content only contains a part of the content

Open westlinkin opened this issue 7 years ago • 2 comments

First of all, great great library! You've done a wonderful job here.

When I use this URL, the result is wrong. article.content only contains part of the content; here is the value:

<div class="field field-paragraph field-paragraph--full field-type-text-long field-type-text-long--full"><p>“It was the saddest movie I've ever filmed, to be honest with you. I've never had a more difficult film to film,” Olmos lamented. “It was too close to the time when she was actually killed, it was only 13 months after when we were filming. Nobody wanted to film it, the parents didn't, we didn't, nobody wanted to. We'd rather she be alive. But we had to."</p></div>

If you click on the link, you'll see article.content only contains the first paragraph.

westlinkin, Oct 24 '17 06:10

Yes, this is one of the biggest limitations of this lib: it doesn't work well on deeply nested HTML structures:

image

wong2, Nov 01 '17 04:11
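
To illustrate why nesting can produce this symptom, here is a minimal sketch of a toy scoring heuristic. It is not this library's actual algorithm, and every name in it (`DirectParentScorer`, `best_container`, the sample markup and ids) is hypothetical. It assumes an extractor that credits each paragraph's text length to its direct parent only and then returns the single highest-scoring element. It also assumes well-formed input with no void elements such as `<br>`.

```python
from html.parser import HTMLParser


class DirectParentScorer(HTMLParser):
    """Toy heuristic: credit each <p>'s text length to its direct parent only."""

    def __init__(self):
        super().__init__()
        self.stack = []       # indices of the currently open elements
        self.labels = []      # id attribute (or tag name) per element
        self.scores = []      # paragraph text length credited to each element
        self.p_parent = None  # parent index of the <p> that is currently open

    def handle_starttag(self, tag, attrs):
        idx = len(self.labels)
        self.labels.append(dict(attrs).get("id") or tag)
        self.scores.append(0)
        if tag == "p" and self.stack:
            self.p_parent = self.stack[-1]
        self.stack.append(idx)

    def handle_endtag(self, tag):
        if tag == "p":
            self.p_parent = None
        if self.stack:
            self.stack.pop()

    def handle_data(self, data):
        # Only text inside a <p> counts, and only toward that <p>'s parent.
        if self.p_parent is not None:
            self.scores[self.p_parent] += len(data.strip())


def best_container(html):
    """Return (label, score) of the single highest-scoring element."""
    scorer = DirectParentScorer()
    scorer.feed(html)
    scorer.close()
    best = max(range(len(scorer.scores)), key=scorer.scores.__getitem__)
    return scorer.labels[best], scorer.scores[best]


# Hypothetical markup: the same three paragraphs, flat vs. individually wrapped.
FLAT = (
    '<div id="article"><p>aaaaaaaaaa</p><p>bbbbbbbbbbbb</p><p>cccc</p></div>'
)
NESTED = (
    '<div id="article">'
    '<div id="w1"><p>aaaaaaaaaa</p></div>'
    '<div id="w2"><p>bbbbbbbbbbbb</p></div>'
    '<div id="w3"><p>cccc</p></div>'
    '</div>'
)

print(best_container(FLAT))
print(best_container(NESTED))
```

With the flat markup the outer `article` div collects all three paragraphs and wins. With one wrapper div per paragraph, the score is split across the wrappers, so a single wrapper holding one paragraph wins. That matches the shape of the output reported above, where the returned div contains a single paragraph.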

Yes, we are also facing the same kind of issue. It doesn't return the full content of the HTML; it returns some random div from the page. I used this link here

raju1988, Sep 16 '19 09:09