node-unfluff icon indicating copy to clipboard operation
node-unfluff copied to clipboard

Doesn't seem to work for sites that use <div> tags instead of <p>

Open iannshan opened this issue 8 years ago • 2 comments

I tried this with a CNN.com article and it didn't work because they don't use paragraphs. Any suggestions for a work-around?

iannshan avatar Nov 03 '16 02:11 iannshan

What specific article?

ageitgey avatar Nov 03 '16 03:11 ageitgey

This is the one I tried: http://cnn.com/2016/11/01/politics/hillary-clinton-2016-campaign/index.html

The output was just the first paragraph of the article, which actually is in a p tag unlike the rest of the article.

I tried a number of other sites including Medium, NBC News, and a few random blogs and they all worked great. When I inspected the article on CNN though I saw their use of div tags and figured that could be the problem.

iannshan avatar Nov 03 '16 03:11 iannshan