ReadabiliPy icon indicating copy to clipboard operation
ReadabiliPy copied to clipboard

Define handling rules for <iframe>

Open jemrobinson opened this issue 6 years ago • 2 comments

How should

jemrobinson avatar Dec 13 '18 12:12 jemrobinson

Note that the default BeautifulSoup behaviour is to not parse anything inside an <iframe>. Here is a minimal example taken from the failing code in #37.

> print(str(BeautifulSoup("<iframe><span></span></iframe>")))
> <html><head></head><body><iframe>&lt;span&gt;&lt;/span&gt;</iframe></body></html>

jemrobinson avatar Dec 20 '18 16:12 jemrobinson

Easiest solution here is to blacklist iframe. Can revisit this in the future.

jemrobinson avatar Dec 20 '18 17:12 jemrobinson