feedparser
feedparser copied to clipboard
Parse feeds in Python
I'm specifically looking to access the itunes:owner element of a podcast feed, but I think that access to arbitrary non-standard elements would be useful.
Is there any way to get the favicon of the rss feed url?
Hi, I'm trying to parse a RSS 2.0 feed : http://www.legrandmix.com/data/rss.xml It is OK with w3c validator : https://validator.w3.org/feed/check.cgi?url=http%3A%2F%2Fwww.legrandmix.com%2Fdata%2Frss.xml But I've an error with feedparser : SAXParseException('not well-formed (invalid token)'...
Hello, I'm trying to parse a Atom 1.0 XML File which has multiple authors elements that contain further elements. The authors get correctly parsed to `authors` but in `author_details` only...
Version: 5.2.1
Example feed https://tobiw.de/tbdm/feed When parsing this feed `bozo` is 1 because of an SSL error. But `entries` is created but empty. But `status` is not. I understand the latter. But...
Example rss: http://www.atilim.edu.tr/site/rss ```python3 >>> import feedparser >>> url = 'http://www.atilim.edu.tr/site/rss' >>> result = feedparser.parse(url) ### Waiting ``` Solution: https://stackoverflow.com/questions/9772691/feedparser-with-timeout
Hi, I've been getting a bozo exception like the following: `In [5]: feedparser.parse("owlturd.com/rss") Out[5]: {'bozo': 1, 'bozo_exception': xml.sax._exceptions.SAXParseException('not well-formed (invalid token)'), 'encoding': u'utf-8', 'entries': [], 'feed': {}, 'namespaces': {}, 'version':...
My [personal UDD todo list](https://udd.debian.org/dmd/?email1=anarcat%40debian.org&email2=&email3=&packages=&ignpackages=photofloat&nosponsor1=on&format=rss#todo) breaks feedparser. If you add the tests to the "illformed" directory, tox says: ``` GLOB sdist-make: /home/anarcat/dist/feedparser/setup.py py27 create: /home/anarcat/dist/feedparser/.tox/py27 py27 inst: /home/anarcat/dist/feedparser/.tox/dist/feedparser-5.2.1.zip py27 installed:...
When I try to parse [this rss](http://paleonews.ru/index.php?option=com_sdrsssyndicator&feed_id=1&format=raw), **entry.link** doesn't match that I expect. I think that the Feedparser incorrectly works with **\** block. Here is my code and rss content...