feedparser
Parse feeds in Python
From the very first code bit on the [Introduction page](https://pythonhosted.org/feedparser/introduction.html):
``` py
>>> import feedparser
>>> d = feedparser.parse('http://feedparser.org/docs/examples/atom10.xml')
```
All seems to work OK, except: ``` py >>> d["feed"]["title"]...
The documentation uses some example feeds. For example, https://pythonhosted.org/feedparser/introduction.html uses http://feedparser.org/docs/examples/atom10.xml, but that URL is not valid: the server answers with 403.
Hi all, I am having trouble downloading some feed URLs, such as https://meaculpa.gr/politiki?format=feed. The error I get is: Traceback (most recent call last): File "find_feed.py", line 26, in <module> print(is_feed('https://meaculpa.gr/politiki?format=feed')) File "find_feed.py",...
Hi! I'm using feedparser to great effect with a project to parse various RSS feeds from a site I help administrate for posting on the attached Discord server. When parsing...
I would like to add support for [RFC 6721](https://tools.ietf.org/html/rfc6721) to a tool I'm building using feedparser. The short version of it is that you add the `http://purl.org/atompub/tombstones/1.0` namespace (specified as...
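For orientation, here is a minimal sketch of reading tombstones straight from the XML with the standard library's `xml.etree.ElementTree`, independent of feedparser. The sample feed and the `find_tombstones` helper are hypothetical and only for illustration; the `at:deleted-entry` element with its `ref` and `when` attributes follows RFC 6721.

```python
import xml.etree.ElementTree as ET

# Prefix-to-URI map used for lookups; the prefixes are local to this sketch.
NAMESPACES = {
    "atom": "http://www.w3.org/2005/Atom",
    "at": "http://purl.org/atompub/tombstones/1.0",
}

# Hypothetical sample feed containing one tombstone.
SAMPLE_FEED = """<feed xmlns="http://www.w3.org/2005/Atom"
      xmlns:at="http://purl.org/atompub/tombstones/1.0">
  <title>Example</title>
  <at:deleted-entry ref="tag:example.org,2005:/entries/1"
                    when="2005-11-29T12:11:12Z"/>
</feed>
"""


def find_tombstones(xml_text):
    """Return (ref, when) pairs for each feed-level at:deleted-entry element."""
    root = ET.fromstring(xml_text)
    return [
        (element.get("ref"), element.get("when"))
        for element in root.findall("at:deleted-entry", NAMESPACES)
    ]


print(find_tombstones(SAMPLE_FEED))
```

This only covers tombstones that are direct children of the feed element; it says nothing about how feedparser itself would expose them.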
Parsing The Verge: https://www.theverge.com/rss/full.xml returns ``` Parsed Feed: {'bozo_exception': SAXParseException('syntax error',), 'feed': {}, 'bozo': 1, 'version': '', 'entries': [], 'encoding': 'utf-8', 'namespaces': {}} Bozo: 1 Exception type: :2:0: syntax error...
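A result like the one above can be checked before the entries are used. The sketch below is a hypothetical helper, `summarize_parse`, that relies only on the keys visible in that output (`bozo`, `bozo_exception`, `version`, `entries`). The sample mapping is made up, and a plain `ValueError` stands in for the real `SAXParseException`.

```python
def summarize_parse(result):
    """Return a one-line summary of a feedparser-style result mapping."""
    if result.get("bozo"):
        # bozo is truthy when the parser flagged the feed as malformed.
        exc = result.get("bozo_exception")
        return "malformed: {}: {}".format(type(exc).__name__, exc)
    return "ok: version={!r}, {} entries".format(
        result.get("version", ""), len(result.get("entries", []))
    )


# Hypothetical stand-in for a parse result; ValueError replaces the SAX exception.
SAMPLE_RESULT = {
    "bozo": 1,
    "bozo_exception": ValueError("syntax error"),
    "feed": {},
    "entries": [],
    "version": "",
}

print(summarize_parse(SAMPLE_RESULT))
```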
Because `feedparser` constructs entries as flat dictionaries, any repeated elements can get clobbered, even if they are nested in separate elements. For example, the following example feed repeats `` under...
There are several places where `deepcopy` is used with a `FeedParserDict` instance. It raises KeyError because of the line `copier = getattr(x, "__deepcopy__", None)` within the `copy.deepcopy` function: ``` KeyError: '__deepcopy__'...
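The failure mode can be reproduced without feedparser. In the sketch below, `LeakyDict` and `SafeDict` are hypothetical stand-ins, not the real `FeedParserDict`: the first lets `KeyError` escape from `__getattr__`, which `getattr(x, "__deepcopy__", None)` does not swallow; the second converts it to `AttributeError`, so the default value is used.

```python
import copy


class LeakyDict(dict):
    """Hypothetical dict with attribute access that leaks KeyError."""

    def __getattr__(self, key):
        # A missing key raises KeyError, which getattr() with a default
        # does not catch (it only catches AttributeError).
        return self[key]


class SafeDict(dict):
    """Same idea, but missing attributes raise AttributeError."""

    def __getattr__(self, key):
        try:
            return self[key]
        except KeyError:
            raise AttributeError(key) from None


def try_deepcopy(obj):
    """Return a deep copy, or the KeyError that deepcopy raised."""
    try:
        return copy.deepcopy(obj)
    except KeyError as exc:
        return exc


print(repr(try_deepcopy(LeakyDict(title=["a"]))))
print(repr(try_deepcopy(SafeDict(title=["a"]))))
```

Whether this matches the actual cause in a given feedparser version is an assumption; the sketch only shows why a `KeyError` from `__getattr__` breaks `copy.deepcopy`.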
``` python In [27]: f['entries'][0]["published"] Out[27]: u'2016/6/29 15:07:41' In [28]: f['entries'][0]["published_parsed"] Out[28]: time.struct_time(tm_year=2016, tm_mon=6, tm_mday=1, tm_hour=0, tm_min=0, tm_sec=0, tm_wday=2, tm_yday=153, tm_isdst=0) In [29]: datetime.datetime.fromtimestamp(time.mktime(f['entries'][0]["published_parsed"])) Out[29]: datetime.datetime(2016, 6, 1, 0, 0)...
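As a possible workaround, the raw `published` string can be parsed by hand. The sketch below assumes the `YYYY/M/D HH:MM:SS` layout shown above and assumes the timestamp is in UTC, which the feed does not state. `parse_slash_date` and `struct_to_datetime` are hypothetical helper names. The conversion uses `calendar.timegm`, which treats the tuple as UTC, whereas `time.mktime` would read it as local time.

```python
import calendar
from datetime import datetime, timezone


def parse_slash_date(value):
    """Parse 'YYYY/M/D HH:MM:SS' into a time.struct_time (assumed UTC)."""
    parsed = datetime.strptime(value.strip(), "%Y/%m/%d %H:%M:%S")
    return parsed.utctimetuple()


def struct_to_datetime(parsed):
    """Convert a UTC struct_time into a timezone-aware datetime."""
    return datetime.fromtimestamp(calendar.timegm(parsed), tz=timezone.utc)


stamp = parse_slash_date("2016/6/29 15:07:41")
print(struct_to_datetime(stamp).isoformat())
```

feedparser documents a `registerDateHandler` hook for custom date formats; a function like `parse_slash_date` could probably be registered there, though that wiring is not shown or tested here.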
Hi, I'm trying to parse RSS feeds with additional fields, such as https://mosregtoday.ru/rss/ or http://www.vesti.ru/vesti.rss. Their items have additional fields like **yandex:related** or **turbo:content**. There are 2 cases with feedparser, depending...