feedparser
Parse feeds in Python
Examples:
- https://www.warhammer-community.com/feed
- https://thehill.com/feed/

Currently using version 6.0.10.
```
$ python3 -c 'import feedparser; import pprint; pprint.pp(feedparser.parse("/dev/null"));'
{'bozo': False, 'entries': [], 'feed': {}, 'headers': {}}
```
This makes it look like `/dev/null` is a valid feed with no...
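Since `bozo` alone does not flag this case, a caller may want an extra check on the parsed result. The following is a minimal sketch, not part of feedparser: `looks_like_feed` is a hypothetical helper name, and the rule it applies (reject results where `bozo` is set or where both `feed` and `entries` are empty) is an assumption about what "usable" means. It works on any mapping shaped like the output shown above.

```python
def looks_like_feed(parsed):
    """Return True if a parse result appears to hold real feed data.

    `parsed` is any dict-like object with the keys shown in the output
    above ('bozo', 'feed', 'entries'). This helper is hypothetical and
    is not provided by feedparser itself.
    """
    # A truthy bozo value (True or 1) means the parser reported a problem.
    if parsed.get("bozo"):
        return False
    # An input such as /dev/null yields no feed metadata and no entries.
    if not parsed.get("feed") and not parsed.get("entries"):
        return False
    return True


# The result printed above for /dev/null:
empty_result = {"bozo": False, "entries": [], "feed": {}, "headers": {}}
print(looks_like_feed(empty_result))
```

Whether an empty but well-formed feed should count as valid is a design choice; a stricter or looser rule may suit other callers better.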
``` $ python3 -c 'import feedparser; import pprint; pprint.pp(feedparser.parse("http://httpstat.us/500"));' {'bozo': 1, 'entries': [], 'feed': {}, 'headers': {'content-length': '25', 'connection': 'close', 'content-type': 'text/plain', 'date': 'Tue, 23 Jul 2024 09:26:01 GMT', 'server':...
The latest pip-installable release still relies on the old custom HTTP client code (in http.py). This was causing errors when passing in a string of the feed content. Installing...
https://www.36kr.com/feed
Hi! First off, huge thanks for maintaining feedparser. It's legendary! We're all lucky to have it. I hit a new (to me) `AssertionError` today when parsing the RSS at https://snrk.de/feed/...
Previously these errors were ignored, since an exception is raised only on fatal errors. With this change, when a non-fatal error occurs, the bozo bit is still set, but the...
feedparser imports cchardet or chardet depending on what's installed: https://github.com/kurtmckee/feedparser/blob/11990ea1d8791acc76c67781f1d2011daf0c3a99/feedparser/encodings.py#L37-L40 Although these libraries are mostly equivalent, they return slightly different encoding strings, even though both are correct and lead to...
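When two detectors report the same encoding under different spellings, comparing the raw strings gives a false mismatch. One possible way to compare them, sketched here with only the standard library, is to resolve both names through Python's codec registry first. `same_encoding` is a hypothetical helper, not something feedparser, chardet, or cchardet provides, and it assumes both names are known to Python's `codecs` module.

```python
import codecs


def same_encoding(name_a, name_b):
    """Return True if two encoding names resolve to the same Python codec.

    Hypothetical helper for illustration. Names that Python's codec
    registry does not recognize are treated as not equal.
    """
    try:
        # codecs.lookup() maps aliases to one canonical codec name.
        return codecs.lookup(name_a).name == codecs.lookup(name_b).name
    except LookupError:
        return False


print(same_encoding("UTF-8", "utf8"))
print(same_encoding("utf-8", "ascii"))
```

This only handles aliases of one codec; it does not treat a subset encoding such as ASCII as equivalent to UTF-8, even when both would decode a given document identically.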
When parsing an RSS feed string that starts with a newline before the XML declaration inside a `ProcessPoolExecutor`, `feedparser.parse` returns a `FeedParserDict` object that cannot be pickled without raising: ```...
I stumbled across a feed with an `<img>` where the `src` was busted, but the `srcset` is good, so here we are: this teaches the relative URL resolver how to...