feedparser
feedparser copied to clipboard
Parse feeds in Python
Hi, I just encountered an issue with the library (basically it's with the XML that is wrong, but I can't contact the owners to fix this): ```xml
This project seems to set the preferred XML SAX driver to `drv_libxml2`. This will only have an effect on a few systems where the libxml2 Python bindings are installed and...
This was broken in 1e4a5b25e337b18a3295d97e1a8957ac22275d1a, which moved the header case normalization logic into the http client. After that commit, users needed to pass lowercase headers in response_headers in order for...
While using latest CPython 3.13.0 I'm seeing the following deprecation warning: ``` .venv/lib/python3.13/site-packages/feedparser/html.py:152: DeprecationWarning: 'count' is passed as positional argument data = re.sub(r'
`content_text` incorrectly takes precedence over `content_html` when parsing JSON Feed, making it impossible to get `content_html` if both exist. https://github.com/kurtmckee/feedparser/blob/e43242143ed90ee8cbf05078faf972f8de35a798/feedparser/parsers/json.py#L88-L97 According to https://www.jsonfeed.org/version/1.1/, `content_text` and `content_html` are completely equal. >...
Related to #24 and #145, but only affects elements inside `` or ``. This specifically solves the incorrect parsing of arXiv Atom feeds mentioned in #145, where author affiliations were...
I maintain [feed2fedi](https://codeberg.org/marvinsmastodontools/feed2fedi) that uses feedparser and I have been asked to look into supporting sites / feeds based on Google's XML news-sitemap. On initial trial it looks like feedparser...
We needed a way to archive the data that feedparser uses when processing a url, for the purposes of troubleshooting, running tests and regression analysis. There were two options to...
I noticed that the project currently depends on deprecated library sgmllib3k. Could you share if there’s a plan or timeline to remove this dependency in a future release? `DEPRECATION: Building...
Passing raw Python exceptions in `bozo_exception` can cause issues when using multiprocessing. This change converts the exception to a string when running in a multiprocessing context. Closes #511