
problem with rss feed

Open cheese1 opened this issue 5 years ago • 3 comments

Hi,

I get errors while reading https://security.netapp.com/advisory/index.xml

The error is:

There were errors while parsing this feed! Details Could not detect the type of this feed! Please check if the source really points to a resource provided in one of the supported syndication formats!

XML Parser Output: Source points to HTML document.

Adding "xmllint --recover -" did not help. How would I debug that?

P.S.: The feed is quite huge.

cheese1, Aug 29 '20 15:08

Feeder on Android parses that feed without problems.

cheese1, Sep 01 '20 05:09

Huge indeed. It is 22M, and it chokes the libxml parser with "internal error: Huge input lookup". libxml has an XML_PARSE_HUGE option, but I don't see how to pass it to the SAX parser ...
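To answer the "how would I debug that" part, one rough approach is to check the feed's size and well-formedness outside of Liferea. The sketch below is only an illustration: `inspect_feed` and its `size_hint` parameter are made-up names, and the 10,000,000 byte default is an assumed threshold rather than a value taken from Liferea. Python's `xml.etree.ElementTree` is backed by expat rather than libxml, so a clean parse here would suggest the XML itself is fine and the failure is specific to libxml's limits.

```python
import xml.etree.ElementTree as ET


def inspect_feed(data: bytes, size_hint: int = 10_000_000) -> dict:
    """Report the size of a feed and whether the stdlib parser accepts it.

    size_hint is a hypothetical threshold (assumption), used only to flag
    documents that are large enough to be suspicious.
    """
    report = {
        "bytes": len(data),
        "over_hint": len(data) > size_hint,
        "well_formed": True,
        "root": None,
        "error": None,
    }
    try:
        # ElementTree raises ParseError on malformed XML.
        root = ET.fromstring(data)
        report["root"] = root.tag
    except ET.ParseError as exc:
        report["well_formed"] = False
        report["error"] = str(exc)
    return report
```

Feeding it the downloaded bytes of the feed (for example read from a saved copy of index.xml) gives a quick indication of whether size or syntax is the more likely culprit.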

Leiaz, Sep 05 '20 12:09

By reducing the file size I was able to read the feed. I used a filter to remove empty lines and spaces/tabs at the start of lines: sed -e 's/^\s*//' -e '/^$/d'. This reduces the size to 50%.

That seems to solve at least my problem.
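For anyone without GNU sed at hand, roughly the same filter can be sketched in Python. `squeeze_feed` is a made-up name and this is an approximation of the sed expression above, not something Liferea ships. Note that stripping leading whitespace also alters text inside CDATA or multi-line element content, so it is a workaround rather than a lossless transformation.

```python
def squeeze_feed(text: str) -> str:
    """Drop leading whitespace from every line and remove empty lines.

    Approximates: sed -e 's/^\\s*//' -e '/^$/d'
    """
    kept = []
    for line in text.split("\n"):
        stripped = line.lstrip()  # remove spaces/tabs at the start of the line
        if stripped:              # skip lines that are empty after stripping
            kept.append(stripped)
    # Keep a trailing newline when there is any output, as sed would.
    return "\n".join(kept) + ("\n" if kept else "")
```

Wrapped in a small script that reads stdin and writes stdout, it could be used in the same place as the sed command.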

cheese1, Sep 06 '20 05:09

Having this feed in my test case set, I noticed it suddenly working, I think at the latest since the Ubuntu 22.04 upgrade. So I guess some limits in libxml2 might have become bigger (or the feed smaller).

@cheese1 Closing this as it is an old ticket (and works now).

lwindolf, Oct 11 '22 23:10