gofeed icon indicating copy to clipboard operation
gofeed copied to clipboard

XML syntax error on line 34: illegal character code U+0008

Open minhdanh opened this issue 3 years ago • 4 comments

Expected behavior

RSS feed parsed correctly

Actual behavior

gofeed cannot parse RSS feed, with the following error:

XML syntax error on line 34: illegal character code U+0008

Steps to reproduce the behavior

Parse this feed: http://newsletter.grokking.org/?format=rss

Apparently there's a "strange" character in line 34

minhdanh avatar Jun 05 '21 02:06 minhdanh

I can see there's a similar issue which has been fixed a long time ago: https://github.com/mmcdole/gofeed/issues/25 But this still happens with the latest version of gofeed (v1.1.3)

minhdanh avatar Jun 05 '21 02:06 minhdanh

I reproduced it.

I also ran into another char: U+0004:

XML syntax error on line 1681: illegal character code U+0004 https://changelog.com/posts/feed

purefun avatar Aug 17 '21 13:08 purefun

I also encountered this problem recently.

version:gofeed v1.1.3,go 1.17

XML syntax error on line 211: illegal character code U+0008

https://chegva.com/feed/

anzhihe avatar Aug 24 '22 06:08 anzhihe

I want to be able to parse feeds with illegal characters, so I've opened #206 to see how we should handle this.

mmcdole avatar Mar 25 '23 17:03 mmcdole