goaccess icon indicating copy to clipboard operation
goaccess copied to clipboard

Parse and display only rss readers

Open fekir opened this issue 10 months ago • 2 comments

There are the options --crawlers-only and --ignore-crawlers, but I'm actually interested in one particular category of "bots": rss readers.

Contrary from crawlers, they represent users, but have a different behavior from browsers.

Has it been considered to add an option for filtering only rss readers; something like --rss-only? Would it make sense to add it to goacces?

fekir avatar Feb 22 '25 08:02 fekir

I think #117 should be able to handle this, but do you have a few sample lines from your log and how these should be parsed and what output you're expecting? Thanks!

allinurl avatar Feb 26 '25 01:02 allinurl

Some entries from my logs, I di not validate all of them, but at least from the name they look like rss readers:

# N changes based on the amount of subscribers
"NewsBlur Feed Fetcher - N subscriber - https://www.newsblur.com/site/8128934/website (\"Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/14.0.1 Safari/605.1.15\")"
"Mozilla/5.0 (compatible; BazQux/2.4; +https://bazqux.com/fetcher; N subscribers)"
"Mozilla/5.0 (compatible; inoreader.com; N subscribers)"

"FreshRSS/1.21.0 (Linux; https://freshrss.org)"
"Newsboat/2.34.0 (Macintosh; Intel Mac OS X)"
"Newsboat/2.34.0 (Linux x86_64)"
"Newsboat/2.30.1 (FreeBSD amd64)"
"Tiny Tiny RSS/1.15.3 (http://tt-rss.org/)"
"SpaceCowboys Android RSS Reader / 2.6.0(284)"
"Mozilla/5.0 (compatible; WireReaderBot/1.0; +https://wirereader.app)"
"KOReader/2022.03.1 (https://koreader.rocks/) LuaSocket/3.0-rc1"
"NextCloud-News/1.0"
"Aggregator/2.22.000 (Android/8.0.0; universal8890)"
"Feedly/1.0 (+http://www.feedly.com/fetcher.html; 1 subscribers; like FeedFetcher-Google)"
"Vienna/8268 (Macintosh; Intel macOS 10_15_6)"
"Microsoft Office/16.0 (Windows NT 10.0; Microsoft Outlook 16.0.17029; Pro)"
"Microsoft Office Outlook 2014 (16.0.13127) Windows NT 10.0"

# not sure about this one, could be outlook too?
"Mozilla/4.0 (compatible; ms-office; MSOffice 16)"


# ???
"Rome Client (http://tinyurl.com/64t5n) Ver: UNKNOWN"

fekir avatar Mar 01 '25 07:03 fekir