grr icon indicating copy to clipboard operation
grr copied to clipboard

Remove unused feeds

Open cardre opened this issue 12 years ago • 3 comments
trafficstars

When all users have unsubscribed from a feed, crawler will still continue to index and records for articles still exist in database.

At beginning of each background crawl process, remove feeds and their corresponding articles if no longer used by any users.

cardre avatar Jul 05 '13 10:07 cardre

This would probably best be handled by adding an option to the administrative interface. I prefer the idea of permanent archival and crawling - since grr is a multiuser app, and there is always a possibility of users re-subscribing to an "abandoned" feed. It would be nice to preserve history by default (Google Reader's feeds went back to 2007).

0xe1f avatar Jul 06 '13 18:07 0xe1f

Yes, I guess this makes sense. Problem I've seen is some abandoned feeds that have been imported for the first time that will never resolve, will continue to be checked forever. It also leads me to thinking that some sort of UI component (maybe the menu option for each feed) could have a 'Status' item, that can show whether the feed is updating or any failures that are occurring etc.

So for my immediate issues, I've just manually removed them from the database. But agree that an admin function is probably the way to go.

cardre avatar Jul 07 '13 15:07 cardre

For those feeds, I think it something like the following would be useful:

Check a failed feed three times a day at most After third day, disable check permanently via a flag in the database schema

An administrator would have an option to re-enable a feed, but feeds would be disabled again if they kept failing.

I agree with you that a Status item would definitely be useful, as well as a detailed error log for the parser.

0xe1f avatar Jul 07 '13 16:07 0xe1f