[SHELL] Process Google Alerts for new repositories
At some point, I had a cron job working that processed incoming Google Alerts sent to a given email address. It would be nice to bring this back.
- Once an hour, a shell retrieves a Google Alert RSS feed
- Process the URLs of each item in the list to retrieve the base URL for the repository
- This should throw out anything in a specific blacklist, like cakephp/cakephp
- Throw out all non-repository URLs. Maybe keep gist URLs?
- Match all repositories against existing repos. If they exist, throw out the item
- Check to see if the repository is a fork. If so, throw it out
- At some point, we may want to list forks, or at least have the option of making it the "mainline", especially for migrations to 2.0
- Get a list of all branches, the repository name, and some stats
- Insert the repos into a table for later processing
- Send an email to the site admin with a list of all new potential repos
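
The URL-processing steps above could be sketched roughly like this. It is written in Python purely for illustration (the real thing would presumably be a CakePHP shell), and the names `repo_slug`, `new_candidates`, `BLACKLIST`, and `RESERVED_OWNERS` are made up for the example. It only covers reducing URLs to a base repository, the blacklist, non-repository URLs, and matching against existing repos; the fork check and branch/stats lookup need the GitHub API and are left out.

```python
from urllib.parse import urlparse

# Hypothetical values for illustration; the real lists would live in the database.
BLACKLIST = {"cakephp/cakephp"}
# Assumed examples of top-level GitHub paths that are not repository owners.
RESERVED_OWNERS = {"blog", "features", "explore", "search", "settings"}


def repo_slug(url):
    """Return a lowercased 'owner/name' for a GitHub repository URL, else None."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    if host not in ("github.com", "www.github.com"):
        # This also drops gist.github.com; whether to keep gists is still open.
        return None
    parts = [p for p in parsed.path.split("/") if p]
    if len(parts) < 2 or parts[0].lower() in RESERVED_OWNERS:
        return None
    owner, name = parts[0], parts[1]
    if name.endswith(".git"):
        name = name[:-4]
    return f"{owner}/{name}".lower()


def new_candidates(urls, existing, blacklist=BLACKLIST):
    """Reduce alert URLs to unique slugs that are neither blacklisted nor known.

    `existing` is assumed to be a set of lowercased 'owner/name' strings.
    """
    seen = set()
    result = []
    for url in urls:
        slug = repo_slug(url)
        if slug is None or slug in blacklist or slug in existing or slug in seen:
            continue
        seen.add(slug)
        result.append(slug)
    return result
```

Deep links such as `/owner/name/blob/master/README.md` collapse to the same slug as the repository root, so several alerts about one repo end up as a single candidate.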
The email would have some general data about the packages, including the link to the repo, the description, and other stats collected, as well as a link to automatically approve the package on plugins.cakephp.org. We can use a token system so that logging in is not required for approving the package.
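
One way the token system could work is a signed, expiring token embedded in the approval link, so nothing extra has to be stored per email. The sketch below is only an assumption about the design, written in Python for illustration: `SECRET_KEY`, `make_token`, and `verify_token` are hypothetical names, and a random single-use token saved next to the pending repo would work just as well.

```python
import hashlib
import hmac
import time

# Hypothetical secret; in practice this would come from the app configuration.
SECRET_KEY = b"replace-with-a-long-random-secret"


def _sign(message, key):
    """Hex HMAC-SHA256 signature of a text message."""
    return hmac.new(key, message.encode(), hashlib.sha256).hexdigest()


def make_token(package_id, expires_at, key=SECRET_KEY):
    """Build 'id:expiry:signature' for use in an approval link."""
    message = f"{package_id}:{expires_at}"
    return f"{message}:{_sign(message, key)}"


def verify_token(token, now=None, key=SECRET_KEY):
    """Return the package id if the token is authentic and unexpired, else None."""
    try:
        package_id, expires_at, signature = token.split(":")
        pid = int(package_id)
        expires = int(expires_at)
    except ValueError:
        return None  # wrong number of fields or non-numeric id/expiry
    expected = _sign(f"{package_id}:{expires_at}", key)
    # Constant-time comparison avoids leaking how much of the signature matched.
    if not hmac.compare_digest(signature.encode(), expected.encode()):
        return None
    current = time.time() if now is None else now
    if current > expires:
        return None
    return pid
```

The approve action would call `verify_token` on the value from the link and only approve the package when it gets an id back. A signed token stays valid until it expires, so the action should also check that the package is still pending.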
Thoughts?
Sounds great and well planned out. I believe we can have the Google Alerts delivered to a public RSS feed as well, so we don't have to check an email address (unless you're already suggesting this in the first bullet).
> Once an hour, a shell retrieves a Google Alert rss feed
:)
:thumbsup:
Do you want to work on this next? Seems like an interesting project to work on, and potentially can be made into a plugin for opensource win.
Yeah absolutely. I'll put it next on my todo list.