github-bot
github-bot copied to clipboard
Bot uptime monitoring
As this bot moves into production and starts getting integrated in various flows where people start depending on it, I think it is time to think about how we can monitory uptime and response-time of the bot.
Here are some suggestions which I have experience with:
| Name | Price | License | Remarks |
|---|---|---|---|
| Pingdom | $14.95/mo | ❌Proprietary | Easy to set up, but expensive 😕 |
| Pagerduty | $9/mo | ❌Proprietary | Very advanced. Much enterpizy 😐 |
| fzaninotto/uptime | Free | ✅MIT | Unfortunately this project is not maintained any more 😢 |
| DIY | Free | ✅MIT | Make something our self similar to what I have made for monitoring Jenkins Slaves |
We could send a shout out to Pingdom and Pagerduty to hear if they would like to sponsor the Node.js Foundation – it might be worth a shot.
Great initiative! It would be great to a least be confident it's up-n-running, which I guess the services you mention and Statuscake (free) could help us with.
For response-time monitoring though, I'm not sure it'll be much value for those services to request a URL on the bot.. What we're really interested is the time it took to handle webhook requests from github.com or jenkins, and maybe how long the bot is waiting for third party APIs to respond (github, travis or jenkins).
I think if we're to do such a thing we should look at the bigger scale for build resources. We have an email alert tool but we can improve lots here.
@Starefossen perhaps we can expand the jenkins status monitor to do this check too?
Yes, that would absolutely be possible!
The bot already has a /ping endpoint which could be used?
On Friday, 26 August 2016, Hans Kristian Flaatten [email protected] wrote:
Yes, that would absolutely be possible!
— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/nodejs/github-bot/issues/63#issuecomment-242801842, or mute the thread https://github.com/notifications/unsubscribe-auth/ABLLExwIowSc0dWmaZQlvbPcRJ-CzpKVks5qjyU-gaJpZM4JiZGx .
@phillipj @Starefossen I run https://noty.im for uptime monitoring. I would like to sponsor for this with all of features that we supports (SMS, Phone Call alert, Team Access, Status Page etc)
We also have a chat bot to control from there(add check- ping endpoint, review stat) https://www.youtube.com/watch?v=z2o8DhoN0dg
My whole stack is open source as well at github.com/notyim
Let me know what I can do to help setting this up.