scale-network
Implementing Grafana dashboards for overall network and device health
At SCALE 21x, we discussed potential improvements to our Grafana installation. Currently, we only use the OpenWRT dashboard, which does not show overall network status. This issue is a place to keep track of each request/idea.
I would like to highlight some of these requests/ideas from the team members.
- [ ] Overall state/health of each networked device, such as Raspberry Pis, Access Points (OpenWRT), and Switches (Juniper).
  - A total of 200+ devices to graph
- [ ] Conference map overlay to visualize the status of each device at its physical location
- [ ] Bandwidth usage and health check on Juniper ports
- [ ] Some team members are concerned about alarm fatigue. Alarms should only highlight critical problems
  - Tech team members are less concerned about Slack and notifications
- [ ] Collect stats from DHCP (Kea) and DNS
This list is not exhaustive, so you are encouraged to add more items here.
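To make the first and fourth items a bit more concrete, here is a minimal Python sketch of a per-type health summary with alerts limited to critical cases. Everything in it is an assumption for illustration: the `Device` record, the device names, and the "more than half of a type is down" rule are hypothetical and are not taken from our current setup or from any Grafana API.

```python
from collections import Counter
from dataclasses import dataclass


# Hypothetical device record; real data would come from whatever
# metrics source ends up backing the dashboards.
@dataclass(frozen=True)
class Device:
    name: str
    kind: str  # e.g. "pi", "ap", "switch"
    up: bool


def health_by_kind(devices):
    """Return {kind: (up_count, total_count)} for a per-type overview."""
    total = Counter(d.kind for d in devices)
    up = Counter(d.kind for d in devices if d.up)
    return {kind: (up[kind], total[kind]) for kind in total}


def critical_alerts(devices, threshold=0.5):
    """Flag a device type only when more than `threshold` of it is down.

    The threshold rule is an assumed way to keep alarms limited to
    critical problems; it is not an agreed policy.
    """
    alerts = []
    for kind, (up, total) in sorted(health_by_kind(devices).items()):
        down = total - up
        if down / total > threshold:
            alerts.append(f"CRITICAL: {down}/{total} {kind} devices down")
    return alerts


# Made-up sample inventory for demonstration only.
devices = [
    Device("pi-1", "pi", True),
    Device("ap-1", "ap", False),
    Device("ap-2", "ap", False),
    Device("ap-3", "ap", True),
    Device("sw-1", "switch", True),
]
print(health_by_kind(devices))
print(critical_alerts(devices))
```

With the sample inventory above, only the access points cross the threshold, so a single device being down would not raise an alarm on its own.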
Thanks @BerkhanBerkdemir this looks like a great list to start with
I couldn't attend this weekend to work on this issue. Is there anyone who wants to work with me on this feature, @sarcasticadmin? I'd like to close this issue with a proper PR before December.
Hey @BerkhanBerkdemir, not too familiar with Grafana but willing to learn and help out as much as possible.
@BerkhanBerkdemir thanks for checking back in and no worries about not being able to make the work party.
I think setting a goal of December sounds like a good idea, and it'd be good to work with @jshcmpbll and @fossadmin on this. I know @jshcmpbll has been working on getting a vmTest together for Grafana to do some simple sanity checks on its deployment.
I'd be happy to set up a call to sync up with those who are interested if that sounds like a good idea.