Alerting and monitoring system brainstorm
Existing systems
- #monitoring-unms/uisp
- UISP
- Grafana
- Grafana/Prometheus - public, set up 4 years ago: https://stats.nycmesh.net
- Mesh only, Omnis etc.: http://10.70.90.82:3000/dashboards
Requirements
- Must alert Slack team when key infrastructure goes offline
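
A rough sketch of what the Slack alert requirement could look like, to make the discussion concrete. It assumes a Slack incoming webhook (which accepts a JSON body with a `text` field) and assumes device status is already available as a name-to-online mapping; how that status is collected (UISP, Prometheus, or something else) is left open. The webhook URL, device names, and function names are hypothetical placeholders, not an existing setup.

```python
import json
import urllib.request

# Hypothetical placeholder; a real incoming-webhook URL would come from Slack.
SLACK_WEBHOOK_URL = "https://hooks.slack.com/services/XXX/YYY/ZZZ"


def find_offline(statuses):
    """Return the sorted names of devices whose status is False (offline)."""
    return sorted(name for name, online in statuses.items() if not online)


def build_slack_payload(offline):
    """Build the JSON body for a Slack incoming-webhook message."""
    lines = [f"- {name}" for name in offline]
    return {"text": "Key infrastructure offline:\n" + "\n".join(lines)}


def post_to_slack(webhook_url, payload):
    """POST the payload to the webhook and return the HTTP status code."""
    request = urllib.request.Request(
        webhook_url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=10) as response:
        return response.status


def alert_if_offline(statuses, send=post_to_slack, webhook_url=SLACK_WEBHOOK_URL):
    """Send a single alert listing all offline devices; return that list."""
    offline = find_offline(statuses)
    if offline:
        send(webhook_url, build_slack_payload(offline))
    return offline
```

Usage would be something like `alert_if_offline({"hub-a": True, "hub-b": False})`, run on whatever schedule the checks use. The `send` parameter is there so the logic can be tried without posting to a real channel. A real version would probably also need to avoid re-alerting on every check while a device stays down.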
Requirements questions
- Sampling frequency? ~1 data point/hour