SRE Observability documentation
The starting point for observability resources at Wikimedia SRE.
- icinga.w.o/alerts: view of legacy alerting. See also Icinga.
- alerts.w.o: view all active alerts. See also Alertmanager.
- Alerting infrastructure roadmap PDF
- Logstash: central logging platform. See also Logstash.
- Logging infrastructure design document PDF
- grafana.w.o: central observability platform. See also Grafana.
- Prometheus, recommended and supported metrics toolkit
- Graphite, supported but deprecated time series framework
- Statsd, supported but deprecated metrics aggregation
- Observability/Dashboard_guidelines, ideas for building better dashboards