Portal:Cloud VPS/Admin/Runbooks/MetricsinfraAlertmanagerDown

From Wikitech
The procedures in this runbook require admin permissions to complete.

The MetricsinfraAlertmanagerDown alert fires when the production Prometheus AlertManager dashboard can't communicate with the alerting setup in the metricsinfra project. This is either an issue with the metricsinfrar setup, or a sign of a more general network outage in Cloud VPS.

Debugging

Common issues

Add here any new common issues you find.

Related information

Support contacts

Taavi knows both about the network setup and the metricsinfra alerting setup.

Old incidents

Add here any new tasks for incidents you might encounter.