Portal:Toolforge/Admin/Runbooks/ToolforgeKubernetesHAproxyUnknown
This happens when Prometheus has no data from HAProxy on the server status (server_up).
The procedures in this runbook require admin permissions to complete.
Error / Incident
This usually comes in the form of an alert in Alertmanager.
The alert shows which project (tools, toolsbeta, ...) it is failing for.
Debugging
This one is tricky, and it is usually related to the way we gather metrics on tools/toolsbeta.
Note that this is not directly related to the metricsinfra monitoring project, but to Toolforge's own setup.
You can start by going to the project's Prometheus page (see the Tools and Toolsbeta Prometheus links under Related information) and trying to get the stats there.
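The same check can also be scripted against the Prometheus HTTP API instead of the web page. The following is a minimal sketch, not the team's tooling: the base URL is a placeholder, the metric name `haproxy_server_up` is an assumption (this runbook only names it server_up), and both helper functions are hypothetical. It builds an instant-query URL and lists the series found in a response; an empty list matches the "no data" condition behind this alert.

```python
import json
from urllib.parse import urlencode

# Assumption: placeholder base URL, not the real Toolforge Prometheus address.
PROMETHEUS_BASE_URL = "https://prometheus.example.org"
# Assumption: full metric name; the runbook only refers to it as "server_up".
METRIC = "haproxy_server_up"


def build_query_url(base_url, expr):
    """Return the instant-query URL (/api/v1/query) for a PromQL expression."""
    return f"{base_url.rstrip('/')}/api/v1/query?{urlencode({'query': expr})}"


def series_with_data(response_body):
    """Return the label sets of every series in a Prometheus query response."""
    payload = json.loads(response_body)
    if payload.get("status") != "success":
        raise ValueError(f"query failed: {payload.get('error', 'unknown error')}")
    return [sample["metric"] for sample in payload["data"]["result"]]


if __name__ == "__main__":
    # URL to fetch with curl or a browser.
    print(build_query_url(PROMETHEUS_BASE_URL, METRIC))
    # Example body with no series, i.e. Prometheus has no data for the metric.
    empty = '{"status": "success", "data": {"resultType": "vector", "result": []}}'
    print(series_with_data(empty))
```

If the list is empty, Prometheus is not receiving the metric at all, which points at the scraping setup rather than at the HAProxy servers being reported down.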
Common issues
Add new issues here when you encounter them!
Related information
- See HAProxy for generic information like commands/etc.
- Karma UI for cloud vps (use project=tools or project=toolsbeta for filtering)
- Tools prometheus
- Toolsbeta prometheus
- Alerts repository
- Toolforge admin docs
- See also any other *Unknown runbooks.
Old incidents
Add any incident tasks here!