Portal:Toolforge/Admin/Runbooks/ToolforgeKubernetesNodeNotReady

From Wikitech
Jump to navigation Jump to search
The procedures in this runbook require admin permissions to complete.

The ToolforgeKubernetesCapacity alert fires when a Toolforge Kubernetes node is marked as not ready. A paging alert also fires when at least 5 nodes are marked as not ready.

Debugging

On a bastion run as your own user:

$ kubectl sudo get node
$ kubectl sudo describe node <node>

Related information

Support contacts

Old incidents