GitLab/Runbook

From Wikitech

A runbook is a set of instructions for a human what to do. More specifically what to do when a certain monitoring alert triggers. This page contains GitLab related runbooks, linked from alertmanager rules with the runbook annotation.

GitLabCIJobErrors

More than 50% of GitLab CI jobs are failing.

Check what kind of job error is elevated https://grafana.wikimedia.org/d/Chb-gC07k/gitlab-ci-overview?orgId=1.

Also check the past and currently running Jobs at https://gitlab.wikimedia.org/admin/jobs (admin privileges required).

GitLabCIPipelineErrors

More than 50% of GitLab CI pipelines are failing.

Check what kind of pipeline error is elevated https://grafana.wikimedia.org/d/Chb-gC07k/gitlab-ci-overview?orgId=1.

Also check the past and currently running Jobs at https://gitlab.wikimedia.org/admin/jobs (admin privileges required).

GitLabCIPipelineLatency

Pipeline createn takes more than 10 seconds.

Check number of active pipelines at https://grafana.wikimedia.org/d/Chb-gC07k/gitlab-ci-overview?orgId=1.

Also check the past and currently running Jobs at https://gitlab.wikimedia.org/admin/jobs (admin privileges required).

Check if all runners are available and marked online at https://gitlab.wikimedia.org/admin/runners.

If needed, stop jobs which need too much resources.