Jump to content

Portal:Toolforge/Admin/Logging

From Wikitech
This page is currently a draft.
Material may not yet be complete, information may presently be omitted, and certain parts of the content may be subject to radical, rapid alteration. More information pertaining to this may be available on the talk page.

Logging on Toolforge, both for user workloads and for the infrastructure, is being moved to a setup based on Grafana Loki.

Overview

Storage

Log storage is handled by Grafana Loki, with persistant storage in the Ceph cluster via the S3 interface. The s3 buckets exist in separate projects, tools-logging and toolsbeta-logging, as our RadosGW implementation does not allow for more specific than per-project access control restrictions. The buckets are created via tofu-provisioning system.

The different Loki deployments in each project (tools and toolsbeta) are:

tools
Log storage for tool workloads. (So everything in an individual tool- namespace.)

Collection

Each Kubernetes worker node runs a Grafana Alloy pod that forwards logs from pods running on that node to the appropriate Loki instance.

Deployment

The entire logging stack is deployed via the logging component of toolforge-deploy.

Operations

Upgrading Loki or Alloy

Monitoring

A Grafana dashboard is available.

See also