Jump to content

News/2025 New Toolforge log storage

From Wikitech
This page is currently a draft.
Material may not yet be complete, information may presently be omitted, and certain parts of the content may be subject to radical, rapid alteration. More information pertaining to this may be available on the talk page.

This page contains information about the new log storage system being introduced for Toolforge jobs.

What is changing?

The log storage system for Toolforge jobs is being migrated to a Grafana Loki deployment, instead of relying on the internal log storage handled by the Kubernetes (the container orchestrator we use). This change means that the toolforge jobs logs command is now able to query logs not just from currently running containers.

In particular, toolforge jobs logs is gaining the following features:

  • Logs will now be visible from containers that are no longer running. Logs are no longer lost when, for example, a job is restarted or a cron job launch completes
  • Logs can be queried from jobs that have since been deleted (phab:T400913)
  • Logs can be queried for a specific time frame (phab:T400917)

The log viewing functionality of the webservice command is unaffected by these changes.

Timeline

  • Yes Done 2025-07-31: toolforge jobs logs is changed to query logs from Loki.
  • Underway In progress New features are added to toolforge jobs logs to take advantage of Loki's functionality.
  • TBD: File logging will be disabled by default for new jobs.
  • TBD: File logging is removed as a built-in option. Current jobs using file logging will be migrated to use internal logging instead.

What should I do?

Why are we doing this?

This change is being made to improve the reliability of Toolforge by reducing usage of the shared storage (NFS) server.

See also

Communication and support

Support and administration of the WMCS resources is provided by the Wikimedia Foundation Cloud Services team and Wikimedia movement volunteers. Please reach out with questions and join the conversation:

Discuss and receive general support
  • Chat in real time in the IRC channel #wikimedia-cloudconnect or the bridged Telegram group
  • Discuss via email after you have subscribed to the cloud@ mailing list
Stay aware of critical changes and plans
Track work tasks and report bugs

Use a subproject of the #Cloud-Services Phabricator project to track confirmed bug reports and feature requests about the Cloud Services infrastructure itself

Read stories and WMCS blog posts

Read the Cloud Services Blog (for the broader Wikimedia movement, see the Wikimedia Technical Blog)