MariaDB/monitoring

From Wikitech

Icinga

Example of icinga checks

Specific checks:

  • Replication running (both IO and SQL thread)
  • Replication lag
  • Disk space
  • Memory used
  • Read_only status
  • mariadb process

Metrics

Prometheus

Hosts to be monitored at prometheus are controlled by the instance and server inventory at Zarcillo (db1215). In order to update prometheus, hosts have to be inserted, updated or deleted from Zarcillo and then run /usr/local/sbin/mysqld_exporter_config.py on the prometheus hosts (e.g. prometheus1003 and prometheus1004 for eqiad). This scripts run automatically every 30 minutes to check for changes.

The standard mysqld-prometheus-exporter is used for most metrics.

Grafana

Connection anomaly detected through Prometheus metrics+Grafana

Relevant dashboards:

Orchestrator

Orchestrator

Logstash/Kibana

Mediawiki database errors

Useful dashboards: