Portal:Cloud VPS/Admin/Runbooks/CephSlowOps

From Wikitech
The procedures in this runbook require admin permissions to complete.

Ceph slow ops will usually block some writes to the Ceph cluster which makes VMs unhappy.

Debugging

Log in to any cloudcephmon node, and then use sudo ceph health detail to find OSDs having issues. Use sudo ceph osd find osd.NNN to find which cloudcephosd node is having issues.

On the cephosd node, use sudo ceph daemon osd.NNN ops_in_flight detail about the stuck operations.

Common issues

Disk issues

Related information

Support contacts

Old incidents