Server Lifecycle/reclaim checklist

From Wikitech
Jump to navigation Jump to search

This checklist is able to be copied and pasted into phabricator hardware request tasks for reclaiming systems to spare or decom.

[] - all system services confirmed offline from production use
[] - set all icinga checks to maint mode/disabled while reclaim/decommmission takes place.
[] - remove system from all lvs/pybal active configuration
[] - any service group puppet/hiera/dsh config removed
[] - remove site.pp (replace with role::spare::system if system isn't shut down immediately during this process.)

START NON-INTERRUPPTABLE STEPS

[] - disable puppet on host
[] - remove all remaining puppet references (include role::spare)
[] - power down host
[] - disable switch port
[] - switch port assignment noted on this task (for later removal)
[] - remove production dns entries
[] - puppet node clean, puppet node deactivate

END NON-INTERRUPPTABLE STEPS

[] - system disks wiped (by onsite)
[] - IF DECOM: system unracked and decommissioned (by onsite), update Netbox with result
[] - IF DECOM: switch port configration removed from switch once system is unracked.
[] - IF DECOM: mgmt dns entries removed.
[] - IF RECLAIM: system added back to spares tracking (by onsite)