Incident documentation meeting/QR201407/group2

From Wikitech

Quarterly Review of post-mortems - 2014-07

Questions we want to be able to answer

  • Have all of the issues that came out of the post-mortem been addressed? If not, why not?
  • Are we satisfied with the current state of that part of the infra? Are there further actions to take (upon further reflection)?
  • Anything else?

Agenda

  • Go through the post-mortems and their respective action items and make sure they have been followed up appropriately.
    • If you have details that are relevant to the post-mortem in BZ/etc, please link from the post-mortem.
  • Discuss whether there is anything else that we learned from the situation, and follow up to better inform future decisions.
  • Write up notes collaboratively, so that others in the organization can learn from these as well.

Notes

HERE!

The post-mortems

20140328-DB-Queries

Bryan, Reedy: Incident documentation meeting/20140328-DB-Queries

20140403-Deploy

Bryan, Reedy: Incident documentation meeting/20140403-Deploy

20140503-Thumbnails

Antoine, Faidon: Incident documentation meeting/20140503-Thumbnails

20140517-bits

Timo: Incident documentation meeting/20140517-bits

20140529-appservers

Ori: Incident documentation meeting/20140529-appservers

20140608-Kafka

Andrew O: Incident documentation meeting/20140608-Kafka

20140612-Math

Greg: Incident documentation meeting/20140612-Math

20140618-Wikitech

Andrew B, Marc: Incident documentation meeting/20140618-Wikitech