
Incident documentation meeting/QR201407

From Wikitech

Quarterly Review of post-mortems - 2014-07

Questions we want to be able to answer

  • Have all of the issues that came out of the post-mortem been addressed? If not, why not?
  • Are we satisfied with the current state of that part of the infra? Are there further actions to take (upon further reflection)?
  • Anything else?

Agenda

  • Go through the post-mortems and their respective action items and make sure they have been followed up appropriately.
    • If details relevant to a post-mortem are recorded in BZ or elsewhere, please link to them from the post-mortem.
  • Discuss whether we learned anything else from each incident, and follow up so that it informs future decisions.
  • Notes are written up collaboratively by everyone, so that others in the organization can learn from them as well.

Groupings

The post-mortems

Sean, Nuria Incident documentation meeting/20140318-EventLogging

Sean Incident documentation meeting/20140328-DB-Queries

Bryan, Reedy Incident documentation meeting/20140403-Deploy

Antoine, Faidon Incident documentation meeting/20140503-Thumbnails

Nuria Incident documentation meeting/20140509-EventLogging

Timo Incident documentation meeting/20140517-bits

Sean Incident documentation meeting/20140526-m1

Ori Incident documentation meeting/20140529-appservers

Nik Incident documentation meeting/20140607-Elasticsearch

Andrew O Incident documentation meeting/20140608-Kafka

Greg Incident documentation meeting/20140612-Math

Giuseppe Incident documentation meeting/20140613-Videoscalers

Andrew O Incident documentation meeting/20140618-Wikitech

Sean Incident documentation meeting/20140619-parsercache

Sean Incident documentation meeting/20140622-es1006

Filippo Incident documentation meeting/20140622-imagescaler

Nik Incident documentation meeting/20140625-CirrusSearch