The concept of blameless culture has been around for a long time in other industries, and while the history isn’t clear, you could argue that it became an “official” part of the tech industry with the publication of the definitive book Site Reliability Engineering in 2016. My summary of blameless culture is: when there is […]| cat /dev/brain
I’ve never heard of a company that has a business, that doesn’t also occasionally have things go wrong. Something going wrong might turn into a support ticket, an angry email, or an alert popping up on an on-call engineer’s phone. If there is user or business impact, and an engineer might need to respond, then it becomes an incident. After the incident, the folks involved in mitigation write an Incident Review Template, and the that document is discussed in this meeting, the Incident Re...| infraeng.dev