In this guide, we’ll show technologies and examples of full stack observability for an application running on Kubernetes, OpenTelemetry and AWS.| Logz.io
How Evernote and Home Depot adpted SLOs to enhance reliability. Learn from their experiences with SLos and error budgets for improved service quality.| sre.google
Strategies for enhancing data processing pipelines, including pipelines design, best practices, and case studies to boost efficiency and reliability.| sre.google
Discover how canary release can improve deployment safety by testing new changes on a small portion of users before a full rollout.| sre.google
Our technical blog.| source.coveo.com
Comments/Insights/Contributions from * Niall Murphy * Toby Burress * Štěpán Davidovič * Sal Furino (Note that when I say "we" below, I don't specifically intend to speak for these fine people, I'm just using the academic "we". -Niall) Introduction If you don’t already know about SLOs, we can recommend Alex Hidalgo’| RelyAbility Blog
Introduction to SLI, examples, counterexamples and tips| blog.alexewerlof.com
SREs optimize their time by eliminating toil, the repetitive, predictable tasks related. The characteristics of toil and operational efficiency.| sre.google