Scalable infrastructure is key in DevOps environments. Explore 10 strategies and best practices for achieving infrastructure scalability.| Spacelift
The tale of the July 4th surprise $2700 AWS bill. It is more nuanced than you think and might have exposed a bug.| Chris Short
An AWS Disaster Recovery plan is a set of procedures that ensure the continuity and recovery of IT systems and data in the event of a disaster.| N2W Software
At some companies, it is expected that certain engineers will serve as part of a formal on-call rotation. On paper, this seems like a reasonable way to ensure the reliability of the product. In practice, it is a miserable burden and you probably won't even get paid for it.| Scott Smitelli
There were 511 AWS announcements in pre:Invent season. I breakdown and snark about 43 of them relating to security and governance.| https://www.chrisfarris.com/
Planning the work for infrastructure engineering organization can be a challenge, in part due to a lack of clarity around what such an organization contributes value to the company it operates within. I have thoughts, and a simple thinking aid, for that.| lethain.com
Technical infrastructure is never complete. System processes can always run with less overhead or be bin-packed onto fewer machines. Data can be retrieved more quickly and stored at a cheaper cost per terabyte. System design can broaden the gap between failure and user impact. Transport layers can be more secure.| lethain.com
The concept of blameless culture has been around for a long time in other industries, and while the history isn’t clear, you could argue that it became an “official” part of the tech industry with the publication of the definitive book Site Reliability Engineering in 2016. My summary of blameless culture is: when there is […]| cat /dev/brain