| Heinrich Hartmann on HeinrichHartmann.com
Abstract In the talk, we will discuss the frontiers of reliability engineering, reflecting on a decade of advancements and identifying the key challenges that remain in building reliable, observable software systems. We take inspiration from our journey at Zalando, where we have embraced trends like hardware outsourcing to AWS, packaging applications in Docker, and fully automating deployments with CI/CD. We’ve also implemented distributed tracing for microservice observability. However, ne...| Heinrich Hartmann on HeinrichHartmann.com
Heinrich Hartmann discusses Zalando’s approach to reliability engineering at the GOTO Amsterdam 2024 conference, covering best practices from small to large scale operations.| Heinrich Hartmann on HeinrichHartmann.com
This video is a re-recording of the SLO section of the Statistics for Engineers talk at SRECon EMEA 2023. Abstract In the video, we will delve into the concept of Service Level Objectives (SLOs) within engineering, particularly focusing on their role in steering management and investment decisions through data-driven reliability assessments. Our discussion will cover the fundamental goal of SLOs in balancing reliability, productivity, and team health by defining clear objectives. We intend to...| Heinrich Hartmann on HeinrichHartmann.com
Abstract In the talk, we will discuss the statistical methods that are most relevant to your daily work as an SRE. As SREs, we are constantly confronted with a wealth of telemetry data collected from our systems. Interpreting this data to extract operational information is a key part of our job. Statistics is here to help! Statistics is the art of extracting information from data. We will get up to speed with the basics and see how they apply to the operational domain. Furthermore, we will ...| Heinrich Hartmann on HeinrichHartmann.com
Abstract Measuring latency for monitoring and benchmarking purposes is notoriously difficult. There are a lot of pitfalls with collecting, aggregating and analyzing latency data. In the talk, we will make an effort to visit this topic from a top-down perspective and compile known complications and best-practice approaches on how to avoid them. This will include: measurement overhead; queuing effects; coordinated omission; histograms for aggregation and visualization; percentile aggregation; Latenc...| Heinrich Hartmann on HeinrichHartmann.com
Abstract In this talk, we are going to survey different available technologies to capture (latency) distributions and store them in time-series databases. This includes the theoretical underpinnings, accuracy and performance, operational aspects, and adoption. Our aim is to provide an overview of related work in the histogram space and share evaluation results. We also discuss the importance of histograms for latency SLOs and the recent developments in open-source technology that are embracin...| Heinrich Hartmann on HeinrichHartmann.com
Abstract In the talk, we will explore the challenges of measuring and aggregating latency for Service Level Objectives (SLOs) effectively. Latency serves as a crucial metric in assessing service quality, yet its measurement, particularly via percentile metrics, faces challenges in situations requiring extensive aggregation across time periods and nodes. We identify these challenges, explaining the pitfalls of using percentile metrics for latency SLOs and offering three practical solutions: us...| Heinrich Hartmann on HeinrichHartmann.com
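The percentile pitfall named in the abstract above can be illustrated with a minimal sketch. The node names, request counts, and latency values are invented for illustration, and the nearest-rank `percentile` helper is an assumption of this sketch, not something taken from the talk.

```python
def percentile(values, q):
    """Nearest-rank percentile for an integer q in 1..100.

    Returns the smallest sample such that at least q% of all
    samples are less than or equal to it.
    """
    ordered = sorted(values)
    # Integer ceiling of q * n / 100, avoiding floating-point rounding.
    rank = (q * len(ordered) + 99) // 100
    return ordered[rank - 1]


# Hypothetical per-node latencies in milliseconds (made-up data):
# a busy, fast node and a quiet, slow node.
node_a = [10] * 1000
node_b = [500] * 100

# Naive aggregation: average the per-node p99 values.
averaged_p99 = (percentile(node_a, 99) + percentile(node_b, 99)) / 2

# Reference: p99 computed over all requests from both nodes.
merged_p99 = percentile(node_a + node_b, 99)

print(averaged_p99)  # 255.0
print(merged_p99)    # 500
```

In this made-up case the averaged p99 (255 ms) is far from the p99 over all requests (500 ms), because averaging ignores how many requests each node served and how they were distributed.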
Abstract In the talk, we will discuss the importance of latency as a key indicator of service quality, highlighting the challenges in measuring it accurately. Traditional methods like CPU utilization or request counts don’t capture the complexities of latency. We will explore the shortcomings of popular percentile metrics, especially for setting Service Level Objectives (SLOs) over extended periods. Our presentation will delve into the pitfalls of current practices and propose three practic...| Heinrich Hartmann on HeinrichHartmann.com
Abstract In this talk, we will discuss the Circonus telemetry platform’s architectural evolution from bare-metal to Google Cloud and the lessons learned from various design failures. As SREs, we use Service Level Objectives (SLOs) to manage Circonus and have faced numerous recursive lessons reflective of data handling complexities. Along this journey, we’ve committed several ‘crimes against computing’, and through this session, we aim to explore in detail the systems architecture from...| Heinrich Hartmann on HeinrichHartmann.com
[00:05] Hello, thanks for having me. I'm really excited to have the opportunity to speak here on a topic that is very close to my heart and interests, namely statistics for engineers. Yay, statistics! Who loves statistics? Yeah, that's very nice to see. Before I dive into all the things, I just want to say a few words about myself. My name is Heinrich, and I live in Munich at the moment. For those not familiar with Munich, it's a city in the EU, still in the EU, and I'm very happy to be a memb...| www.heinrichhartmann.com
2023-10-21 Geometry HeinrichHartmann.com Convolution Algebras and Discrete Geometry - a Study. We start our exploration of numerical geometry with a function on a plane. We represent this function by an array: F[x,y] where x,y are points on an equidistant grid with step size h. # Example Function import numpy as np import matplotlib.pyplot as plt def f(x, y): ...| Posts on HeinrichHartmann.com
Statistics for Engineers @ SRECon EMEA 2023| www.heinrichhartmann.com
Investment Calculator - Solar Plant| www.heinrichhartmann.com
The Solar Plant Investment| www.heinrichhartmann.com
Credits| www.heinrichhartmann.com
Services| www.heinrichhartmann.com