Introduction A while back, my colleagues and I, ran a Kubernetes cluster with large nodes with about 300-400 containers running on each node. It was running on Linux CentOS 7, with a linux-3.10.0-1160.88.1.el7 kernel. And after about three years of mostly stable cluster, our nodes started randomly freezing. It usually started in the morning, and […] The post Debugging a Random Node Lock Up in a Linux Kernel appeared first on Povilas Versockas.