What are we solving? When you are working in a cloud-native environment, you cannot continue making the assumptions that you…| distributed-computing-musings.com
A couple of years back I wrote a post describing the thundering herd problem. Looking back now, I had only a cursory knowledge of the problem. I knew what it was & how it could impact an application. I knew a possible solution to the problem. But these were individual pieces of information. There was nothing connecting them to give me an end-to-end view & that was primarily because I had seen neither the problem nor the solution in action. I knew the fundamental blocks...| Distributed Computing Musings
9 min read | distributed-computing-musings.com
What are we solving?| Distributed Computing Musings
9 min read | distributed-computing-musings.com
API gateways are probably one of the first few keywords you hear when you enter the world of cloud computing. […]| Distributed Computing Musings
Continuing from our previous cloud bits post about service configuration, we will explore the topic of service discovery in this […]| Distributed Computing Musings
Today most of the application software we develop ends up getting deployed to the cloud & along with it comes […]| Distributed Computing Musings
“Observability is the lens through which the invisible becomes visible, turning complex systems into understandable narratives.” So before taking a […]| Distributed Computing Musings
In the last post, we explored how we can leverage tools such as Prometheus & Grafana for monitoring our applications. […]| Distributed Computing Musings
In the last post, we touched upon the requirement for observability & understood the basic components that form the common […]| Distributed Computing Musings
Imagine you are working on an issue in an existing feature. You started on Monday by reproducing the issue, you […]| Distributed Computing Musings
Microservices usually come into architecture discussions when you are facing challenges with the existing infrastructure (in most cases a monolith). […]| Distributed Computing Musings
Deep learning (DL) models have different access requirements from the data store than the typical workloads for which existing storage systems are built. Also, the data scale required for training these models ranges into petabytes. Hence having a storage system customized for this use case is essential & Nvidia has developed AIStore, an open-source solution to tackle this problem. As part of this post, we will go through the paper describing the architecture for AIStore along with custom...| Distributed Computing Musings
10 min read | distributed-computing-musings.com
For my next post, I was looking to review the paper on the Raft consensus algorithm. But this is one paper which I found has a lot of implementation details. At a high level it does say that we rely on consensus across nodes for performing an operation and have a leader election component for when a leader node goes down. But Raft presents a very simple and understandable approach for these two components. Understandability is the main focus of the paper and it emphasizes that ...| Distributed Computing Musings
Viewstamped replication (VR) is a replication technique that handles failures when one or more nodes in a cluster crash. It works as a wrapper on top of a non-distributed system & allows the underlying business logic to be applied independently while the protocol itself takes care of replication. The protocol was introduced in an earlier paper and was later revised with a set of optimizations in a new paper, Viewstamped Replication Revisited. | Distributed Computing Musings