Ananth Shrinivas Srinath

Reliability Challenges in Large Distributed Systems

A technical overview of reliability challenges that underlie some of the most damaging outages affecting large distributed systems. Instead of studying individual outages, we will look at common patterns that repeatedly show up over time across systems in the industry.
 


 

Biography

I am a Technical Leader with two decades of experience in building and operating large-scale distributed systems. I have worked at all layers of the application and infrastructure stack, with deep expertise in storage and cloud systems. I lead teams towards long-term reliability goals with a hands-on approach to problem solving. I enjoy reasoning from first principles, working on hard scaling problems, and partnering with nimble engineering teams to advance the state of the art.