filmov
tv
Debugging Distributed Systems by Donny Nadolny
![preview_player](https://i.ytimg.com/vi/CmuA9yhCmNY/maxresdefault.jpg)
Показать описание
Despite our best efforts, our systems fail. Sometimes it’s our fault - code that we wrote or bugs that we caused. But sometimes the fault is with systems that we rely on.
ZooKeeper is a very useful distributed system that is often used as a building block for other distributed systems, like Kafka and Spark. It is used by PagerDuty for many critical systems, and for five months it failed on us a lot.
We will walk through the process of finding and fixing one cause of many of these failures. You will learn how to use various tools to stress test the network, some intricate details of how ZooKeeper works, and possibly more than you wanted to know about TCP including an example of machines having a different view of the state of a TCP stream.
Donny Nadolny is a developer at PagerDuty. He has been using Java for many years, becoming a Sun Certified Java Programmer (for Java 1.4) even before getting his drivers license, and is always interested in talking about distributed systems.
[FWD-0632]
ZooKeeper is a very useful distributed system that is often used as a building block for other distributed systems, like Kafka and Spark. It is used by PagerDuty for many critical systems, and for five months it failed on us a lot.
We will walk through the process of finding and fixing one cause of many of these failures. You will learn how to use various tools to stress test the network, some intricate details of how ZooKeeper works, and possibly more than you wanted to know about TCP including an example of machines having a different view of the state of a TCP stream.
Donny Nadolny is a developer at PagerDuty. He has been using Java for many years, becoming a Sun Certified Java Programmer (for Java 1.4) even before getting his drivers license, and is always interested in talking about distributed systems.
[FWD-0632]
Debugging Distributed Systems by Donny Nadolny
SE-Radio-Episode-282-Donny-Nadolny-on-Debugging-Distributed-Systems
Three years of breaking things to make them better by Donny Nadolny
A walk to remember: Debugging a distributed system failure
'Testing Distributed Systems w/ Deterministic Simulation' by Will Wilson
A walk to remember: Debugging a distributed system failure - Flavio Percoco - Midwest.io 2016
Debugging Production Issues: Junior Developer vs Senior Developer
Arch Linux Is Finally Getting Debug Packages!!
LISA19 - Why Are Distributed Systems So Hard?
Debugging Go Applications - Matt Boyle, Cloudflare
Dan Bentley on Build Systems a la Carte [PWL NYC]
PWLTO#8 – Donny Nadolny on Raft
Troubleshooting & Debugging Production Microservices in Kubernetes with Ray Tsang
Webinar: The Power of the Individual User Session in Debugging
Designing Distributed Systems with TLA+ • Hillel Wayne • YOW! 2019
Geoffroy Couprie - Parsing safely, from 500MB/S to 2GB/s | Code Mesh LDN 18
PWLSF - 03/2014 - Ryan Kennedy and Anjali Shenoy on Dapper
The Diabolical Developer's Guide to Perfomance Tuning Martijn Verburg
SREcon19 Asia/Pacific - Why Does My Monitoring Suck?
Coroutines for Java Developers by Eugene Petrenko
Architectures That Scale Deep - Regaining Control in Deep Systems
20 Whitepapers that changed the world [For Senior Software Engineers]
code::dive 2017 – Sławomir Zborowski – Distributed applications are easy. Just read the fine print...
Creating and Evolving Elixir (with José Valim)
Комментарии