Distributed TRACING & METRICS collection for SCALA | Carlos Teixeira

Показать описание

More often than we would care to admit, we've been stuck looking through endless lines of logs, code, documentation, StackOverflow posts, ... trying to find the reason behind the mysterious disappearance of some row in our database, or why some service always seems to fail for no apparent reason the first Tuesday of each month.

Unfortunately for us, the answer isn't always obvious and usually requires an analysis of multiple services and how they interact with each other. In this case, for instance, maybe a monthly report is generated on the first Tuesday of each month which makes way to many requests to our service causing our DB connection pool to throw random errors and, as a consequence, random fails to occur when inserting values in our database ;
funny enough, during the entire time, logs were being produced with the exact error but we just weren't able to see them in the middle of all the noise.

In this talk, we'll look into how we can integrate and use Kamon to give us a better view of what's going on in our applications and how they interact with each other. We'll look into how to measure CPU usage, query times, individual endpoints performance, custom metrics as well as how to add distributed tracing to our applications so that we can know how they work and how they interact with each other.

0:00 Introduction
0:58 Kamon
1:37 F{oo} Bar
2:45 Demo
3:46 Monitoring, Alerting, Tracing
12:46 Using Kamon