Post

Essential Monitoring Metrics for Cloud Native Systems - Part 2

Master cloud-native observability. Learn the essential monitoring metrics, Golden Signals - errors and saturation, to keep your distributed systems reliable and fast.

Essential Monitoring Metrics for Cloud Native Systems - Part 2

A practical guide

In the first part, I covered the two initial signals to diagnose that something is wrong:

  • Latency
  • Traffic

Those two alone explain a surprising number of production incidents. But they don’t explain everything. Rising latency tells you a problem is developing. Traffic tells you what the system is dealing with.

I mentioned two more signals:

  • Errors
  • Saturation

These two tell you something more important - whether the system is approaching failure. And this is where monitoring becomes truly operational. I will cover those two signals in this blog. Let us start with Errors.

Errors — The most misunderstood signal

… WIP

This post is licensed under CC BY 4.0 by the author.