"You can't fix what you can't see." Observability is how you understand system behavior in production.
SRE interviews test your ability to design monitoring, write alerts, and debug using telemetry. I'll cover the three pillars (metrics, logs, traces), common tools like Prometheus and Grafana, and how to avoid alert fatigue.