Grafana visualizes metrics from Prometheus and other data sources.
Dashboard design principles:
- USE method: Utilization, Saturation, Errors for resources
- RED method: Rate, Errors, Duration for services
- Top-down: Start with high-level health, drill down to details
Good dashboards:
- Answer "Is the system healthy?" at a glance
- Use consistent colors (red = bad, green = good)
- Show SLO status prominently
- Include links to runbooks
Interview question: "How would you design a dashboard for a new service?"
Start with RED metrics. Add resource utilization. Include SLO burn rate. Link to logs and traces.