You now understand reliability engineering at the level expected in SRE interviews.
What to remember:
- SLI measures behavior, SLO sets targets, SLA is the customer contract
- Error budget = allowed unreliability (1 minus the SLO). Use it to balance feature work against reliability work
- Toil is manual, repetitive work. Target less than 50% of time on toil (the goal given in the Google SRE book)
- Capacity plan by measuring growth, modeling demand, adding buffer
- Changes cause most outages. Use progressive rollouts and rollback plans
- Know Google SRE book principles: embracing risk, SLOs, eliminating toil
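
The error budget arithmetic from the list above can be sketched in a few lines. This is a minimal illustration, not a standard API: the function names, the request-based SLO, and the 30-day window are assumptions chosen for the example.

```python
def error_budget_requests(slo: float, total_requests: int) -> int:
    """Failed requests the SLO tolerates over the window (hypothetical helper)."""
    if not 0 < slo < 1:
        raise ValueError("slo must be a fraction strictly between 0 and 1")
    return round((1 - slo) * total_requests)


def allowed_downtime_minutes(slo: float, window_days: int = 30) -> float:
    """Minutes of full downtime the SLO tolerates over the window (assumed 30 days)."""
    if not 0 < slo < 1:
        raise ValueError("slo must be a fraction strictly between 0 and 1")
    return (1 - slo) * window_days * 24 * 60


def budget_remaining(slo: float, total_requests: int, failed_requests: int) -> float:
    """Fraction of the error budget still unspent; negative means the SLO is breached."""
    if not 0 < slo < 1:
        raise ValueError("slo must be a fraction strictly between 0 and 1")
    budget = (1 - slo) * total_requests
    return 1 - failed_requests / budget


if __name__ == "__main__":
    slo = 0.999  # 99.9% of requests should succeed
    print(error_budget_requests(slo, 1_000_000))
    print(round(allowed_downtime_minutes(slo), 1))
    print(round(budget_remaining(slo, 1_000_000, 250), 2))
```

With a 99.9% SLO, one million requests leave room for 1,000 failures, and a 30-day window allows roughly 43 minutes of downtime. When the remaining budget approaches zero, that is the signal to slow feature rollouts and spend time on reliability instead.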
Next, you'll learn incident management and on-call practices.