1. SRE Principles & Practices
- What is Site Reliability Engineering?
- SRE & DevOps: What is the Difference?
- SRE Principles & Practices
2. Service Level Objectives & Error Budgets
- Service Level Objectives (SLO’s)
- Error Budgets
- Error Budget Policies
3. Reducing Toil
- What is Toil?
- Why is Toil Bad?
- Doing Something About Toil
4. Monitoring & Service Level Indicators
- Service Level Indicators (SLI’s)
- Monitoring
- Observability
5. SRE Tools & Automation
- Automation Defined
- Automation Focus
- Hierarchy of Automation Types
- Secure Automation
- Automation Tools
6. Anti-Fragility & Learning from Failure
- Why Learn from Failure
- Benefits of Anti-Fragility
- Shifting the Organizational Balance
7. Organizational Impact of SRE
- Why Organizations Embrace SRE
- Patterns for SRE Adoption
- On-Call Necessities
- Blameless Post-Mortems
- SRE & Scale
- Shifting the Organizational Balance
8. SRE, Other Frameworks, The Future
- SRE & Other Frameworks
- SRE Evolution and the Future
Course Code:
srebspk
Duration:
14 hours