Jeremiah Cargill
Senior Infrastructure & Platform Engineer
Home
Projects
Resume
About
Contact
Light mode
Observability & Reliability (SRE) Reference
Repository →
Architecture (sanitized)
Key Patterns
SLIs/SLOs and error budgets (documentation + examples)
Actionable alerting with burn-rate framing (reference)
Incident response loop and blameless postmortems
Reliability as an outcome: reduce MTTR, prevent recurrence
What this demonstrates
SRE mindset beyond dashboards: measurable reliability outcomes
Alerts tied to user impact and error budgets
Incident discipline: ownership, communication, learning loop
Template artifacts that scale across teams
Recommended next enhancements
Add a worked SLO example with burn-rate math
Add escalation roles and incident severity rubric
Add dashboard screenshot placeholders (sanitized)