Skip to content
devops

SRE

Site Reliability Engineering

Definition

SRE is a discipline that applies software engineering practices to operations and infrastructure, treating reliability as a software problem. Developed at Google, SRE practices include defining SLOs, managing error budgets, reducing toil through automation, blameless postmortems, and capacity planning.

SRE teams own the reliability of production systems and collaborate with development teams on production readiness.


Ship secure code faster

Crash Override integrates security into the developer workflow. No context switching, no waiting on reviews.