- Define SRE principles including observability and operational excellence.
- Drive adoption of SLIs, SLOs, and error budgets across the organization.
- Champion architectural improvements to enhance system reliability and deployment velocity.
- Conduct production readiness reviews and capacity planning.
- Manage the lifecycle of microservices from inception through operation.
- Lead incident management and conduct blameless postmortems.
- Mentor engineers on SRE practices and foster a culture of service ownership.
AWSMicroservicesDistributed Systems