Principal Site Reliability Engineer
Copper.co
Digital Asset Infrastructure
Remote - EMEA
Full-Time
Principal
Salary not disclosed
Job Details
- Required Skills: AWS, Microservices, Distributed Systems
Requirements
- Experience designing, analyzing, and troubleshooting distributed systems or microservice architectures.
- Established expertise in observability and incident management.
- Proven experience in driving organizational change.
- Excellent communication skills with a systematic problem-solving approach.
- Experience working with production workloads in AWS (desirable).
- Experience working in financial services or similarly regulated environments (desirable).
- Interest in blockchain-based technologies or decentralized finance (desirable).
- Master's degree in Computer Science or Engineering (desirable).
Responsibilities
- Define SRE principles including observability and operational excellence.
- Drive adoption of SLIs, SLOs, and error budgets across the organization.
- Champion architectural improvements to enhance system reliability and deployment velocity.
- Conduct production readiness reviews and capacity planning.
- Manage the lifecycle of microservices from inception through operation.
- Lead incident management and conduct blameless postmortems.
- Mentor engineers on SRE practices and foster a culture of service ownership.