Principal Site Reliability Engineer

New

Copper.coDigital Asset Infrastructure

Remote - EMEAFull-TimePrincipal

Salary not disclosed

Apply NowOpens the employer's application page

Job Details

Experience designing, analyzing, and troubleshooting distributed systems or micro-services architectures.
Established expertise in observability and incident management.
Proven experience in driving organizational change.
Excellent communication skills with a systematic problem-solving approach.
Experience working with production workloads in AWS (desirable).
Experience working in financial services or similarly regulated environments (desirable).
Interest in blockchain-based technologies or decentralized finance (desirable).
Master's degree in Computer Science or Engineering (desirable).

Define SRE principles including observability and operational excellence.
Drive adoption of SLIs, SLOs, and error budgets across the organization.
Champion architectural improvements to enhance system reliability and deployment velocity.
Conduct production readiness reviews and capacity planning.
Manage the lifecycle of microservices from inception through operation.
Lead incident management and conduct blameless postmortems.
Mentor engineers on SRE practices and foster a culture of service ownership.

View Full Description & ApplyYou'll be redirected to the employer's site