Principal Site Reliability Engineer

New
C
Copper.coDigital Asset Infrastructure
Remote - EMEAFull-TimePrincipal
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Required Skills
AWSMicroservicesDistributed Systems

Requirements

  • Experience designing, analyzing, and troubleshooting distributed systems or micro-services architectures.
  • Established expertise in observability and incident management.
  • Proven experience in driving organizational change.
  • Excellent communication skills with a systematic problem-solving approach.
  • Experience working with production workloads in AWS (desirable).
  • Experience working in financial services or similarly regulated environments (desirable).
  • Interest in blockchain-based technologies or decentralized finance (desirable).
  • Master's degree in Computer Science or Engineering (desirable).

Responsibilities

  • Define SRE principles including observability and operational excellence.
  • Drive adoption of SLIs, SLOs, and error budgets across the organization.
  • Champion architectural improvements to enhance system reliability and deployment velocity.
  • Conduct production readiness reviews and capacity planning.
  • Manage the lifecycle of microservices from inception through operation.
  • Lead incident management and conduct blameless postmortems.
  • Mentor engineers on SRE practices and foster a culture of service ownership.
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now