Senior Site Reliability Engineer

New
H
HavocAIAutonomous maritime defense
RemoteFull-TimeSenior
Salary150,000 - 185,000 USD per year
Apply NowOpens the employer's application page

Job Details

Experience
7+ years
Required Skills
PythonKubernetesGoLinuxDistributed Systems

Requirements

  • 7+ years in SRE or infrastructure engineering
  • Strong experience with large-scale distributed systems
  • Linux
  • Networking
  • Cloud infrastructure
  • Proficiency in Kubernetes
  • Container orchestration
  • Programming/scripting (Go
  • Python)
  • Experience designing observability systems and leading incident response
  • Must be a U.S
  • Citizen.

Responsibilities

  • Design and evolve reliability architecture
  • Define SLIs/SLOs
  • Lead incident response
  • Conduct root cause analysis
  • Maintain observability systems
  • Build automation for deployment safety
  • Collaborate with security teams
  • Drive operational maturity through runbooks
  • Resilience testing
  • Capacity planning for mission-critical autonomy and data-intensive workloads.
View Full Description & ApplyYou'll be redirected to the employer's site
150,000 - 185,000 USD per year
Apply Now