Senior Site Reliability Engineer
New
H
HavocAIAutonomous maritime defense
RemoteFull-TimeSenior
Salary150,000 - 185,000 USD per year
Apply NowOpens the employer's application page
Job Details
- Experience
- 7+ years
- Required Skills
- PythonKubernetesGoLinuxDistributed Systems
Requirements
- 7+ years in SRE or infrastructure engineering
- Strong experience with large-scale distributed systems
- Linux
- Networking
- Cloud infrastructure
- Proficiency in Kubernetes
- Container orchestration
- Programming/scripting (Go
- Python)
- Experience designing observability systems and leading incident response
- Must be a U.S
- Citizen.
Responsibilities
- Design and evolve reliability architecture
- Define SLIs/SLOs
- Lead incident response
- Conduct root cause analysis
- Maintain observability systems
- Build automation for deployment safety
- Collaborate with security teams
- Drive operational maturity through runbooks
- Resilience testing
- Capacity planning for mission-critical autonomy and data-intensive workloads.
View Full Description & ApplyYou'll be redirected to the employer's site