Staff Site Reliability Engineer

100% Remote - Anywhere within the USFull-TimeStaff

Salary180,000 - 250,000 USD per year

Apply NowOpens the employer's application page

Job Details

At least 10 years of experience as a Staff SRE or senior-level infrastructure engineer supporting production systems at scale
Strong expertise in AWS and GCP cloud platforms
Familiarity with queue-based or event-driven architectures and autoscaling technologies
Experience designing and improving CI/CD infrastructure and deployment automation
Hands-on Kubernetes operations experience, including scalability and workload reliability
Experience with observability tools, monitoring strategies, and incident management practices
Proficiency with Infrastructure-as-Code tools such as Terraform or comparable technologies
Strong communication and collaboration skills across engineering teams
Experience using AI-assisted development tools such as Claude Code, Codex, or similar technologies

Design, improve, and maintain scalable CI/CD pipelines and deployment processes
Establish reliable staging and development environments aligned with production standards
Build and manage observability practices, including monitoring, alerting, dashboards, and SLO frameworks
Partner with engineering teams to support platform modernization and infrastructure reliability
Drive Infrastructure-as-Code (IaC) standards and multi-cloud operational consistency across AWS and GCP
Provide technical leadership on infrastructure architecture, reliability, and operational best practices
Support incident response, system reliability, and operational readiness initiatives
Utilize AI-assisted development tools to support infrastructure analysis and improvement efforts

View Full Description & ApplyYou'll be redirected to the employer's site