Staff Site Reliability Engineer
100% Remote - Anywhere within the USFull-TimeStaff
Salary180,000 - 250,000 USD per year
Apply NowOpens the employer's application page
Job Details
- Experience
- At least 10 years of experience
- Required Skills
- AWSGCPKubernetesCI/CDTerraformSaaS
Requirements
- At least 10 years of experience as a Staff SRE or senior-level infrastructure engineer supporting production systems at scale
- Strong expertise in AWS and GCP cloud platforms
- Familiarity with queue-based or event-driven architectures and autoscaling technologies
- Experience designing and improving CI/CD infrastructure and deployment automation
- Hands-on Kubernetes operations experience, including scalability and workload reliability
- Experience with observability tools, monitoring strategies, and incident management practices
- Proficiency with Infrastructure-as-Code tools such as Terraform or comparable technologies
- Strong communication and collaboration skills across engineering teams
- Experience using AI-assisted development tools such as Claude Code, Codex, or similar technologies
Responsibilities
- Design, improve, and maintain scalable CI/CD pipelines and deployment processes
- Establish reliable staging and development environments aligned with production standards
- Build and manage observability practices, including monitoring, alerting, dashboards, and SLO frameworks
- Partner with engineering teams to support platform modernization and infrastructure reliability
- Drive Infrastructure-as-Code (IaC) standards and multi-cloud operational consistency across AWS and GCP
- Provide technical leadership on infrastructure architecture, reliability, and operational best practices
- Support incident response, system reliability, and operational readiness initiatives
- Utilize AI-assisted development tools to support infrastructure analysis and improvement efforts
View Full Description & ApplyYou'll be redirected to the employer's site