Senior Site Reliability Engineer

Based in the United StatesFull-TimeSenior

Salary not disclosed

Apply NowOpens the employer's application page

Job Details

Senior-level experience in Site Reliability Engineering, DevOps, or Systems Engineering.
Proven track record of operating production systems at scale in cloud environments.
Deep hands-on expertise with Kubernetes and AWS (networking, compute, storage).
Highly proficient with infrastructure-as-code tools such as Terraform.
Strong experience with CI/CD pipelines and deployment automation (GitHub Actions, GitLab).
Comfortable working with Linux systems, debugging production issues, and writing scripts (Bash).
Effective communicator capable of creating documentation and runbooks.

Design and maintain scalable infrastructure-as-code solutions using Terraform and Kubernetes.
Build and operate observability systems including monitoring, logging, and alerting.
Lead incident response, postmortems, and reliability improvements.
Embed security and compliance practices into infrastructure and operational workflows.
Optimize system performance, reliability, and cloud costs.
Eliminate operational toil by developing automation tools.
Partner with product and platform teams to improve APIs, deployment systems, and developer experience.

View Full Description & ApplyYou'll be redirected to the employer's site