Senior Site Reliability Engineer
Based in the United StatesFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Required Skills
- AWSBashKubernetesCI/CDLinuxDevOpsTerraform
Requirements
- Senior-level experience in Site Reliability Engineering, DevOps, or Systems Engineering.
- Proven track record of operating production systems at scale in cloud environments.
- Deep hands-on expertise with Kubernetes and AWS (networking, compute, storage).
- Highly proficient with infrastructure-as-code tools such as Terraform.
- Strong experience with CI/CD pipelines and deployment automation (GitHub Actions, GitLab).
- Comfortable working with Linux systems, debugging production issues, and writing scripts (Bash).
- Effective communicator capable of creating documentation and runbooks.
Responsibilities
- Design and maintain scalable infrastructure-as-code solutions using Terraform and Kubernetes.
- Build and operate observability systems including monitoring, logging, and alerting.
- Lead incident response, postmortems, and reliability improvements.
- Embed security and compliance practices into infrastructure and operational workflows.
- Optimize system performance, reliability, and cloud costs.
- Eliminate operational toil by developing automation tools.
- Partner with product and platform teams to improve APIs, deployment systems, and developer experience.
View Full Description & ApplyYou'll be redirected to the employer's site