Senior Site Reliability Engineer
New
#BI-RemoteFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Experience
- 5+ years of professional experience
- Required Skills
- AWSPythonJavascriptKubernetesTypeScriptGoCI/CDLinuxTerraform
Requirements
- 5+ years of professional experience in Site Reliability Engineering, Infrastructure, or DevOps roles supporting production SaaS applications.
- Strong proficiency with AWS core services (including EC2, VPC, S3).
- Experience in Serverless frameworks and managed Kubernetes environments like EKS.
- Extensive experience writing and managing Infrastructure as Code (IaC) using Terraform.
- Familiarity with configuring, troubleshooting, and maintaining data layers such as MongoDB, Redshift, and OpenSearch.
- Experience managing or modernizing CI/CD pipelines and deployment workflows utilizing systems like Jenkins, AWS CodeBuild/CodePipeline, or GitHub Actions.
- Deep understanding of Linux operating system fundamentals and Unix shell scripting.
- Ability to read and debug code written in JavaScript/TypeScript, Python, or Go.
- Strong communication and collaboration skills.
Responsibilities
- Architect and maintain scalable, secure cloud infrastructure to ensure high availability for core products.
- Enhance observability and monitoring frameworks to deliver highly accurate alerts, minimizing noise and improving incident detection.
- Participate in on-call rotations and lead incident response, ensuring comprehensive post-mortems and RCAs are completed to drive systemic improvements.
- Optimize and modernize deployment pipelines and automation workflows to maximize engineering velocity and operational safety.
- Partner with product development teams to provide infrastructure support, review architectural changes, and promote reliability best practices.
- Implement and uphold robust security standards and compliance controls across all managed cloud infrastructure.
View Full Description & ApplyYou'll be redirected to the employer's site