Senior Site Reliability Engineer

New

#BI-RemoteFull-TimeSenior

Salary not disclosed

Apply NowOpens the employer's application page

Job Details

Experience: 5+ years of professional experience
Required Skills: AWSPythonJavascriptKubernetesTypeScriptGoCI/CDLinuxTerraform

5+ years of professional experience in Site Reliability Engineering, Infrastructure, or DevOps roles supporting production SaaS applications.
Strong proficiency with AWS core services (including EC2, VPC, S3).
Experience in Serverless frameworks and managed Kubernetes environments like EKS.
Extensive experience writing and managing Infrastructure as Code (IaC) using Terraform.
Familiarity with configuring, troubleshooting, and maintaining data layers such as MongoDB, Redshift, and OpenSearch.
Experience managing or modernizing CI/CD pipelines and deployment workflows utilizing systems like Jenkins, AWS CodeBuild/CodePipeline, or GitHub Actions.
Deep understanding of Linux operating system fundamentals and Unix shell scripting.
Ability to read and debug code written in JavaScript/TypeScript, Python, or Go.
Strong communication and collaboration skills.

Architect and maintain scalable, secure cloud infrastructure to ensure high availability for core products.
Enhance observability and monitoring frameworks to deliver highly accurate alerts, minimizing noise and improving incident detection.
Participate in on-call rotations and lead incident response, ensuring comprehensive post-mortems and RCAs are completed to drive systemic improvements.
Optimize and modernize deployment pipelines and automation workflows to maximize engineering velocity and operational safety.
Partner with product development teams to provide infrastructure support, review architectural changes, and promote reliability best practices.
Implement and uphold robust security standards and compliance controls across all managed cloud infrastructure.

View Full Description & ApplyYou'll be redirected to the employer's site