Site Reliability Engineer III

United States, Full-Time, Senior
Salary: 148,320 - 185,400 USD per year

Job Details

Experience
5+ years
Required Skills
AWS, Python, Bash, Jenkins, Go, Terraform, GitHub, Datadog, CloudFormation

Requirements

  • 5+ years of experience in SRE, DevOps, or a related engineering role
  • Advanced hands-on expertise in AWS production environments and core services including Lambda, ECS, S3, ALB, and GuardDuty
  • Strong proficiency in infrastructure-as-code tooling such as Terraform, CloudFormation, or CDK
  • Experience building and operating CI/CD pipelines using Jenkins and GitHub
  • Proficiency in Python, Go, or Bash for automation
  • Hands-on experience with Datadog or a comparable observability platform
  • Demonstrated experience leading incident response in complex, distributed systems
  • Familiarity with SOC 2 compliance frameworks and audit readiness

Responsibilities

  • Architect, implement, and operate scalable, resilient, and secure AWS infrastructure
  • Lead infrastructure-as-code initiatives to ensure all environments are reproducible
  • Design, maintain, and improve CI/CD pipelines using Jenkins and GitHub
  • Own the Datadog observability platform, including dashboards, monitors, and alerting
  • Serve as a senior technical responder across the full incident lifecycle
  • Refine, implement, and test disaster recovery plans to meet RTO/RPO objectives
  • Mentor junior SREs through code reviews, incident pairing, and documentation