Site Reliability Engineer III
New
United StatesFull-TimeSenior
Salary148,320 - 185,400 USD per year
Apply NowOpens the employer's application page
Job Details
- Experience
- 5+ years
- Required Skills
- AWSPythonBashJenkinsGoTerraformGitHubDatadogCloudFormation
Requirements
- 5+ years of experience in SRE, DevOps, or a related engineering role
- Advanced hands-on expertise in AWS production environments and core services including Lambda, ECS, S3, ALB, and GuardDuty
- Strong proficiency in infrastructure-as-code tooling such as Terraform, CloudFormation, or CDK
- Experience building and operating CI/CD pipelines using Jenkins and GitHub
- Proficiency in Python, Go, or Bash for automation
- Hands-on experience with Datadog or a comparable observability platform
- Demonstrated experience leading incident response in complex, distributed systems
- Familiarity with SOC 2 compliance frameworks and audit readiness
Responsibilities
- Architect, implement, and operate scalable, resilient, and secure AWS infrastructure
- Lead infrastructure-as-code initiatives to ensure all environments are reproducible
- Design, maintain, and improve CI/CD pipelines using Jenkins and GitHub
- Own the Datadog observability platform, including dashboards, monitors, and alerting
- Serve as a senior technical responder across the full incident lifecycle
- Refine, implement, and test disaster recovery plans to meet RTO/RPO objectives
- Mentor junior SREs through code reviews, incident pairing, and documentation
View Full Description & ApplyYou'll be redirected to the employer's site