Cloud Reliability & Recovery Engineer
IndiaFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Experience
- 5+ years of experience in cloud engineering, SRE, infrastructure, or disaster recovery roles, with at least 3+ years in AWS production environments
- Required Skills
- AWSPythonKubernetesCI/CDTerraformCloudFormation
Requirements
- 5+ years of experience in cloud engineering, SRE, infrastructure, or disaster recovery.
- 3+ years in AWS production environments at scale.
- Experience designing and operating multi-region disaster recovery architectures.
- Expertise in AWS resilience services, networking (VPC, DNS, VPN, Direct Connect), and storage/database replication.
- Hands-on experience with Terraform and/or CloudFormation.
- Proficiency in Python, Bash, or PowerShell.
- Understanding of Kubernetes-based deployments.
- Experience with CI/CD tools (e.g., GitHub Actions, CodePipeline, CodeBuild).
Responsibilities
- Design and implement highly available, multi-region and multi-AZ AWS architectures aligned with defined RTO/RPO objectives.
- Build and maintain disaster recovery (DR) solutions including automated failover/failback mechanisms.
- Develop and execute backup, restore, and data replication strategies across AWS services.
- Implement infrastructure as code using Terraform or CloudFormation.
- Create and maintain CI/CD-driven DR testing pipelines, including chaos engineering practices.
- Monitor system availability and resilience using CloudWatch and AWS health services.
- Conduct DR drills, tabletop exercises, and post-incident reviews.
View Full Description & ApplyYou'll be redirected to the employer's site