Apply

Senior Site Reliability Engineer (SRE) - Disaster Recovery Specialist (m/f/x)

Posted 2024-11-20

View full description

💎 Seniority level: Senior, 5+ years

🔍 Industry: Software Development

🗣️ Languages: English

⏳ Experience: 5+ years

Requirements:
  • Degree in Computer Science, Information Technology, or a related field.
  • 5+ years of hands-on experience in site reliability engineering, ideally with a focus on disaster recovery.
  • Strong expertise in designing and implementing disaster recovery solutions using leading technologies.
  • Proficiency in cloud platforms such as AWS, Azure, or Google Cloud Platform.
  • Experience with infrastructure as code (IaC) tools like Terraform or CloudFormation.
  • Excellent communication skills for collaboration with cross-functional teams and non-technical stakeholders.
Responsibilities:
  • Design, implement, and maintain disaster recovery solutions for a cloud-based SaaS environment.
  • Develop and document comprehensive disaster recovery plans, procedures, and runbooks.
  • Conduct drills and exercises to validate the effectiveness of disaster recovery plans.
  • Collaborate with engineering, operations, and security teams to identify and mitigate risks.
  • Proactively monitor system performance and health metrics, implement preventive measures.
  • Participate in incident response and post-incident reviews to analyze root causes and implement corrective actions.
Apply

Related Jobs

Apply

🧭 Full-Time

🔍 Software / SaaS

  • Degree in Computer Science, Information Technology, or a related field.
  • 5+ years of hands-on experience in site reliability engineering, ideally with a focus on disaster recovery.
  • Experience in a cloud-based SaaS environment.
  • Strong expertise in designing and implementing disaster recovery solutions using industry-leading technologies and methodologies.
  • Proficiency in cloud platforms such as AWS, Azure, or Google Cloud Platform.
  • Experience with infrastructure as code (IaC) tools such as Terraform or CloudFormation.
  • Excellent communication skills with the ability to effectively collaborate with cross-functional teams and communicate technical concepts to non-technical stakeholders.

  • Design, implement, and maintain disaster recovery solutions for cloud-based SaaS environments.
  • Develop and document comprehensive disaster recovery plans, procedures, and runbooks.
  • Conduct drills and exercises to test and validate the effectiveness of these plans.
  • Collaborate with engineering, operations, and security teams to identify and mitigate potential risks to system availability and data integrity.
  • Monitor system performance and health metrics; proactively identify areas for improvement.
  • Implement preventive measures to enhance system reliability and resilience.
  • Participate in incident response and post-incident reviews; analyze root causes of failures.
  • Implement corrective actions to prevent recurrence.
Posted 2024-11-21
Apply

Related Articles

Remote Job Certifications and Courses to Boost Your Career

August 22, 2024

Insights into the evolving landscape of remote work in 2024 reveal the importance of certifications and continuous learning. This article breaks down emerging trends, sought-after certifications, and provides practical solutions for enhancing your employability and expertise. What skills will be essential for remote job seekers, and how can you navigate this dynamic market to secure your dream role?

How to Balance Work and Life While Working Remotely

August 19, 2024

Explore the challenges and strategies of maintaining work-life balance while working remotely. Learn about unique aspects of remote work, associated challenges, historical context, and effective strategies to separate work and personal life.

Weekly Digest: Remote Jobs News and Trends (August 11 - August 18, 2024)

August 18, 2024

Google is gearing up to expand its remote job listings, promising more opportunities across various departments and regions. Find out how this move can benefit job seekers and impact the market.

How to Onboard Remote Employees Successfully

August 16, 2024

Learn about the importance of pre-onboarding preparation for remote employees, including checklist creation, documentation, tools and equipment setup, communication plans, and feedback strategies. Discover how proactive pre-onboarding can enhance job performance, increase retention rates, and foster a sense of belonging from day one.

Remote Work Statistics and Insights for 2024

August 13, 2024

The article explores the current statistics for remote work in 2024, covering the percentage of the global workforce working remotely, growth trends, popular industries and job roles, geographic distribution of remote workers, demographic trends, work models comparison, job satisfaction, and productivity insights.