Apply

Site Reliability Engineer (Remote, US-based)

Posted 5 months agoViewed

View full description

💎 Seniority level: Senior, 5+ years

📍 Location: United States

🔍 Industry: Database DevOps

🏢 Company: Liquibase

🗣️ Languages: English

⏳ Experience: 5+ years

🪄 Skills: AWSAgileGrafanaPrometheusServerlessCommunication SkillsCollaborationCI/CDTerraform

Requirements:
  • Prior SRE experience supporting a cloud-native SaaS platform with AWS.
  • Bachelor’s degree in Computer Science, Software Engineering, or related field (or equivalent work experience).
  • AWS Solutions Architect and/or AWS DevOps Professional Certifications.
  • Self-starter with strong communication skills in a distributed work environment.
  • 5+ years of hands-on experience in site reliability engineering roles.
  • Expert knowledge of AWS services including API Gateway, Lambda, Aurora Serverless, OpenSearch Serverless, Secrets Manager, and FusionAuth.
  • Expertise in AWS security services including WAF, Shield, and GuardDuty.
  • Strong experience with monitoring tools such as CloudWatch, Prometheus, Grafana.
  • Proven ability to design effective monitoring strategies.
  • Willingness for a 24x7 on-call rotation.
  • Extensive experience with Terraform for infrastructure as code.
  • Experience in building and securing multi-tenant SaaS applications.
  • Experience with IDPs like FusionAuth, Okta, or Auth0.
  • Strong understanding of information security principles.
Responsibilities:
  • Design, implement, and maintain highly resilient and secure infrastructure for SaaS platform using AWS services.
  • Ensure application security using AWS security services and implement best practices.
  • Develop robust monitoring and alerting solutions for platform reliability.
  • Facilitate incident response, triage, and retrospective analysis.
  • Lead post-mortems to drive reliability improvements.
  • Introduce strategies for system resilience and performance optimization.
  • Establish SRE principles for the team.
  • Build infrastructure as code with Terraform.
  • Provide architectural input and collaborate with various teams.
  • Educate Engineering teams on reliability and security best practices.
  • Participate in Agile Development lifecycle.
Apply