ApplySite Reliability Engineer (Remote, US-based)
Posted 5 months agoViewed
View full description
💎 Seniority level: Senior, 5+ years
📍 Location: United States
🔍 Industry: Database DevOps
🏢 Company: Liquibase
🗣️ Languages: English
⏳ Experience: 5+ years
🪄 Skills: AWSAgileGrafanaPrometheusServerlessCommunication SkillsCollaborationCI/CDTerraform
Requirements:
- Prior SRE experience supporting a cloud-native SaaS platform with AWS.
- Bachelor’s degree in Computer Science, Software Engineering, or related field (or equivalent work experience).
- AWS Solutions Architect and/or AWS DevOps Professional Certifications.
- Self-starter with strong communication skills in a distributed work environment.
- 5+ years of hands-on experience in site reliability engineering roles.
- Expert knowledge of AWS services including API Gateway, Lambda, Aurora Serverless, OpenSearch Serverless, Secrets Manager, and FusionAuth.
- Expertise in AWS security services including WAF, Shield, and GuardDuty.
- Strong experience with monitoring tools such as CloudWatch, Prometheus, Grafana.
- Proven ability to design effective monitoring strategies.
- Willingness for a 24x7 on-call rotation.
- Extensive experience with Terraform for infrastructure as code.
- Experience in building and securing multi-tenant SaaS applications.
- Experience with IDPs like FusionAuth, Okta, or Auth0.
- Strong understanding of information security principles.
Responsibilities:
- Design, implement, and maintain highly resilient and secure infrastructure for SaaS platform using AWS services.
- Ensure application security using AWS security services and implement best practices.
- Develop robust monitoring and alerting solutions for platform reliability.
- Facilitate incident response, triage, and retrospective analysis.
- Lead post-mortems to drive reliability improvements.
- Introduce strategies for system resilience and performance optimization.
- Establish SRE principles for the team.
- Build infrastructure as code with Terraform.
- Provide architectural input and collaborate with various teams.
- Educate Engineering teams on reliability and security best practices.
- Participate in Agile Development lifecycle.
Apply