Senior Site Reliability Engineer
Z
ZipdevCloud Infrastructure
Brazil. Honduras. Uruguay, Meaningful overlap with US Eastern or Mountain time zonesFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Languages
- Strong written and spoken English
- Experience
- 5+ years
- Required Skills
- Microsoft AzureDevOpsTerraformDatadogHIPAA
Requirements
- 5+ years of experience in SRE, DevOps, or cloud infrastructure roles.
- Strong hands-on experience with Microsoft Azure (Azure SQL, Cosmos DB, Container Apps, App Service).
- Experience with observability tooling such as Datadog or Azure Monitor.
- Proficiency in on-call incident response and management.
- Familiarity with Infrastructure as Code, with a preference for Terraform.
- Experience working in HIPAA-regulated environments, including handling PHI under a Business Associate Agreement (BAA).
- Familiarity with least-privilege, audited access controls.
- Strong written and spoken English proficiency.
- Availability to maintain meaningful overlap with US Eastern or Mountain time zones.
- Willingness to complete a healthcare-industry-standard background check.
Responsibilities
- Implement and maintain observability tooling, including Datadog, Azure Monitor, and App Insights.
- Define SLOs, alert rules, and synthetic checks to monitor system performance.
- Participate in a PagerDuty on-call rotation for SEV-1 and SEV-2 incidents.
- Build and maintain operational runbooks for incident response, rollback, and recovery.
- Contribute to deployment automation using blue/green or canary patterns and Infrastructure as Code.
- Optimize performance and costs for Azure SQL and Cosmos DB environments.
- Collaborate with US-based engineering team members during overlapping working hours.
View Full Description & ApplyYou'll be redirected to the employer's site