Senior Site Reliability Engineer
New
M
MozillaOpen-source software
Remote Canada, Consistent overlap with Pacific Time zone working hours.Full-TimeSenior
Salary108,000 - 125,000 CAD per year
Apply NowOpens the employer's application page
Job Details
- Experience
- 7+ years
- Required Skills
- AWSKubernetesCI/CDTerraformGitHub Actions
Requirements
- 7+ years of experience in infrastructure, platform engineering, or site reliability roles.
- Hands-on production Kubernetes experience in workload operations, troubleshooting, and cluster management.
- Hands-on experience with infrastructure-as-code on AWS using Terraform, OpenTofu, or Pulumi.
- Security awareness: identity, least privilege, secrets hygiene, and network controls.
- Excellent async written communication skills.
- Ability to collaborate effectively with software engineers and non-engineering stakeholders.
- Ability to learn, evaluate, and responsibly use emerging technologies.
Responsibilities
- Operate and evolve our EKS-based Kubernetes platform, supporting service migrations, platform improvements, and reliability initiatives.
- Design and develop CI/CD systems supporting websites, services, and Thunderbird desktop releases.
- Write and maintain infrastructure in Pulumi and/or Terraform/OpenTofu across multiple AWS accounts.
- Operate and evolve our observability stack (VictoriaMetrics, VictoriaLogs, Grafana, Vector).
- Apply security-conscious infrastructure practices, including least-privilege IAM and secrets management.
- Diagnose and debug production incidents; drive root-cause analysis.
- Participate in on-call rotation and collaborate with SDEs.
- Contribute to runbooks and architecture documentation.
View Full Description & ApplyYou'll be redirected to the employer's site