Apply📍 Canada
🧭 Full-Time
💸 111800.0 - 157300.0 CAD per year
🔍 Healthcare technology
- 8+ years of relevant work experience in the reliability field.
- Experience as a team lead or people manager, with a desire to advance in management.
- Experience with Cloud Providers, preferably GCP.
- Knowledge of monitoring systems such as Grafana/Prometheus.
- Experience with observability best practices.
- Familiarity with container orchestration systems like Kubernetes.
- Ability to write high-quality, testable code in languages like Python or Go.
- Experience with automation in critical production environments.
- Skill in analyzing and troubleshooting distributed systems.
- Lead the team in advancing reliability capabilities, including both reactive and proactive tools.
- Collaborate with cross-functional teams to adopt reliability tooling and processes.
- Report on and provide insights on the reliability of League's systems.
- Manage roadmap priorities, deadlines, and deliverables.
- Design, build, and improve capabilities for reliability.
- Support the adoption of Service Level Objectives and error budgets.
- Enhance measurement and monitoring of system health.
- Improve observability and incident response tooling.
- Mentor team members on reliability standards and practices.
PythonCloud ComputingGCPKubernetesGoGrafanaPrometheus
Posted 4 days ago
Apply