Senior Software Engineer, Cloud Development
New
M
MozillaTechnology
Remote CanadaFull-TimeSenior
Salary$95,000 — $139,000 CAD
Apply NowOpens the employer's application page
Job Details
- Experience
- 4–6 years
- Required Skills
- PythonGCPKubernetesGrafanaPostgresTerraformHelmDistributed Systems
Requirements
- Bachelor's degree with 4–6 years of relevant industry experience, or Master's degree with significant hands-on experience, or work experience equivalent.
- Strong, modern Python skills, including dependency management, linting, formatting, type checks, and pre-commit hooks.
- Advanced experience with database deployment and management (Postgres preferred).
- Proven experience deploying and operating workloads in cloud environments (GCP and GKE).
- Hands-on experience with Kubernetes and Helm, including environment configuration and progressive rollouts.
- Experience with Terraform for provisioning infrastructure across environments.
- Experience designing and running scalable APIs, including auth and health monitoring.
- Experience with Grafana or similar metrics and dashboarding tools.
- Strong problem-solving skills for debugging performance and reliability issues in distributed systems.
- Clear and effective communication skills.
- On-call experience, including incident response and post-incident reviews.
Responsibilities
- Design, build, and operate core platform services and APIs used to deploy and serve production workloads at scale.
- Own service reliability end-to-end, driving improvements in availability, scalability, performance, and operational excellence.
- Lead efforts to optimize backend services for throughput, latency, and cost efficiency across distributed infrastructure.
- Design and manage Kubernetes-based workloads, including GitOps deployment pipelines, environment configuration, and resource utilization optimization.
- Own and improve critical parts of the service lifecycle, including packaging, versioning, testing strategies, validation, and deployment automation.
- Implement and evolve observability practices (metrics, logging, tracing, alerting) to improve visibility and operational resilience.
- Partner closely with product, infrastructure, security, and data teams to design scalable platform capabilities.
- Contribute to technical design discussions, propose architectural improvements, and mentor junior engineers.
- Participate in and help improve operational processes, including incident response, on-call rotations, and post-incident reviews.
View Full Description & ApplyYou'll be redirected to the employer's site