Senior Software Engineer, Cloud Development

Mozilla, Technology
Remote Canada, Full-Time, Senior
Salary: $95,000 to $139,000 CAD

Job Details

Experience
4–6 years
Required Skills
Python, GCP, Kubernetes, Grafana, Postgres, Terraform, Helm, Distributed Systems

Requirements

  • Bachelor's degree with 4–6 years of relevant industry experience, or Master's degree with significant hands-on experience, or equivalent work experience.
  • Strong, modern Python skills, including dependency management, linting, formatting, type checks, and pre-commit hooks.
  • Advanced experience with database deployment and management (Postgres preferred).
  • Proven experience deploying and operating workloads in cloud environments (GCP and GKE).
  • Hands-on experience with Kubernetes and Helm, including environment configuration and progressive rollouts.
  • Experience with Terraform for provisioning infrastructure across environments.
  • Experience designing and running scalable APIs, including auth and health monitoring.
  • Experience with Grafana or similar metrics and dashboarding tools.
  • Strong problem-solving skills for debugging performance and reliability issues in distributed systems.
  • Clear and effective communication skills.
  • On-call experience, including incident response and post-incident reviews.

Responsibilities

  • Design, build, and operate core platform services and APIs used to deploy and serve production workloads at scale.
  • Own service reliability end-to-end, driving improvements in availability, scalability, performance, and operational excellence.
  • Lead efforts to optimize backend services for throughput, latency, and cost efficiency across distributed infrastructure.
  • Design and manage Kubernetes-based workloads, including GitOps deployment pipelines, environment configuration, and resource utilization optimization.
  • Own and improve critical parts of the service lifecycle, including packaging, versioning, testing strategies, validation, and deployment automation.
  • Implement and evolve observability practices (metrics, logging, tracing, alerting) to improve visibility and operational resilience.
  • Partner closely with product, infrastructure, security, and data teams to design scalable platform capabilities.
  • Contribute to technical design discussions, propose architectural improvements, and mentor junior engineers.
  • Participate in and help improve operational processes, including incident response, on-call rotations, and post-incident reviews.