Apply

Manager, Software Engineering (Infrastructure Platform)

Posted 10 days agoViewed

View full description

💎 Seniority level: Manager

📍 Location: United Kingdom

💸 Salary: 140000.0 - 180000.0 GBP per year

🔍 Industry: Software Development

🏢 Company: Affirm👥 1001-5000💰 Post-IPO Equity about 4 years ago🫂 Last layoff about 2 years agoLendingFinancial ServicesPaymentsFinTech

🗣️ Languages: English

🪄 Skills: AWSDockerLeadershipPythonSQLBashCloud ComputingKubernetesPeople ManagementCross-functional Team LeadershipActiveMQApache KafkaGrafanaPrometheusCommunication SkillsCI/CDProblem SolvingAgile methodologiesRESTful APIsMentoringDevOpsTerraformMicroservicesJSONStrategic thinkingAnsible

Requirements:
  • Proven experience leading teams in platform engineering, infrastructure, or site reliability (SRE).
  • Deep expertise in reliability engineering, incident response, and operational tooling.
  • Strong decision-making skills, balancing technical depth with strategic business impact.
  • Experience defining and executing technical strategies for cloud computing, observability, CI/CD, and developer tooling.
  • Hands-on technical experience in infrastructure operations, with working knowledge of: Cloud platforms (AWS or equivalent), Kubernetes & container orchestration, Infrastructure as Code (Terraform, Helm, CloudFormation), Monitoring & Observability (Prometheus, Grafana, OpenTelemetry, or similar), CI/CD pipelines, automation, and software delivery best practices
Responsibilities:
  • Lead & Grow the Team: Manage, mentor, and develop engineers in Platform Engineering & SRE, fostering a culture of ownership, reliability, and automation.
  • Drive Operational Excellence: Improve incident response, on-call processes, and post-mortems to enhance system reliability and availability.
  • Scale & Automate Infrastructure: Advance self-healing automation, infrastructure as code (IaC), and CI/CD workflows to optimize efficiency and performance.
  • Enhance Observability & Performance: Improve monitoring, logging, and telemetry to proactively detect and resolve system issues.
  • Collaborate & Align Strategy: Work with Infrastructure, DevProd, and SRE teams to optimize multi-region deployments, cloud cost efficiency, and service scalability.
  • Foster Innovation & Inclusion: Promote best practices in reliability engineering, encourage automation-first approaches, and support team growth and internal mobility.
Apply