Apply

Engineering Manager, Site Reliability

Posted 17 days agoViewed

View full description

💎 Seniority level: Manager, 2+ years

📍 Location: Canada

🔍 Industry: Software Development

🏢 Company: Replicant👥 101-250💰 $78,000,000 Series B about 3 years agoArtificial Intelligence (AI)Information ServicesData Center AutomationMachine LearningInformation TechnologySoftware

🗣️ Languages: English

⏳ Experience: 2+ years

🪄 Skills: AWSLeadershipAgileCloud ComputingKubernetesPeople ManagementCollaborationCI/CDRESTful APIsLinuxDevOpsTerraformMicroservices

Requirements:
  • You bring 2+ years of experience thoughtfully leading remote teams in reliability, infrastructure, or software engineering
  • You take ownership, guiding high-impact projects from idea to completion, often in close partnership with others
  • You have a strong track record of delivering critical infrastructure while supporting and coordinating across teams to meet shared goals
  • You bring a deep understanding of modern public cloud infrastructure, observability practices, and distributed systems, and enjoy sharing that knowledge with others
  • You communicate clearly, plan collaboratively, and bring creativity and curiosity to complex challenges
  • You’re skilled at turning abstract ideas into approachable, incremental plans that create alignment and shared understanding
  • You care about growing others and bring a thoughtful approach to career development—supporting team members through mentorship, growth opportunities, and promotions.
  • You value data-informed decision-making and are passionate about using insights to improve how your team and product evolve
  • You're comfortable using tools like JIRA, Google Sheets, and dashboards to support transparency and make collaborative decisions
  • You're committed to fostering team health, consistent progress, and shared learning through agile practices and a culture of continuous improvement
Responsibilities:
  • Support a collaborative working culture in a remote-first environment
  • Leverage your technical expertise to drive project success by shaping architecture, guiding design decisions, and maintaining code quality through regular reviews—while occasionally contributing hands-on to non-critical path initiatives
  • Coach and mentor your team by creating individualized development plans, tracking progress through measurable goals, and uniting the team around our company vision
  • Partner with internal teams and business partners to scale a robust cloud infrastructure, ensuring operational excellence and cost efficiency for our virtual agent platform
  • Cultivate a "shift left" culture for sustainable shared ownership of platform reliability, scalability, and performance
  • Oversee On-Call and Incident Management processes and procedures
  • Collaborate with Engineering Leadership to shape best practices and drive operational and technical excellence across teams
Apply