Senior Manager, Site Reliability Engineering (EU CET)

Posted 11 days ago

💎 Seniority level: Manager

📍 Location: European Union, CET

🔍 Industry: Fraud Prevention

🏢 Company: SEON Technologies

🗣️ Languages: English

🪄 Skills: AWS, Docker, Python, Bash, Cloud Computing, Jenkins, Kubernetes, Grafana, Prometheus, CI/CD, RESTful APIs, Linux, DevOps, Terraform, Microservices, Networking, JSON, Ansible, Scripting

Requirements:
  • Proven success leading high-performing SRE or DevOps teams in a large-scale, fast-paced environment.
  • Extensive experience running high-availability web services at large scale, with comprehensive knowledge of cloud-native architectures and advanced networking concepts.
  • Strong technical background with hands-on experience in cloud computing, system architecture, automation, and monitoring.
  • Experience with tools and technologies such as AWS, Kubernetes, Terraform, Prometheus, Grafana, Jenkins, or similar.
Responsibilities:
  • Lead and grow a high-performing SRE team responsible for the reliability, performance, and scalability of production systems.
  • Own the incident management process, postmortems, and root cause analysis to improve system resilience.
  • Drive implementation of SLAs, SLOs, and error budgets across services to align operational goals with business objectives.
  • Champion the use of automation to reduce manual work and improve deployment and recovery times.
  • Collaborate with software engineering and platform engineering teams to ensure systems are designed for reliability and operational efficiency.
  • Oversee system monitoring, alerting, and observability efforts using tools like Prometheus, Grafana, Datadog, or similar.
  • Manage on-call rotations and ensure that documentation, runbooks, and playbooks are kept up to date.
  • Identify and drive continuous improvement in system architecture, capacity planning, and deployment strategies.
  • Ensure compliance with security, privacy, and regulatory requirements within the infrastructure.
  • Provide mentorship, performance reviews, and career development opportunities for SRE team members.
  • Communicate effectively with stakeholders at all levels, providing updates on team performance, project status, and incident resolutions.
  • Advocate for the SRE team within the broader organization, representing its needs and concerns.