Experience in an SRE role Strong knowledge of cloud technologies and SLA SLO SLI management Excellent communication and leadership skills Ability to analyze and improve operational processes and performance metrics Experience in software design, automation, and root cause analysis On-call support experience and customer-focused mindset Collaborative attitude with commercial and technical teams Launching and operating production Kubernetes clusters Designing and operating infrastructure on AWS and other providers Operating MongoDB (or other document database) clusters Operating Redis (or other key-value storage) clusters Administering Linux servers Operating Prometheus and Grafana Operating logging collection and analysis system Participating in the on-call rotation (4:00am - 16:00pm UTC) Kubernetes (administrator) Go and/or Python (advanced) AWS/ EKS (advanced) Linux (advanced) Terraform and IaC in general (proficient) Helm (proficient) MongoDB (or similar) Redis (or similar) Monitoring – prometheus, grafana, thanos (familiar) Grasp of networking concepts (subnets, routing, peering, load balancing, NAT, etc.) Common networking protocols (DNS, TCP/IP, HTTP, TLS, UDP) Proactive, energetic, innovative and change oriented A desire to lead/mentor a team