8+ years of experience as a Site Reliability Engineer or similar role. Experience supporting high-traffic consumer-facing websites. Proficiency with Terraform is a must. Strong experience working with AWS cloud-based infrastructure and services. Proficiency with Docker and Kubernetes is essential. Solid understanding of cloud-native systems design, including CDNs, load balancers, cloud networking, DNS, caching, and distributed systems. Troubleshooting and debugging skills. Experience designing and supporting CI systems (e.g., CircleCI, Jenkins, GitHub Actions). Familiarity with monitoring and alerting best practices (e.g., Datadog, Cronitor, Sentry, PagerDuty). Proven experience in on-call management best practices. Excellent verbal and written communication skills. Comfortable working in an AI-forward environment.