Established experience operating high-throughput, low-latency distributed systems in cloud production environments
Strong coding skills for automation and systems development (Go, Python, or TypeScript)
Proven experience operating Kubernetes at scale (EKS preferred)
Experience applying IaC patterns (CDK, Terraform)
Working knowledge of GitOps and reconciliation loops
Solid experience with CI/CD systems (GitHub Actions, AWS CodePipeline)
Skilled in designing monitoring and alerting pipelines
Experience with large-scale streaming or ad-serving workloads
Understanding of cloud security best practices