Bachelor’s degree in Computer Science, Engineering, or equivalent experience
5+ years in software engineering or Infra/DevOps/SRE roles (Python or Go preferred)
Experience deploying cloud infrastructure via automation (e.g., Terraform, Pulumi, Bicep/ARM)
Incident management experience in cloud/software engineering
Hands-on experience operating production workloads in cloud environments
Familiarity with Kubernetes (AKS, GKE, or EKS)
Strong troubleshooting and root-cause analysis skills in distributed systems
Experience with observability platforms (e.g., Datadog, Prometheus/Grafana, OpenTelemetry)
Ability to define and implement metrics, dashboards, and alerting