University degree or college diploma in a technical program or equivalent work experience. 5+ years of experience as a Site Reliability Engineer. Proven experience with Terraform. Experience with Kubernetes. Experience with AWS. Demonstrated experience maintaining and improving an Incident Management process. Experience with a major observability platform (e.g., Prometheus, Grafana, Datadog, ELK Stack, Splunk, or New Relic). Experience with distributed systems. Experience with GitHub Actions for CI/CD. Experience in Backup and Recovery Scenarios. Ability to communicate efficiently and work collaboratively.