Site Reliability Engineer
New
A
accessoLeisure & Entertainment
Anywhere in the United KingdomFull-TimeMiddle
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Required Skills
- AWSDockerPythonBashGCPKubernetesAzureGrafanaPrometheusLinuxTerraformGitHub Actions
Requirements
- Professional exposure to Cloud Platforms (AWS/Azure/GCP)
- Practical Experience with Terraform
- Practical Experience with Docker
- Practical Experience with Cloud Managed Kubernetes (EKS/AKS/GKE)
- Practical Experience with monitoring tools
- Self-Managed training – learning new concepts, trialing them, and applying them
- Scripting ability using Python or Bash
- Familiarity with Linux systems and general command–line
- Understanding Ops and CI/CD concepts
- Good written and verbal communication; customer-focused approach
- Ability to work with minimal direction
- Willingness to learn, take direction, and work within a team
Responsibilities
- Provisioning and deploying accesso Horizon components to customer cloud accounts using Infrastructure as Code (Terraform) and ArgoCD
- Maintain, improve, and create CI/CD pipelines (GitHub Actions / ArgoCD) for application and infrastructure deployments
- Support monitoring, logging and alerting (Prometheus, Grafana, & Coralogix) and respond to alerts, along with acting as level 3 escalation
- Lead incident triage, root cause investigation, and follow-up tasks
- Follow security and compliance requirements for customer cloud environments (identity, secrets, network controls)
- Produce and maintain operational runbooks, deployment guides, and change notes
- Participate in monthly on-call rotation as an L3 responder
- Learn and apply accesso Horizon product architecture and configuration
View Full Description & ApplyYou'll be redirected to the employer's site