Site Reliability Engineer

New
A
accessoLeisure & Entertainment
Anywhere in the United KingdomFull-TimeMiddle
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Required Skills
AWSDockerPythonBashGCPKubernetesAzureGrafanaPrometheusLinuxTerraformGitHub Actions

Requirements

  • Professional exposure to Cloud Platforms (AWS/Azure/GCP)
  • Practical Experience with Terraform
  • Practical Experience with Docker
  • Practical Experience with Cloud Managed Kubernetes (EKS/AKS/GKE)
  • Practical Experience with monitoring tools
  • Self-Managed training – learning new concepts, trialing them, and applying them
  • Scripting ability using Python or Bash
  • Familiarity with Linux systems and general command–line
  • Understanding Ops and CI/CD concepts
  • Good written and verbal communication; customer-focused approach
  • Ability to work with minimal direction
  • Willingness to learn, take direction, and work within a team

Responsibilities

  • Provisioning and deploying accesso Horizon components to customer cloud accounts using Infrastructure as Code (Terraform) and ArgoCD
  • Maintain, improve, and create CI/CD pipelines (GitHub Actions / ArgoCD) for application and infrastructure deployments
  • Support monitoring, logging and alerting (Prometheus, Grafana, & Coralogix) and respond to alerts, along with acting as level 3 escalation
  • Lead incident triage, root cause investigation, and follow-up tasks
  • Follow security and compliance requirements for customer cloud environments (identity, secrets, network controls)
  • Produce and maintain operational runbooks, deployment guides, and change notes
  • Participate in monthly on-call rotation as an L3 responder
  • Learn and apply accesso Horizon product architecture and configuration
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now