Senior Site Reliability Engineer (SRE)
New
T
The Investigo GroupTechnology
Remote -UK (possible paid occasional travel to TIG Secure site locations as required)Full-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Required Skills
- KubernetesGoGrafanaPrometheusCI/CDLinuxTerraformAnsible
Requirements
- Strong experience running production Kubernetes environments.
- Strong Linux fundamentals, including systemd, networking, storage and performance troubleshooting.
- Experience with at least one Kubernetes distribution such as OKD, OpenShift, vanilla Kubernetes, Rancher, EKS, AKS or GKE.
- Solid infrastructure as code experience, including Ansible plus Terraform or equivalent.
- GitOps and CI/CD experience managing full application and component lifecycles.
- Proficiency with observability tools such as Prometheus, Grafana, Elastic Stack / LGTM, OpenTelemetry.
- Experience with virtualisation platforms such as KVM, libvirt or VMware.
- Scripting or tooling experience using Go, Python, shell scripting.
- Eligible to hold UK SC clearance.
Responsibilities
- Operate, harden and extend production OpenShift / OKD / Kubernetes clusters across on-premises and hybrid environments.
- Support the migration from VMware to KVM, helping modernise the underlying compute and storage layer.
- Own and improve CI/CD processes across the full lifecycle of platform and application components.
- Develop and mature GitOps deployment practices using tools such as Argo CD or Flux.
- Maintain and improve core platform services including identity, ingress, observability, certificate management, service mesh and container registry capabilities.
- Automate repeatable operational tasks using tools such as Ansible, Terraform, Helm, Kustomize, Go, Python or equivalent technologies.
- Lead incident response activity, support blameless post-mortems and drive systemic fixes.
- Create and maintain clear technical documentation, runbooks, design notes and operational guidance.
- Mentor other engineers and act as a senior technical authority across cloud and Kubernetes operations.
View Full Description & ApplyYou'll be redirected to the employer's site