Staff Software Engineer - Databases SRE
Grafana Labs, Observability Platform
This is a remote opportunity, and we are looking for candidates from the UK, Sweden, Spain, Germany, or Ireland.
Full-Time, Staff
Salary: €117,600 - €141,120
Job Details
- Experience: 8+ years engineering experience, 4+ in SRE/CRE/production engineering.
- Required Skills: Python, Cloud Computing, Kubernetes, Go, Linux, Terraform, Helm
Requirements
- 8+ years engineering experience, including 4+ years in SRE/CRE/production engineering.
- Strong experience with Kubernetes in AWS, GCP, or Azure.
- Proficiency with infrastructure-as-code tools such as Helm, Terraform, or Jsonnet.
- Experience operating multi-tenant systems in production.
- Strong experience designing and implementing Service Level Objectives (SLOs).
- Proficiency in one or more programming languages (e.g., Go, Python, Java).
- Solid understanding of Linux OS internals, networking, cloud storage, and scaling.
- Proven troubleshooting and problem-solving skills.
- Experience participating in blame-free incident response and writing high-quality Post-Incident Reviews.
- Strong technical leadership skills, including mentoring other engineers.
- Ability to partner effectively with product engineering teams.
Responsibilities
- Partner closely with product engineering squads in an embedded model.
- Own production reliability for high-SLA and complex customer environments.
- Design and implement automation to scale reliability practices and reduce toil.
- Ensure customer environments meet established SLO targets and define per-tenant reliability models.
- Serve as a primary escalation point and lead incident response and post-incident reviews.
- Contribute to design documentation and perform code reviews.
- Influence feature design to ensure production scalability and operability.
- Improve alert quality and reduce noisy escalations.