Staff Site Reliability & DevOps Engineer - Observability
C
CisionPR, Marketing, Social Media Management Technology
Remote - Hungary; Sofia, BulgariaFull-TimeStaff
Salary51000 - 65917 EUR per month
Apply NowOpens the employer's application page
Job Details
- Required Skills
- KubernetesGrafanaPrometheusLinuxTerraform
Requirements
- Strong experience with Prometheus
- Strong experience with Grafana
- Solid Linux and networking fundamentals
- Experience running observability stacks in Kubernetes environments
- Infrastructure as code experience (Terraform preferred)
- Familiarity with incident management and on-call practices
- Ability to debug production systems using metrics and logs
Responsibilities
- Design, build, and operate observability platforms based on Grafana and Prometheus
- Define and maintain metrics standards, dashboards, alerts, and SLOs
- Improve signal quality: reduce alert noise, tune thresholds, and improve runbooks
- Support incident response by providing actionable telemetry and post-incident analysis
- Integrate metrics, logs, and traces across distributed systems
- Work with engineering teams to instrument services correctly
- Automate observability configuration using infrastructure as code
- Contribute to reliability improvements through capacity planning and performance analysis
View Full Description & ApplyYou'll be redirected to the employer's site