Staff Site Reliability & DevOps Engineer - Observability

Cision
PR, Marketing, Social Media Management Technology
Remote - Hungary; Sofia, Bulgaria | Full-Time | Staff
Salary: 51000 - 65917 EUR per month

Job Details

Required Skills
Kubernetes, Grafana, Prometheus, Linux, Terraform

Requirements

  • Strong experience with Prometheus
  • Strong experience with Grafana
  • Solid Linux and networking fundamentals
  • Experience running observability stacks in Kubernetes environments
  • Infrastructure as code experience (Terraform preferred)
  • Familiarity with incident management and on-call practices
  • Ability to debug production systems using metrics and logs

Responsibilities

  • Design, build, and operate observability platforms based on Grafana and Prometheus
  • Define and maintain metrics standards, dashboards, alerts, and SLOs
  • Improve signal quality: reduce alert noise, tune thresholds, and improve runbooks
  • Support incident response by providing actionable telemetry and post-incident analysis
  • Integrate metrics, logs, and traces across distributed systems
  • Work with engineering teams to instrument services correctly
  • Automate observability configuration using infrastructure as code
  • Contribute to reliability improvements through capacity planning and performance analysis