Senior Site Reliability Engineer - Observability

G
Grupo QuintoAndarReal Estate Technology
We work from home and can live anywhere in BrazilFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Experience
Minimum of 7 years of experience as a SRE, Platform Engineer or Infrastructure Engineer
Required Skills
PythonKubernetesGoGrafanaPrometheusLinuxTerraform

Requirements

  • Minimum 7 years of experience as SRE, Platform Engineer, or Infrastructure Engineer.
  • Experience in large-scale production environments.
  • Advanced knowledge of observability (metrics, logs, distributed tracing).
  • Hands-on experience with Prometheus, Grafana, and OpenTelemetry.
  • Strong proficiency with Kubernetes and cloud ecosystems.
  • Deep understanding of Linux, networking, and distributed systems troubleshooting.
  • Practical experience with Infrastructure as Code (Terraform or similar).
  • Development skills in Go, Python, or similar.

Responsibilities

  • Shape the strategy for metrics, logs, and distributed tracing.
  • Translate raw data into actionable business intelligence.
  • Improve the daily experience of hundreds of developers.
  • Ensure platform reliability at high scale.
  • Perform architectural design and high-impact technical decision-making.
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now