Senior Site Reliability Engineer - Observability
Grupo QuintoAndar
Real Estate Technology
We work from home and can live anywhere in Brazil
Full-Time
Senior
Salary not disclosed
Job Details
- Experience: Minimum of 7 years of experience as an SRE, Platform Engineer, or Infrastructure Engineer
- Required Skills: Python, Kubernetes, Go, Grafana, Prometheus, Linux, Terraform
Requirements
- Minimum of 7 years of experience as an SRE, Platform Engineer, or Infrastructure Engineer.
- Experience in large-scale production environments.
- Advanced knowledge of observability (metrics, logs, distributed tracing).
- Hands-on experience with Prometheus, Grafana, and OpenTelemetry.
- Strong proficiency with Kubernetes and cloud ecosystems.
- Deep understanding of Linux, networking, and distributed systems troubleshooting.
- Practical experience with Infrastructure as Code (Terraform or similar).
- Development skills in Go, Python, or a similar language.
Responsibilities
- Shape the strategy for metrics, logs, and distributed tracing.
- Translate raw data into actionable business intelligence.
- Improve the daily experience of hundreds of developers.
- Ensure platform reliability at high scale.
- Lead architectural design and make high-impact technical decisions.