Staff Site Reliability Engineer
New
IndiaFull-TimeStaff
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Required Skills
- AWSPythonGCPKubernetesAzureGoGrafanaPrometheusDevOps
Requirements
- Extensive experience in Site Reliability Engineering or DevOps within high-scale SaaS environments.
- Strong programming or scripting skills in languages such as Python or Go.
- Hands-on experience with cloud infrastructure such as AWS, GCP, or Azure.
- Deep understanding of distributed systems, networking fundamentals (TCP/IP, DNS, HTTP/S), and Kubernetes.
- Proven experience managing production incidents, leading postmortems, and improving system reliability.
- Strong observability experience with logging, monitoring, and tracing tools.
- Excellent communication skills and problem-solving ability.
- Prior experience mentoring engineers and influencing architecture decisions.
Responsibilities
- Lead the design and implementation of reliability frameworks and self-service platforms that enable engineering teams to own the stability of their services, including “You Build It, You Run It” models.
- Act as Incident Commander during high-severity incidents, coordinating cross-functional response efforts, ensuring rapid resolution, and driving effective blameless postmortems.
- Architect and enhance observability solutions using modern tooling such as Prometheus, Grafana, and OpenTelemetry to improve detection and performance insights.
- Drive automation and AIOps initiatives to enable proactive detection, diagnostics, and remediation of system failures.
- Establish reliability engineering best practices across teams through production readiness reviews, design collaboration, and operational standards.
- Mentor engineers across SRE and product teams, strengthening technical capabilities and promoting a culture of operational excellence.
- Continuously improve system scalability, resilience, and performance across distributed cloud environments.
View Full Description & ApplyYou'll be redirected to the employer's site