Senior Site Reliability Engineer - Network Monitoring
New
India, RemoteFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Experience
- 6+ years
- Required Skills
- PythonBashKubernetesLinuxTerraformHelm
Requirements
- 6+ years in an SRE, platform engineering, or equivalent DevOps / infrastructure role
- Deep, hands-on Kubernetes experience: cluster administration, RBAC, networking
- Proficiency with at least one infrastructure-as-code tool: Pulumi, Helm, Kustomize, or Terraform
- Very strong scripting and automation skills in Python and Bash
- Very Strong Linux background
- Experience with AI-assisted development tooling (Claude Code, GitHub Copilot, Codex, or equivalent)
- Solid network troubleshooting fundamentals
Responsibilities
- Deploying and managing Kubernetes clusters and workloads hosted on OpenStack
- Operating and improving GitOps delivery pipelines using Rancher Fleet
- Developing new and maintaining existing Python tools and libraries within our Pulumi-based infrastructure-as-code environment
- Supporting internal customers when they experience issues with our services or tooling
- Directly shape how we monitor the global GoDaddy network and how we deliver reliable, automated services to our internal customers
- Maintain and extend devcontainer based working environments with scripting, AI rule governance and custom CLI tools
View Full Description & ApplyYou'll be redirected to the employer's site